Final CfP & CLARIN workshop - cmccorpora17

24 views

Skip to first unread message

Ciara Wigham

unread,

May 24, 2017, 4:31:36 AM5/24/17

to tei-cmc

Final Call for Papers (deadline extended) and information about a Post-conference CLARIN workshop: How to use TEI for the annotation of CMC and social media resources: a practical introduction

5th Conference on CMC and Social Media Corpora for the Humanities
(cmccorpora17)

http://cmc-corpora2017.eurac.edu/

The 5th conference on CMC and Social Media Corpora for the Humanities will be held in Bolzano/Bozen, Italy on 3-4 October 2017 and will focus on the collection, analysis and processing of mono and multimodal, synchronous and asynchronous communications. The focus will encompass different CMC genres. These include, but are not limited to, discussion forums, blogs, newsgroups, emails, SMS and WhatsApp, text chats, wiki discussions, social network exchanges (such as Facebook, Twitter, Linkedin), discussions in multimodal and/or 3D environments (virtual worlds, gaming worlds).

The conference will bring together researchers who are interested in the collection, organization, processing, analysis and sharing of CMC data for research purposes. We invite submissions on corpus analysis of various types of CMC data for linguistic or applied linguistic purposes and Natural Language Processing.

The conference is hosted by Eurac Research and will include a post-conference workshop on using the TEI for annotating CMC and social media resources (5 October). It will be followed by the 4th Learner Corpus Research Conference which will be held at the same venue from 5-7 October.

Topics of interest

Development of CMC corpora

Building CMC corpora: from data collection to publication
Open data for research on CMC: questions of ethics and rights
Annotation of CMC genres: representation of CMC genres, annotation of linguistic phenomena, metadata
Multimodal corpora

Analysis of CMC corpora

Sociolinguistic studies of CMC
Discourse analysis of CMC
Linguistic characteristics of CMC
Multimodal aspects of CMC
Language in contact and code-switching in CMC
CMC in language learning & teaching

Natural Language Processing of CMC

Normalization
PoS Tagging
Lemmatization
Syntactic parsing
Named-entity recognition

Submission procedure

We invite submissions for papers, posters and software/corpus demonstrations on any topic relevant to the above list of themes. For this conference, we are requesting extended abstracts (2-4 pages) in English. All abstracts will be peer-reviewed by the scientific committee. All submissions should follow the template which you can download here:

MSWord Template (Example document)
Libre-/Openoffice Template (Example document)
LaTeX Package (Example included)

Please submit your paper via the online conference system.

Paper presentations will consist of a 20 minute talk followed by 10 minutes for questions and discussion. The poster presentationand software/corpus demonstration session will be opened with each presenter/demonstrator giving a one-minute ‘teaser talk.’ Accepted papers will be published in online proceedings before the conference. After the conference, authors of best-reviewed papers will be invited to submit extended versions of their papers to be published in an edited monograph to appear in 2018.

Important dates

21st June	Submission deadline
21st July	Notification of acceptance
21st August	Submission of camera-ready version
3rd & 4th October	Conference
5th October	Post-conference workshop (CLARIN tutorial)

Post-conference workshop:
CLARIN Tutorial: How to use TEI for the annotation of CMC and social media resources: a practical introduction

October 5th, 2017, 10:30–14:00, Eurac Research, Italy

The goal of the event is to give a practical introduction into the annotation of language data from genres of computer-mediated communication (CMC) and social media using the formats of the Text Encoding Initiative (TEI). In an introductory section participants will learn about the general architecture of TEI encoding schemas and about rules for the creation of so-called customizations which allow for extending the use of TEI with textual genres and in domains which are not yet covered by the current version of the TEI guidelines. Examples for TEI customizations are the representation schemas for CMC/social media genres developed in the TEI special interest group “computer-mediated communication”.

In a hands-on session, participants will learn how to use these customizations to create a basic TEI representation for their own CMC/social media data. For this purpose participants may bring samples from their own data/corpora or select a sample from collections of Wikipedia talk pages in several languages prepared by the instructors. Format specifications for participants’ own data will be announced in advance. For the hands-on session, participants will be asked to bring a laptop computer with WLAN and a full or trial license of the oXygen XML editor.

The tutorial is funded as a CLARIN User Involvement Event. Registration is free. Details about the tutorial program will be announced via http://cmc-corpora2017.eurac.edu/uievent/.

The tutorial is held by:

Harald Lüngen (Institute for the German Language, Mannheim, Germany)
Michael Beißwenger (Universität Duisburg Essen, Germany)
Laura Herzberg (University of Mannheim, Germany)

The event is organized by:

Michael Beißwenger (University of Duisburg-Essen, Germany)
Ciara R. Wigham (Université Clermont Auvergne, France)
Egon W. Stemle (Eurac Research, Italy)

Registration

Registration for the conference and for the post-conference workshop will be open as of June 30, 2017. The registration fee for regular participants is 75 EUR and includes conference materials, coffee breaks, and lunch.

There will be no special registration fee for the workshop.

More information about registration will be announced via
http://cmc-corpora2017.eurac.edu/registration/.

Scientific and Organizing Committee of cmccorpora17

Chair

Ciara R. Wigham (Université Clermont Auvergne, France)

Co-Chairs

Michael Beißwenger (University of Duisburg-Essen, Germany)

Darja Fišer (University of Ljubljana, Slovenia)

Members

Andrea Abel (Eurac Research, Italy)

Steven Coats (University of Oulu, Finland)

Daria Dayter (University of Basel, Switzerland)

Tomaž Erjavec (Jožef Stefan Institute, Slovenia)

Jennifer Frey (Università di Bologna, Italy)

Aivars Glaznieks (Eurac Research, Italy)

Axel Herold (Berlin-Brandenburgische Akademie der Wissenschaften, Germany)

Dawn Knight (Cardiff University, United Kingdom)

Julien Longhi (Université de Cergy-Pontoise, France)

Harald Lüngen (Institut für Deutsche Sprache, Germany)

Maja Miličević (University of Belgrade, Serbia)

María-Teresa Ortego-Antón (University of Valladolid, Spain)

Céline Poudat (University of Nice Sophia Antipolis, France)

Muge Satar (Newcastle University, United Kingdom)

Stefania Spina (University for Foreigners, Italy)

Egon W. Stemle (Eurac Research, Italy)

Angelika Storrer (Universitaet Mannheim, Germany)

Organizing Committee

Egon W. Stemle (Eurac Research, Italy)

Daniela Gasser (Eurac Research, Italy)

Reply all

Reply to author

Forward

0 new messages