The 5th conference on CMC and Social Media Corpora for the Humanities will be held in Bolzano/Bozen, Italy on 3-4 October 2017 and will focus on the collection, analysis and processing of mono and multimodal, synchronous and asynchronous communications. The focus will encompass different CMC genres. These include, but are not limited to, discussion forums, blogs, newsgroups, emails, SMS and WhatsApp, text chats, wiki discussions, social network exchanges (such as Facebook, Twitter, Linkedin), discussions in multimodal and/or 3D environments (virtual worlds, gaming worlds).
The conference will bring together researchers who are interested in the collection, organization, processing, analysis and sharing of CMC data for research purposes. We invite submissions on corpus analysis of various types of CMC data for linguistic or applied linguistic purposes and Natural Language Processing.
The conference is hosted by Eurac Research and will include a post-conference workshop on using the TEI for annotating CMC and social media resources (5 October). It will be followed by the 4th Learner Corpus Research Conference which will be held at the same venue from 5-7 October.
We invite submissions for papers, posters and software/corpus demonstrations on any topic relevant to the above list of themes. For this conference, we are requesting extended abstracts (2-4 pages) in English. All abstracts will be peer-reviewed by the scientific committee. All submissions should follow the template which you can download here:
Please submit your paper via the online conference system.
Paper presentations will consist of a 20 minute talk followed by 10 minutes for questions and discussion. The poster presentationand software/corpus demonstration session will be opened with each presenter/demonstrator giving a one-minute ‘teaser talk.’ Accepted papers will be published in online proceedings before the conference. After the conference, authors of best-reviewed papers will be invited to submit extended versions of their papers to be published in an edited monograph to appear in 2018.
21st June | Submission deadline |
21st July | Notification of acceptance |
21st August | Submission of camera-ready version |
3rd & 4th October | Conference |
5th October | Post-conference workshop (CLARIN tutorial) |
October 5th, 2017, 10:30–14:00, Eurac Research, Italy
The goal of the event is to give a practical introduction into the annotation of language data from genres of computer-mediated communication (CMC) and social media using the formats of the Text Encoding Initiative (TEI). In an introductory section participants will learn about the general architecture of TEI encoding schemas and about rules for the creation of so-called customizations which allow for extending the use of TEI with textual genres and in domains which are not yet covered by the current version of the TEI guidelines. Examples for TEI customizations are the representation schemas for CMC/social media genres developed in the TEI special interest group “computer-mediated communication”.
In a hands-on session, participants will learn how to use these customizations to create a basic TEI representation for their own CMC/social media data. For this purpose participants may bring samples from their own data/corpora or select a sample from collections of Wikipedia talk pages in several languages prepared by the instructors. Format specifications for participants’ own data will be announced in advance. For the hands-on session, participants will be asked to bring a laptop computer with WLAN and a full or trial license of the oXygen XML editor.
The tutorial is funded as a CLARIN User Involvement Event. Registration is free. Details about the tutorial program will be announced via http://cmc-corpora2017.eurac.edu/uievent/.
Harald Lüngen (Institute for the German Language, Mannheim, Germany)
Michael Beißwenger (Universität Duisburg Essen, Germany)
Laura Herzberg (University of Mannheim, Germany)
Michael Beißwenger (University of Duisburg-Essen, Germany)
Ciara R. Wigham (Université Clermont Auvergne, France)
Egon W. Stemle (Eurac Research, Italy)
Registration for the conference and for the post-conference workshop will be open as of June 30, 2017. The registration fee for regular participants is 75 EUR and includes conference materials, coffee breaks, and lunch.
There will be no special registration fee for the workshop.
More information about registration will be announced via
http://cmc-corpora2017.eurac.edu/registration/.
Ciara R. Wigham (Université Clermont Auvergne, France)
Michael Beißwenger (University of Duisburg-Essen, Germany)
Darja Fišer (University of Ljubljana, Slovenia)
Andrea Abel (Eurac Research, Italy)
Steven Coats (University of Oulu, Finland)
Daria Dayter (University of Basel, Switzerland)
Tomaž Erjavec (Jožef Stefan Institute, Slovenia)
Jennifer Frey (Università di Bologna, Italy)
Aivars Glaznieks (Eurac Research, Italy)
Axel Herold (Berlin-Brandenburgische Akademie der Wissenschaften, Germany)
Dawn Knight (Cardiff University, United Kingdom)
Julien Longhi (Université de Cergy-Pontoise, France)
Harald Lüngen (Institut für Deutsche Sprache, Germany)
Maja Miličević (University of Belgrade, Serbia)
María-Teresa Ortego-Antón (University of Valladolid, Spain)
Céline Poudat (University of Nice Sophia Antipolis, France)
Muge Satar (Newcastle University, United Kingdom)
Stefania Spina (University for Foreigners, Italy)
Egon W. Stemle (Eurac Research, Italy)
Angelika Storrer (Universitaet Mannheim, Germany)
Egon W. Stemle (Eurac Research, Italy)
Daniela Gasser (Eurac Research, Italy)