Hi all,
Thanks so much for your patience this far. We're ready to go!
On the shared task page, you can now download new items:
- a README describing the entity types
- development data, with BIO labels
- test data, with no labels
In addition, the timeline is set! It's compact, so please be mindful of this. Papers should be around four pages in length, EMNLP format; formal details will go up soon.
- Test data released: 21 June 2017
- Result submission: 28 June 2017
- Shared-task results and gold annotations for test data: 30 June 2017
- System Description Papers Due: 7 July 2017
- Reviews Returned: 15 July 2017
- Camera Ready Deadline: 19 July 2017
- Workshop Date: Sep 7 2017
Good luck! The data is tough and dirty, and the scoring is aggressive, so please bring your best tools to the table. The test set draws from a mixture of sources: Twitter, Reddit and Stack Overflow. The development set draws from YouTube comments. Note that, as with most user-generated text, there is vulgar language in some of this data. All deadlines are in the latest timezone possible, so UTC+12. You have
one week.
All the best,
Leon Eric Marieke and Nut