Hello,
We are happy to announce the release of the dataset for HASOC 2021. Please find more details below.
------------------------------------------------------------------------
Hate Speech and Offensive Content Identification (HASOC)
Task 1 - English and Indo-Aryan Languages
Datasets in English, Hindi and Marathi
Task on conversational Hate Speech including contextual information (Mixed Script)
---------------------------------------------------------------------------------------------------------------------
Task 1 Description:--------------------------
HASOC provides a forum and a data challenge for multilingual research on the identification of problematic content.
This
year, we offer again English, Marathi and Hindi with, altogether with
Thousands of annotated tweets from Twitter. Participants in this year’s shared task can choose to participate in one or two of the subtasks.
Sub-task A: Identifying Hate, offensive and profane content
Sub-task
A focus on Hate speech and Offensive language identification offered
for English, Marathi, Hindi. Sub-task A is a coarse-grained binary
classification in which participating systems are required to classify
tweets into two classes, namely: Hate and Offensive (HOF) and Non- Hate
and offensive (NOT).
- (NOT) Non-Hate-Offensive - This post does not contain any Hate speech, profane, offensive content.
- (HOF) Hate and Offensive - This post contains Hate, offensive, and profane content.
Sub-task B:- Discrimination between Hate, profane and offensive postsThis
sub-task is a fine-grained classification offered for English,
Marathi, Hindi.. Hate-speech and offensive posts from the sub-task A are
further classified into three categories.
- (HATE) Hate speech:- Posts under this class contain Hate speech content.
- (OFFN) Offensive:- Posts under this class contain offensive content.
- (PRFN) Profane:- These posts contain profane words.
-----------
Timeline main track
------------
22th July Task announcement, training data fully available
1 August Release of Training data
16 August Registration deadline (see a link to form at the website)
20 August Release of Test data
27 August Run submission
22 September Paper submission (Easychair)
22 October Notification and Reviews
27 October Revised system description paper submission
13-17 December FIRE takes place virtually, India.
December Accepted participant papers appear at CEUR WS
----------------Organisers----------------Thomas Mandl :- University of Hildesheim, GermanySandip Modha :- DA-IICT & LDRP-ITR, Gandhinagar, IndiaGautam Kishore Shahi: - University of Duisburg-Essen, GermanyDurgesh Nandini :- University of Bamberg, GermanyJohannes Schäfer: - University of Hildesheim, GermanyAmit Kumar Jaiswal: - University of Bedfordshire, UKPrasenjit Majumder :- DA-IICT, Gandhinagar, IndiaTharindu Ranasinghe :- University of Wolverhampton, UKMarcos Zampieri :- Rochester Institute of Technology, USA