HASOC 2021 task-2- The ICHCL task- Data available Register now

68 views
Skip to first unread message

sandip modha

unread,
Aug 7, 2021, 3:07:24 AM8/7/21
to fire...@googlegroups.com
Hello all,

We are happy to announce the release of the dataset for  Identification of Conversational Hate-Speech in Code-Mixed Languages (ICHCL) HASOC 2021. Please find more details below.
------------------------------------------------------------------------
Hate Speech and Offensive Content Identification  (HASOC)
Task 2 -   
 Identification of Conversational Hate-Speech in Code-Mixed Languages (ICHCL)
Shared task at FIRE 2021,  13 - 17 December, Virtual Event: http://fire.irsi.res.in
After the registration, you receive the password for the ICHCL dataset

ICHCL Task description
This proposal aims to study the various forms of problematic content such as aggressiveness, hate, offensive, abusive content in conversational dialogue (with context) on online platforms such as Twitter. Please look at the following screenshot. 


image.png


The Source Tweet: Modi Ji COVID situation ko solve karne ke liye ideas maang rahe the. Mera idea hai resignation dedo please… 
Translation : Modi ji (PM of India) was asking for ideas to solve the covid situation of India. My idea to him is to resign.

The Comment: Doctors aur Scientists se manga hai. Ch*tiyo se nahi. Baith niche. [HOF]
Translation: They have asked [advise from] Doctors and Scientists. Not f*ckers. Sit down. [HOF]

The reply: You totally nailed it, can’t stop laughing. [HOF]

This sub-task focused on the binary classification of such conversational tweets with tree-structured data into 

  • (NOT) Non-Hate-Offensive - This tweet, comment, or reply does not contain any Hate speech, profane, offensive content. 
  • (HOF) Hate and Offensive - This tweet, comment, or reply contains Hate, offensive, and profane content in itself or supports hate expressed in the parent tweet
The ICHCL dataset is manually sampled from controversial stories from the topic such as covid-crisis, religious intolerance that have a high probability of containing hate, offensive, and profane posts.

Please visit the track website for the task details, dataset, and timeline. 

Organizers
----------------
Thomas Mandl :- University of Hildesheim, Germany
Sandip Modha:- DA-IICT & LDRP-ITR, Gandhinagar, India
Prasenjit Majumder :- DA-IICT, Gandhinagar, India
Hiren Madhu - IISC Banglore
Shrey Satapara -DA-IICT



--
Regards
Sandip Modha

Reply all
Reply to author
Forward
0 new messages