HASOC 2021 task-2- The ICHCL task- Data available Register now

70 views

Skip to first unread message

sandip modha

unread,

Aug 7, 2021, 3:07:24 AM8/7/21

to fire...@googlegroups.com

Hello all,

We are happy to announce the release of the dataset for Identification of Conversational Hate-Speech in Code-Mixed Languages (ICHCL) HASOC 2021. Please find more details below.

------------------------------------------------------------------------

Hate Speech and Offensive Content Identification (HASOC)
Task 2 - Identification of Conversational Hate-Speech in Code-Mixed Languages (ICHCL)

Shared task at FIRE 2021, 13 - 17 December, Virtual Event: http://fire.irsi.res.in

Website: https://hasocfire.github.io/hasoc/2021/ichcl/index.html

Get register at: https://forms.gle/RDwsJdKTQNLVZp668

After the registration, you receive the password for the ICHCL dataset

ICHCL Task description

This proposal aims to study the various forms of problematic content such as aggressiveness, hate, offensive, abusive content in conversational dialogue (with context) on online platforms such as Twitter. Please look at the following screenshot.

The Source Tweet: Modi Ji COVID situation ko solve karne ke liye ideas maang rahe the. Mera idea hai resignation dedo please…
Translation : Modi ji (PM of India) was asking for ideas to solve the covid situation of India. My idea to him is to resign.

The Comment: Doctors aur Scientists se manga hai. Ch*tiyo se nahi. Baith niche. [HOF]
Translation: They have asked [advise from] Doctors and Scientists. Not f*ckers. Sit down. [HOF]

The reply: You totally nailed it, can’t stop laughing. [HOF]

This sub-task focused on the binary classification of such conversational tweets with tree-structured data into

(NOT) Non-Hate-Offensive - This tweet, comment, or reply does not contain any Hate speech, profane, offensive content.
(HOF) Hate and Offensive - This tweet, comment, or reply contains Hate, offensive, and profane content in itself or supports hate expressed in the parent tweet

The ICHCL dataset is manually sampled from controversial stories from the topic such as covid-crisis, religious intolerance that have a high probability of containing hate, offensive, and profane posts.

Please visit the track website for the task details, dataset, and timeline.

https://hasocfire.github.io/hasoc/2021/ichcl/index.html

Organizers

----------------
Thomas Mandl :- University of Hildesheim, Germany
Sandip Modha:- DA-IICT & LDRP-ITR, Gandhinagar, India
Prasenjit Majumder :- DA-IICT, Gandhinagar, India

Hiren Madhu - IISC Banglore

Shrey Satapara -DA-IICT

Regards

Sandip Modha

Reply all

Reply to author

Forward

0 new messages