Apologies for the multiple postings.
----
-------------------------------------------------------
The
first shared task on Indian Language Summarization (ILSUM) aims at
creating an evaluation benchmark dataset for Indian Languages.
While large-scale datasets exist for a number of
languages like English, Chinese, French, German, Spanish, etc. no such
datasets
exist for any Indian languages. Through
this shared task, we aim to bridge the existing gap.
In the first
edition, we cover two major Indian languages Hindi and Gujarati
alongside Indian English, a widely recognized dialect of the English
Language. It is a classic summarization task, where we will provide
~10,000 article-summary pairs for each language and the participants are
expected to generate a fixed-length summary.
Timeline
-------------
8th June - Task announced and Registrations open
22nd June - Training Data Release
30th August - Test Data Release
10th September - Run Submission Deadline
15th September - Results Declared
5th October - Working notes due
9th-13th December - FIRE 2022 (Hybrid Event hosted at Kolkata)
Organisers
----------------
Bhavan Modha, University of Texas at Dallas, USA
Shrey Satapara, Indian Institute of Technology, Hyderabad, India
Sandip Modha, LDRP-ITR, Gandhinagar, India
Parth Mehta, Parmonic, USA