Call for Participation in the 1st Shared Task on Multilingual Clause-level Morphology

Gozde Gul

May 26, 2022, 11:19:59 AM
to ml-...@googlegroups.com
Dear all,

This is to invite you to participate in the Multilingual Clause-level Morphology shared task, co-located with the 2nd Workshop on Multilingual Representation Learning (MRL) at EMNLP on 8 December 2022 in Abu Dhabi. More details are given below.

------------------------


Description and Objectives


Morphology has been widely studied as a word-level task, although in many languages it has complex hierarchical relationships with different layers of language, such as phonetic, syntactic or semantic representations of phrase-level or sentence-level utterances. The extent and complexity of this relationship, however, remain unknown. The new shared task on multilingual clause-level morphology aims to investigate methods for morphological analysis or generation of different forms in languages with varying typology, where the modeling and alignment of morphosyntactic structure is accomplished at the level of clauses.


The shared task aims to provide a new benchmark that can offer new insight into:

  • The relationship between morphology and syntax in different languages

  • How morphosyntactic structure aligns across languages with varying typology

  • The performance of conventional statistical methods for language modeling or representation learning in learning abstract linguistic features that can generalize across forms and languages

  • The limitations of conventional methods for morphological or syntactic modeling as well as the specifications required for developing more comprehensive and theoretically complete models of language


Languages


The shared task will initially include six languages from different language families and with varying morphological characteristics: English, French, German, Hebrew, Russian and Turkish. We anticipate the extension of the benchmark to include more languages as time and resources become available.


Tasks


The shared task comprises three sub-tasks.


Task 1: Inflection


In this task the input is a verbal lemma (the form given as a lexicon entry) and a specific set of inflectional features. The task is to generate the output clause that manifests these features.

Examples

English
  Input:  give IND;FUT;NOM(1,SG);ACC(3,SG,MASC);DAT(3,SG,FEM)
  Output: I will give him to her

German
  Input:  geben IND;FUT;NOM(1,SG);ACC(3,SG,MASC);DAT(3,SG,FEM)
  Output: Ich werde ihn ihr geben

Turkish
  Input:  vermek IND;FUT;NOM(1,SG);ACC(3,SG);DAT(3,SG)
  Output: Onu ona vereceğim

Hebrew
  Input:  נתן IND;FUT;NOM(1,SG);ACC(3,SG,MASC);DAT(3,SG,FEM)
  Output: אתן אותו לה
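
To make the feature notation in the examples concrete, here is a minimal Python sketch that splits a feature bundle into its parts. It assumes only what the examples show: features are separated by `;`, and argument features such as `NOM(1,SG)` carry comma-separated values in parentheses. The function name `parse_features` and the returned structure are illustrative choices, not part of the shared task's data format or tooling.

```python
import re
from typing import Dict, List, Tuple

# Matches an argument feature such as "NOM(1,SG)" or "ACC(3,SG,MASC)".
# Assumption: argument names are upper-case letters followed by "(...)".
_ARG = re.compile(r"^([A-Z]+)\(([^()]*)\)$")


def parse_features(bundle: str) -> Tuple[List[str], Dict[str, Tuple[str, ...]]]:
    """Split a bundle into flat tags (IND, FUT, NEG, ...) and argument values."""
    flags: List[str] = []
    args: Dict[str, Tuple[str, ...]] = {}
    for item in bundle.strip().split(";"):
        item = item.strip()
        if not item:
            continue  # tolerate stray or trailing semicolons
        match = _ARG.match(item)
        if match:
            values = tuple(v.strip() for v in match.group(2).split(","))
            args[match.group(1)] = values
        else:
            flags.append(item)
    return flags, args


flags, args = parse_features("IND;FUT;NOM(1,SG);ACC(3,SG,MASC);DAT(3,SG,FEM)")
print(flags)  # ['IND', 'FUT']
print(args)   # {'NOM': ('1', 'SG'), 'ACC': ('3', 'SG', 'MASC'), 'DAT': ('3', 'SG', 'FEM')}
```

Keeping flat tags and argument features apart makes it easy to see that the Turkish bundle simply omits gender values, while the English, German and Hebrew bundles include them.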



Task 2: Reinflection


In this task the input is an inflected clause, accompanied by its features, and a new set of features representing the desired form. The task is to generate the output clause that manifests the new features.


Examples

English
  Input:  I will give him to her IND;FUT;NOM(1,SG);ACC(3,SG,MASC);DAT(3,SG,FEM) IND;PRS;NOM(1,PL);ACC(2);DAT(3,PL);NEG
  Output: We don't give you to them

German
  Input:  Ich werde ihn ihr geben IND;FUT;NOM(1,SG);ACC(3,SG,MASC);DAT(3,SG,FEM) IND;PRS;NOM(1,PL);ACC(2,SG);DAT(3,PL);NEG
  Output: Wir geben dich ihnen nicht

Turkish
  Input:  Onu ona vereceğim IND;FUT;NOM(1,SG);ACC(3,SG);DAT(3,SG) IND;PRS;PROG;NOM(1,PL);ACC(2,SG);DAT(3,PL);NEG
  Output: Seni onlara vermiyoruz

Hebrew
  Input:  אתן אותו לה IND;FUT;NOM(1,SG);ACC(3,SG,MASC);DAT(3,SG,FEM) IND;PRS;NOM(1,PL,MASC);ACC(2,SG,MASC);DAT(3,PL,FEM);NEG
  Output: אנחנו לא נותנים אותך להן
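
As a rough illustration of what a reinflection input contains, the Python sketch below splits an input line into the source clause, the source features and the target features, and then lists which features change. It assumes the layout shown in the examples (clause first, then two feature bundles, with no whitespace inside a bundle). The released data files may use a different delimiter, and the names `ReinflectionInput`, `split_reinflection_input` and `feature_changes` are hypothetical.

```python
from typing import NamedTuple, Set, Tuple


class ReinflectionInput(NamedTuple):
    clause: str
    source_features: str
    target_features: str


def split_reinflection_input(line: str) -> ReinflectionInput:
    # Assumption: feature bundles contain no whitespace, so the last two
    # whitespace-separated tokens are the source and target bundles and
    # everything before them is the inflected clause.
    clause, source, target = line.strip().rsplit(None, 2)
    return ReinflectionInput(clause, source, target)


def feature_changes(source: str, target: str) -> Tuple[Set[str], Set[str]]:
    """Return (features to drop, features to add) going from source to target."""
    src = {f for f in source.split(";") if f}
    tgt = {f for f in target.split(";") if f}
    return src - tgt, tgt - src


example = split_reinflection_input(
    "I will give him to her "
    "IND;FUT;NOM(1,SG);ACC(3,SG,MASC);DAT(3,SG,FEM) "
    "IND;PRS;NOM(1,PL);ACC(2);DAT(3,PL);NEG"
)
dropped, added = feature_changes(example.source_features, example.target_features)
print(example.clause)   # I will give him to her
print(sorted(dropped))  # ['ACC(3,SG,MASC)', 'DAT(3,SG,FEM)', 'FUT', 'NOM(1,SG)']
print(sorted(added))    # ['ACC(2)', 'DAT(3,PL)', 'NEG', 'NOM(1,PL)', 'PRS']
```

Only `IND` is shared between the two bundles in this example, which shows how much of the clause a system may need to regenerate in a single reinflection step.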



Task 3: Analysis


This task is the inverse of Task 1: a system is required to analyze a given clause and generate the lemma and features underlying it.


Examples

English
  Input:  I will give him to her
  Output: give IND;FUT;NOM(1,SG);ACC(3,SG,MASC);DAT(3,SG,FEM)

German
  Input:  Ich werde ihn ihr geben
  Output: geben IND;FUT;NOM(1,SG);ACC(3,SG,MASC);DAT(3,SG,FEM)

Turkish
  Input:  Onu ona vereceğim
  Output: vermek IND;FUT;NOM(1,SG);ACC(3,SG);DAT(3,SG)

Hebrew
  Input:  אתן אותו לה
  Output: נתן IND;FUT;NOM(1,SG);ACC(3,SG,MASC);DAT(3,SG,FEM)



Participation


Interested parties are invited to join the mailing list at participants-mc...@googlegroups.com to be involved in the competition. 


All participating systems will be evaluated together with our baselines against the same held-out test set, to be released shortly before evaluation. Submitted systems can compete in some or all sub-tasks.


Participating teams will be invited to submit a short paper describing their work to the MRL workshop and to present it in a special session at the workshop.


Important Dates


May 16, 2022: Release of training and development data

July 20, 2022: Release of testing data

July 30, 2022: Deadline for submission of systems

August 15, 2022: Release of rankings and results

September 7, 2022: Deadline for submitting system description papers


Evaluation


System outputs will be evaluated using standard metrics for morphological analysis and inflection, including exact-match scores (precision, recall and F1), as well as metrics for generated text such as edit distance.
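
For participants who want a quick local check during development, the following Python sketch gives one plausible reading of the metrics named above. It is not the official scorer: the exact definitions used for the ranking are not given in this call, and computing precision, recall and F1 over the set of `;`-separated features is an assumption made here for illustration. All function names are hypothetical.

```python
from typing import Sequence, Tuple


def exact_match_accuracy(preds: Sequence[str], golds: Sequence[str]) -> float:
    """Fraction of predictions that are string-identical to the reference."""
    if len(preds) != len(golds):
        raise ValueError("predictions and references differ in length")
    if not golds:
        return 0.0
    return sum(p == g for p, g in zip(preds, golds)) / len(golds)


def edit_distance(a: str, b: str) -> int:
    """Character-level Levenshtein distance (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                 # delete ca
                            curr[j - 1] + 1,             # insert cb
                            prev[j - 1] + (ca != cb)))   # substitute or match
        prev = curr
    return prev[-1]


def feature_prf(pred: str, gold: str) -> Tuple[float, float, float]:
    """Precision, recall and F1 over feature sets (an assumed definition)."""
    p = {f for f in pred.split(";") if f}
    g = {f for f in gold.split(";") if f}
    tp = len(p & g)
    precision = tp / len(p) if p else 0.0
    recall = tp / len(g) if g else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


print(edit_distance("Wir geben dich ihnen", "Wir geben dich ihnen nicht"))  # 6
print(feature_prf("IND;FUT;NOM(1,SG)", "IND;PRS;NOM(1,SG)"))
```

Exact match and edit distance apply most naturally to generated clauses (Tasks 1 and 2), while the feature-level scores suit predicted analyses (Task 3).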


Organizers


Omer Goldman, Bar Ilan University

Reut Tsarfaty, Bar Ilan University

Djame Seddah, University Paris-Sorbonne

Benjamin Muller, University Paris-Sorbonne

Hila Gonen, University of Washington and Meta AI

Jamshidbek Mirzakhalov, Salesforce

Kelechi Ogueji, University of Waterloo

Francesco Tinner, University of Zurich

Duygu Ataman, New York University



Contact

mrlw...@gmail.com

---------------


On behalf of the MRL 2022 Organizers,


Asst. Prof. Gözde Gül Şahin,

Koç University, KUIS AI Fellow

Istanbul/Turkey

https://gozdesahin.github.io

