TAYSIR: The competition is on

Rémi Eyraud

Mar 23, 2023, 11:24:16 AM
to ml-...@googlegroups.com

TAYSIR: Transformers+RNN: Algorithms to Yield Simple and Interpretable Representations

https://remieyraud.github.io/TAYSIR/ 


CALL FOR PARTICIPATION: competition is fully on!


The Transformers+RNN: Algorithms to Yield Simple and Interpretable Representations (TAYSIR) competition is an on-line challenge on extracting simpler models from already trained neural networks. These neural nets are trained on tasks involving sequences of symbols. Some of these tasks are artificial and some come from real-world problems in domains such as natural language processing (NLP), bioinformatics, and software engineering. Taysir means "simple" in Arabic.


The quality of the extracted models is evaluated in two ways:

  • How well the extracted model approximates the original model

  • The simplicity of the extracted model as measured by assorted metrics
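
As an illustration of the first criterion only, here is a minimal sketch of how the agreement between an original and an extracted binary classifier could be estimated on a sample of sequences. The function names, the toy models, and the choice of a plain disagreement rate are assumptions made for this example; the official metrics are the ones described on the competition website.

```python
from typing import Callable, Iterable, Sequence

# A model maps a sequence of integer-encoded symbols to 0 or 1.
Model = Callable[[Sequence[int]], int]


def disagreement_rate(original: Model, extracted: Model,
                      samples: Iterable[Sequence[int]]) -> float:
    """Fraction of sample sequences on which the two models disagree."""
    samples = list(samples)
    if not samples:
        raise ValueError("need at least one sample sequence")
    mismatches = sum(original(s) != extracted(s) for s in samples)
    return mismatches / len(samples)


# Hypothetical stand-in for a trained network: accepts sequences
# containing an even number of 1s.
def toy_original(seq: Sequence[int]) -> int:
    return int(sum(seq) % 2 == 0)


# Hypothetical (deliberately imperfect) extracted model: accepts the
# empty sequence and any sequence ending in 0.
def toy_extracted(seq: Sequence[int]) -> int:
    return int(len(seq) == 0 or seq[-1] == 0)


samples = [(), (0,), (1,), (1, 1), (0, 1), (1, 0)]
rate = disagreement_rate(toy_original, toy_extracted, samples)
print(f"disagreement rate: {rate:.3f}")
```

A lower rate means the extracted model is a closer approximation on the chosen sample; how the sample is drawn strongly affects the estimate.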


There are two tracks in the competition corresponding to the kind of function the trained neural networks produce. 

  • Neural nets trained for binary classification. These networks represent functions Σ* ➝ {0,1}. This task can be thought of as extracting models for formal languages.

  • Neural nets trained for language modeling and used as density estimators. These networks represent functions Σ* ➝ ℝ. 
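
To make the two function types concrete, here is a minimal sketch of one simple model of each kind: a deterministic finite automaton computing a function Σ* ➝ {0,1}, and a weighted automaton computing a function Σ* ➝ ℝ. Both classes, their parameters, and the toy instances are illustrative assumptions and are not taken from the competition material.

```python
from typing import Dict, List, Sequence, Set, Tuple


class DFA:
    """Deterministic finite automaton: maps a sequence to 0 or 1."""

    def __init__(self, start: int, accepting: Set[int],
                 delta: Dict[Tuple[int, int], int]):
        self.start, self.accepting, self.delta = start, accepting, delta

    def __call__(self, seq: Sequence[int]) -> int:
        state = self.start
        for symbol in seq:
            state = self.delta[(state, symbol)]
        return int(state in self.accepting)


class WeightedAutomaton:
    """Weighted automaton: f(x1..xn) = alpha^T A[x1] ... A[xn] beta."""

    def __init__(self, alpha: List[float], beta: List[float],
                 transitions: Dict[int, List[List[float]]]):
        self.alpha, self.beta, self.transitions = alpha, beta, transitions

    def __call__(self, seq: Sequence[int]) -> float:
        vec = list(self.alpha)
        for symbol in seq:
            mat = self.transitions[symbol]
            # Row vector times matrix, written out in pure Python.
            vec = [sum(vec[i] * mat[i][j] for i in range(len(vec)))
                   for j in range(len(mat[0]))]
        return sum(v * b for v, b in zip(vec, self.beta))


# Toy DFA over the alphabet {0, 1}: accepts an even number of 1s.
parity = DFA(start=0, accepting={0},
             delta={(0, 0): 0, (0, 1): 1, (1, 0): 1, (1, 1): 0})

# Toy one-state weighted automaton over {0, 1}: each symbol is emitted
# with weight 0.25 and the sequence ends with weight 0.5.
uniform = WeightedAutomaton(alpha=[1.0], beta=[0.5],
                            transitions={0: [[0.25]], 1: [[0.25]]})

print(parity((1, 0, 1)), uniform((0, 1)))
```

In this toy weighted automaton the weights of all sequences sum to 1, so it can be read as a density estimator over Σ*.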


Each track consists of about 10 trained models. The models were trained in PyTorch and are also available in an MLFlow format for compatibility with other frameworks.


The competition has started and will last until April 30th 2023.


Half a day will be dedicated to the competition results during the 16th International Conference on Grammatical Inference (ICGI 2023), to be held in July 2023 at the Faculty of Sciences, Mohammed V University in Rabat, Morocco.

http://www.fsr.ac.ma/icgi2023/


Participants in TAYSIR are encouraged to attend ICGI 2023 and to submit, by May 15th, an extended abstract presenting their work (2 to 4 pages, including appendices). These abstracts will be appended to the proceedings of ICGI (publisher: PMLR) in a track dedicated to the competition, and will be peer-reviewed primarily for clarity of presentation.


HOW TO PARTICIPATE

Everything can be found on our website: https://remieyraud.github.io/TAYSIR/ 

The submission website for the binary classification Track is here: https://codalab.lisn.upsaclay.fr/competitions/11249 

The submission website for the language modeling Track is here: https://codalab.lisn.upsaclay.fr/competitions/11683 

You will need to register at the submission websites.


Trained neural nets are provided as MLFlow models. After running their extraction algorithms, participants upload their extracted models in MLFlow format to the website for evaluation (we provide a submission toolkit to help).


MAIN ORGANIZERS

  • Chihiro Shibata, Hosei University, Japan

  • Jeffrey Heinz, Stony Brook University, New York, USA

  • Rémi Eyraud, Université Jean Monnet, Saint-Etienne, France

  • Dakotah Lambert, Université Jean Monnet, Saint-Etienne, France


DEV TEAM

  • Aidar Gaffarov, MLDM Master program

  • Badr Tahri Joutei, MLDM Master program

  • Mathias Cabanne, Eura NOVA


SCIENTIFIC COMMITTEE

  • Ariadna Quattoni, Universitat Politècnica de Catalunya

  • Bob Frank, Yale University, USA

  • Borja Balle, DeepMind

  • François Coste, INRIA Rennes

  • Jean-Christophe Janodet, Université Paris-Saclay

  • Matthias Gallé, Cohere

  • Sicco Verwer, TU Delft
