CALL FOR TASK PARTICIPATION—SHINRA2020-ML Classification Task

5 views
Skip to first unread message

Yue Li

unread,
Apr 16, 2020, 12:26:28 AM4/16/20
to SIngapore Natural language processing special interest Group (SING)

========================================================

         CALL FOR TASK PARTICIPATION

      SHINRA2020-ML Classification Task
      http://shinra-project.info/shinra2020ml/?lang=en

 Data release: January 2020
 Registration & Result submission deadline: July 31, 2020              
 NTCIR-15 Conference: December 2020        
========================================================



*OVERVIEW

SHINRA is a resource creation project started in the year 2017, aiming to structure the knowledge in Wikipedia. SHINRA2020-ML is the first shared-task of text classification in SHINRA project, tackling the problem of classifying 30 language Wikipedia entities in fine-grained categories. The task is conducted as one of the NTCIR-15 tasks.

  [Video] (approx.11 min):
  Introduction of SHINRA2020-ML task
  (categorization of 30-language Wikipedia into ENE)
  https://youtu.be/yON7uECDWWo  



*TASK

The task is to classify 30 language (*1) Wikipedia pages into 219 categories in Extended Named Entity, using categorized Japanese Wikipedia pages and the interlanguage links to the corresponding pages in target languages.

The participants are expected to select one or more target languages, and for each language, use the Wikipedia pages linked from the categorized Japanese pages as the training data, and run the system to classify the remaining pages which are not linked from the Japanese pages.

After the task is over, we (including the participants) will combine the results by all the participants (i.e. by Ensemble learning), and publish the results to the public.


It is a scheme called “Resource by Collaborative Contribution (RbCC)” and we are expecting many participants with a good will.


  (*1): The 30 languages are English, Spanish, French, German, Chinese, Russian, Portuguese, Italian, Arabic, Indonesian, Turkish, Dutch, Polish, Persian, Swedish, Vietnamese, Korean, Hebrew, Romanian, Norwegian, Czech, Ukrainian, Hindi, Finnish, Hungarian, Danish, Thai, Catalan, Greek, Bulgarian.



*IMPORTANT DATES

January 2020  Data release
July 31, 2020  Registration & Result submission deadline
August 20, 2020  Evaluation results due back to participants
December 2020  NTCIR-15 Conference (NII, Tokyo)



*HOW TO PARTICIPATE

We encourage new participants to have a look at the data in “Trial datasets”. How to download the datasets and participate to the task is described in the SHINRA2020-ml page:
http://shinra-project.info/shinra2020ml/howtoparticipate/?lang=en

Please note that the task is a task in NTCIR-15 and you have to register through the following NTCIR 15 Registration page to participate in it.

NTCIR-15 Registration page (How to participate to NTCIR-15 Task(s)):
http://research.nii.ac.jp/ntcir/ntcir-15/howto.html



*ORGANIZERS


CHAIR:
Satoshi Sekine (RIKEN AIP, Japan)


ORGANIZING COMITTEE:
Masako Nomoto (RIKEN AIP, Japan)
Asuka Sumida (RIKEN AIP, Japan)
Kouta Nakayama (University of Tsukuba/ RIKEN AIP, Japan)
Koji Matsuda (Tohoku University/ RIKEN AIP, Japan)


PC MEMBER:
Jiewen Wu (A*STAR, Singapore)
Christophe Gravier (Université de Lyon, France)
Hsin-Hsi Chen (National Taiwan University, Taiwan)
Haizhou Li (National University of Singapore, Singapore)
Virach Sornlertlamvanich (Thammasat Univercity,
Thailand / Musashino University, Japan)
Massimo Poesio (Mary Queen University of London, England)
Rafael Muñoz Guillena (Universitat d’Alacant, Spain)
Min Zhang (Soochow University, China)
Wenliang Chen (Soochow University, China)
Johan Bos (University of Groningen, Netherland)
Gerhard Weikum (DFKI, Germany)
Asif Ekbal (IIT Patna, India)
Gjergji Kasneci (Tübingen University, Germany)
Vasudeva Varma (IIIT Hyderabad, India)
Asanee Kasetsart (Kasetsart University, Thailand)
Pierpaolo Basile (Università degli Studi di Bari Aldo Moro, Italy)
David Nadeau (Innodata, Canada)
Murat Can Ganiz (Marmara University, Turkey)
Adrian Iftene (“Alexandru Ioan Cuza” University, Romania)
Tommi A Pirinen (Universität Hamburg, Germany)
Tru Cao (Ho Chi Minh City University of Technologies, Vietnam)
Petya Osenove (Sofia University “St. Kl. Ohridski”, Bulgaria)
Le Hong Phuong (Vietnam National University, Hanoi, Vietnam)
Nguyen Thi Minh Huyen (Vietnam National University, Hanoi Vietnam)
Nicolas Heist (Universität Mannheim, Germany)
Zdenek Zabokrtsky (Charles University, Czech Republic)
Tim Finin (University of Maryland, USA)
Su Jian (A*STAR, Singapore)
Manar Alkhatib (The British University in Dubai, United Arab Emirates)
Key-Sun Choi  (Korea Advanced Institute of Science and Technology, Korea)
Nigel Collier (University of Cambridge, UK)
Ikuya Yamada (Studio Ousia, Japan)
Kentaro Inui (Tohoku University, Japan)
Tomoya Iwakura (Fujitsu, Japan)
Mehrnoush Shamsfard (Shahid Beheshti University, Iran)
Galia Angelova (Bulgarian Academy of Sciences, Bulgaria)
Yusuke Miyao (The University of Tokyo, Japan)
Kiril Simov (Bulgarian Academy of Sciences, Bulgaria)
Yukino Baba (University of Tsukuba, Japan)
Masaharu Yoshioka (Hokkaido University, Japan)
Heng Ji (University of Illinois at Urbana-Champaign, USA)
Miloslav Konopik (University of West Bohemia, Czech Republic)
Steven Skiena (Stony Brook University, USA)
Catherine Legg (Deakin University, Australia)



*CONTACT

Email to the organizers: shinra20...@googlegroups.com

Slack among the participants and organizers: http://shinra2020-ml.slack.com



*LINKS

SHINRA2020-ML homepage: http://shinra-project.info/shinra2020ml/?lang=en

Extended Named Entity: http://ene-project.info/?lang=en

NTCIR-15 Task Overview and Call for Task Participation: http://research.nii.ac.jp/ntcir/ntcir-15/tasks.html



Reply all
Reply to author
Forward
0 new messages