We invite you to participate in the second shared task on named entity recognition in Arabic text. Participants will receive the new Wojood corpus (550K words, with fine-grained entity types). The shared task has three subtasks, and you may participate in any of them. One subtask is about the war on Gaza, and participants may use external data for it.
BASELINES
Two baseline models trained on WojoodFine, one flat and one nested, are provided (see Liqreina et al., 2023). The code used to produce these baselines is available on GitHub.
| Subtask | Precision | Recall | Average Micro-F1 |
|---|---|---|---|
| Flat Fine-Grain NER (Subtask 1) | 0.8870 | 0.8966 | 0.8917 |
| Nested Fine-Grain NER (Subtask 2) | 0.9179 | 0.9279 | 0.9229 |
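For readers unfamiliar with the metric in the table, the following is a minimal sketch of micro-averaged precision, recall, and F1 over entity spans. It is not the official scoring script. The `(start, end, type)` span representation and the names `micro_prf` and `f1_from_pr` are assumptions made for illustration. Because each sentence is a set of spans, overlapping (nested) mentions are handled the same way as flat ones.

```python
from typing import List, Set, Tuple

# Hypothetical span representation: (start_token, end_token, entity_type).
Span = Tuple[int, int, str]


def f1_from_pr(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


def micro_prf(gold: List[Set[Span]], pred: List[Set[Span]]) -> Tuple[float, float, float]:
    """Micro-average: pool span counts over all sentences, then compute P, R, F1."""
    # A predicted span is correct only if boundaries and type both match a gold span.
    true_pos = sum(len(g & p) for g, p in zip(gold, pred))
    n_pred = sum(len(p) for p in pred)
    n_gold = sum(len(g) for g in gold)
    precision = true_pos / n_pred if n_pred else 0.0
    recall = true_pos / n_gold if n_gold else 0.0
    return precision, recall, f1_from_pr(precision, recall)


# Toy data (invented for illustration): two sentences, one mistyped prediction.
gold = [{(0, 1, "PERS"), (3, 3, "GPE")}, {(2, 4, "ORG")}]
pred = [{(0, 1, "PERS"), (3, 3, "LOC")}, {(2, 4, "ORG")}]
print(micro_prf(gold, pred))

# Sanity check against the baseline table: F1 from the reported P and R.
print(round(f1_from_pr(0.8870, 0.8966), 4), round(f1_from_pr(0.9179, 0.9279), 4))
```

Applying the harmonic mean to the reported precision and recall reproduces both F1 values in the table to within rounding, since the published precision and recall are themselves rounded to four decimals.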
GOOGLE COLAB NOTEBOOKS
To allow you to experiment with the baseline, we authored four Google Colab notebooks that demonstrate how to train and evaluate our baseline models.
[1] Train Flat Fine-Grain NER: This notebook trains our ArabicNER model on the flat Fine-Grain NER task using the sample Wojood_Fine data.
[2] Evaluate Flat Fine-Grain NER: This notebook loads the trained model saved by the notebook above and evaluates it on an unseen dataset.
[3] Train Nested Fine-Grain NER: This notebook trains our ArabicNER model on the nested Fine-Grain NER task using the sample Wojood data.
[4] Evaluate Nested Fine-Grain NER: This notebook loads the trained model saved by the notebook above and evaluates it on an unseen dataset.
REGISTRATION
Participants need to register via this form (NERSharedTask 2024). Participating teams will be provided with common training and development datasets. No external manually labelled datasets are allowed. A blind test set will be used to evaluate the output of the participating teams. Each team is allowed a maximum of 3 submissions. All teams are required to report results on the development and test sets (after results are announced) in their write-ups.
FAQ
For any questions related to this task, please check our Frequently Asked Questions.
IMPORTANT DATES
- February 25, 2024: Shared task announcement.
- March 1, 2024: Release of training data, development sets, scoring script, and Codalab links.
- April 5, 2024: Registration deadline.
- April 26, 2024: Test set made available.
- May 3, 2024: Codalab Test system submission deadline.
- May 10, 2024: Shared task system paper submissions due.
- June 17, 2024: Notification of acceptance.
- July 1, 2024: Camera-ready version.
- August 16, 2024: ArabicNLP 2024 conference in Thailand.
CONTACT
For any questions related to this task, please contact the organizers directly using the following email address: NERSha...@gmail.com .
ORGANIZERS
- Mustafa Jarrar, Birzeit University
- Muhammad Abdul-Mageed, University of British Columbia & MBZUAI
- Mohammed Khalilia, Birzeit University
- Bashar Talafha, University of British Columbia
- AbdelRahim Elmadany, University of British Columbia
- Nagham Hamad, Birzeit University