Call for Participation: Arabic Sentence Segmentation Shared Task 2026

2 views
Skip to first unread message

Bashar Alhafni

unread,
9:56 AM (5 hours ago) 9:56 AM
to sig...@googlegroups.com
@


We are excited to announce the Arabic Sentence Segmentation Shared Task, which focuses on automatically identifying sentence boundaries in Arabic documents. The task is formulated as a binary token classification problem: given an Arabic document as input, systems must predict whether a sentence boundary follows each token.

Task 1: Paragraph-Aware Arabic Sentence Segmentation
Given an Arabic document with its paragraph boundaries, predict for each token whether a sentence boundary follows it.

Task 2: No-Punctuation Paragraph-Aware Arabic Sentence Segmentation
Given an Arabic document with its paragraph boundaries but without punctuation, predict for each token whether a sentence boundary follows it.

Task 3: No-Paragraph Arabic Sentence Segmentation
Given an Arabic document without its paragraph boundaries, predict for each token whether a sentence boundary follows it.

Task 4: No-Punctuation No-Paragraph Arabic Sentence Segmentation
Given an Arabic document without its paragraph boundaries and without punctuation, predict for each token whether a sentence boundary follows it.

For each task, there will be two tracks, allowing different data sources for training: Closed and Open. Participants may compete in any combination of subtasks and tracks.

Important Dates:
All deadlines are 11:59pm UTC-12 (anywhere on Earth):
  • June 1, 2026: Release of training, dev and open test data, and evaluation scripts.
  • July 20, 2026: Registration deadline and release of test data.
  • July 25, 2026: End of evaluation cycle (test set submission closes).
  • August 8, 2026: System description paper submissions due.
  • August 15, 2026: Notification of acceptance.
  • August 15, 2026: Final results released.
  • August 22, 2026: Camera-ready versions due.

Awards:
Top-performing Systems:
  • We will recognize the top-performing system in each task-track combination (4 tasks x 2 tracks), with a $100 prize awarded to the winning team in each category.

Best System Description Paper:
  • We will also award a $200 prize for the best system description paper, recognizing clarity, technical quality, reproducibility, and insight, independent of shared task performance.

Organizers:

Shared Task Website: https://www.araseg.aramlab.ai/

Shared Task Registration Link: https://forms.gle/wsP2tTYQ7NrqvTEC7

Contact:
For any questions related to the task, check out the FAQs. Feel free to post your questions on our Slack workspace. You are also welcome to contact the organizers directly at this email address: araseg26....@aramlab.ai



Reply all
Reply to author
Forward
0 new messages