Shared Task on Readability-Controlled Text Simplification @ TSAR Workshop (EMNLP2025)

Joseph Marvin R. Imperial

Jul 19, 2025, 2:53:18 PM

to siglex-...@googlegroups.com

We invite participation in the TSAR 2025 Shared Task on Readability-Controlled Text Simplification, aimed at generating simplifications of texts that conform to a specified target readability level, balancing reduced linguistic complexity with meaning preservation and fluency.

Description

The task targets English-language paragraphs written at upper-intermediate or advanced levels and requires participants to simplify them according to a specified target readability level, defined using the Common European Framework of Reference for Languages (CEFR). Specifically, participants will be asked to simplify texts originally at B2 level or above to target levels of A2 and B1.

Participants are expected to adjust linguistic complexity while preserving the core meaning and coherence of the original paragraph.

Important Dates

All deadlines fall at 11:59 PM UTC-12:00 (“anywhere on Earth”) on the dates listed.

18 July 2025 – Trial data and evaluation scripts released
15 August 2025 – Test data released
26 August 2025 – System outputs due
2 September 2025 – Evaluation results published
23 September 2025 – System description papers due
30 September 2025 – Reviews and notifications sent
7 October 2025 – Camera-ready papers due
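
For participants planning around these deadlines in another time zone, the conversion can be sketched as follows. This is only an illustration using Python's standard library; the helper name `aoe_deadline_to_utc` is made up for this example and is not part of any official tooling.

```python
from datetime import datetime, timedelta, timezone

# "Anywhere on Earth" is the fixed offset UTC-12:00.
AOE = timezone(timedelta(hours=-12), name="AoE")


def aoe_deadline_to_utc(year: int, month: int, day: int) -> datetime:
    """Return the instant 11:59 PM AoE on the given date, expressed in UTC."""
    local = datetime(year, month, day, 23, 59, tzinfo=AOE)
    return local.astimezone(timezone.utc)


# System outputs due: 26 August 2025, 11:59 PM AoE.
outputs_due = aoe_deadline_to_utc(2025, 8, 26)
print(outputs_due.isoformat())  # 2025-08-27T11:59:00+00:00
```

In other words, each deadline expires at 11:59 AM UTC on the day after the listed date.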

Data

This is a fully open task in terms of system development. Participants are free to use any publicly available data or resources. No training data will be provided.

Trial Data

We will release a trial dataset including:

Input paragraphs with associated target CEFR levels
One or more reference simplifications
Official evaluation scripts

This release is intended to help participants understand the data format, expected output, and evaluation process.

Test Data

The final test set will consist of paragraphs with target CEFR levels. No references will be provided.

Participants must submit their outputs strictly following a prespecified format for official evaluation.

Evaluation

Submissions will be evaluated using the following metrics:

CEFR Compliance: A CEFR-level classifier will verify whether the simplified paragraph meets the specified target level.
Meaning Preservation: Semantic similarity between the source paragraph and the system output.
Similarity to References: Semantic similarity between the system output and references.

The official evaluation scripts will be released together with the trial data.
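
To give a rough feel for how the three metrics fit together, here is a minimal sketch. It is not the official evaluation script, and it rests on several assumptions: CEFR compliance is scored as an exact match between a classifier's predicted level and the target level, and semantic similarity is replaced by a toy word-overlap cosine that merely stands in for whatever similarity model the organizers use. The function names `cefr_compliance` and `toy_similarity` are hypothetical.

```python
import math
from collections import Counter


def cefr_compliance(predicted_levels, target_levels):
    """Fraction of outputs whose predicted CEFR level equals the target.

    Assumption: exact-match scoring; the official scripts may differ.
    """
    hits = sum(p == t for p, t in zip(predicted_levels, target_levels))
    return hits / len(target_levels)


def toy_similarity(text_a, text_b):
    """Cosine similarity over lowercase word counts.

    Assumption: a crude stand-in for a real semantic similarity model.
    """
    counts_a = Counter(text_a.lower().split())
    counts_b = Counter(text_b.lower().split())
    dot = sum(counts_a[word] * counts_b[word] for word in counts_a)
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Invented example texts, for illustration only.
source = "The committee postponed the decision until further notice"
output = "The committee delayed the decision"
reference = "The committee decided to wait"

# Levels a classifier might have predicted for three outputs vs. their targets.
compliance = cefr_compliance(["A2", "B1", "B1"], ["A2", "B1", "A2"])
meaning_preservation = toy_similarity(source, output)
reference_similarity = toy_similarity(output, reference)
print(compliance, meaning_preservation, reference_similarity)
```

The sketch keeps the three scores separate, mirroring the list above: one compares the output with the target level, one with the source paragraph, and one with the references.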

Participation

All participants must register in advance using the following form:

Registration Form

Registered participants will receive announcements, updates, and submission instructions.

Publication

Participants are invited to submit a system description paper to the TSAR 2025 Workshop. All papers will undergo peer review, and accepted papers will appear in the workshop proceedings.

Organizers

Fernando Alva-Manchego (Cardiff University, UK)
Regina Stodden (University of Bielefeld, Germany)
Kai North (Cambium Assessment, USA)
Joseph Marvin Imperial (National University Philippines and University of Bath, UK)
Abdullah Barayan (Cardiff University, UK)
Harish Tayyar Madabushi (University of Bath, UK)

Contact

For questions, please contact us at tsarworkshop@googlegroups.com, adding [Shared Task] to the email subject.
