Dear participants,
the data for the WMT 2023 Shared Task on Parallel Data Curation has been made available. Instructions for downloading the data can be found on the shared task website:
http://www2.statmt.org/wmt23/data-task.html
Along with the data, we are making evaluation scripts available. These scripts read data in the shared task submission format and produce end-to-end machine translation quality results. More details on how to run the evaluation scripts, with an example for a baseline, can be found here:
https://github.com/awslabs/sockeye/blob/wmt23_data_task/README.md
If you have any questions about the data or how to run an evaluation, please don't hesitate to reach out to us.
Best,
WMT 2023 Shared Task on Parallel Data Curation Organizers