Dear participants,
We have a couple of big changes to announce:
1. We detected that the current test set has some wrong answers, because the dataset is LLM generated. We have curated a new test set, which is manually verified by 2 humans to be correct. We request all participants to abandon the existing competition link (
https://codalab.lisn.upsaclay.fr/19140), and instead participate on the new link
https://codalab.lisn.upsaclay.fr/competitions/19747. We had to create a new competition link because codalab does not allow adding new phases to an existing competition.
2. In the new competition, we have Exact Match and F-Score as measures. We abandon the METEOR score as a measure.
3. To see both the metrics for your submitted solutions, you must make the submission public. This is a codalab limitation which we could not find a workaround for.
4. On the request of participants, we have extended the deadlines by a further 2 weeks.