Hi,
we have updated our deadlines to align with EMNLP and the submission week for this year is 13th July to 20th July.
http://www2.statmt.org/wmt23/translation-task.html
Secondly, with the recent rise of LLMs trained on unknown data, we want to create challenging test sets that can also evaluate LLMs capacities.
Thus, we are looking for unseen or fresh data ideally created this year. Please, contact us, if you know about any source, where we could collect fresh monolingual data with research permissive licence.
Or let us know if you can donate monolingual data to General MT. We are looking for even small quantities of couple of hundred sentences.
Thank you and have a lovely day,
Tom
(in Germany, he/him)
--
You received this message because you are subscribed to the Google Groups "Workshop on Statistical Machine Translation" group.
To unsubscribe from this group and stop receiving emails from it, send an email to wmt-tasks+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/wmt-tasks/AS4PR83MB05235DA367C59EE998FAEB91FA909%40AS4PR83MB0523.EURPRD83.prod.outlook.com.
Hi Tom, and WMT orgnisers,
Please have a look into our Multi-lingual parallel corpora AlphaMWE: https://github.com/aaronlifenghan/AlphaMWE
It is overall 750 sentences split into 5 files, covering English, Chinese, Polish, German, and Arabic (partial). it is mixed domain data, also featuring multi-word expressions.
Kind regards,
Lifeng
--
You received this message because you are subscribed to the Google Groups "Workshop on Statistical Machine Translation" group.
To unsubscribe from this group and stop receiving emails from it, send an email to wmt-tasks+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/wmt-tasks/AS4PR83MB05235DA367C59EE998FAEB91FA909%40AS4PR83MB0523.EURPRD83.prod.outlook.com.
News:
CFPs and participants: HealTAC23 (Manchester June 14-16) |
CF-Participants & Sponsorship MWE23@EACL (joint w ClinicalNLP@ACL) |
ClinicalMT@WMT22_w_EMNLP | Meta-eval Tutorial / HumanEval (paper_w_tool) / TranslationUncertainty (paper) @LREC22 | ClinicalTextMinging (ML-Tools) @HealTAC2022 || Covid-Topic-Modeling (arXiv-2023) | Measuring_IRR(inter-rater reliability) | AlphaMWE-Arabic (corpus)
Serving as ACL2023 AC (area chair): resource and evaluation |
MWE-SIGLEX elected Standing Committee Board member (2022-2024) |
Ph.D. in Computer Application (Machine Translation, thesis), M.Sc. (Software Engineering, thesis excellent-award), B.Sc. (Math, GPA 80/100)
Google-Scholar , Presentation(ppt), Research-Gate
Google-site Linkedin, Writer(poetry)
Postdoctoral Research Associate at HECTA group, The University of Manchester, UK
https://www.research.manchester.ac.uk/portal/lifeng.han.html Office: 2.90 Kilburn, Oxford Road, Manchester