Language domains that require very careful use of terminology are abundant. The need to adequately translate within such domains is undeniable, as shown by e.g. the different WMT shared tasks on biomedical translation.
More interestingly, as the abundance of research on domain adaptation shows, such language domains are (a) not adequately covered by existing data and models, while (b) new (or “surge”) domains arise and models need to be adapted, often with significant downstream implications: consider the new COVID-19 domain and the large efforts for translation of critical information regarding pandemic handling and infection prevention strategies.
In the case of newly developed domains, while parallel data are hard to come by, it is fairly straightforward to create word- or phrase-level terminologies, which can be used to guide professional translators and ensure both accuracy and consistency.
This shared task will replicate such a scenario, and invites participants to explore methods to incorporate terminologies into either the training or the inference process, in order to improve both the accuracy and consistency of MT systems on a new domain.
Release of training data and terminologies | April 2021 |
Surprise languages announced: | June 28, 2021 |
Test set available | July 19, 2021 |
Submission of translations | July 23, 2021 |
System descriptions due | August 5, 2021 |
Camera-ready for system descriptions | September 15, 2021 |
Conference in Punta Cana | November 10-11, 2021 |
The submission report should highlight in which ways participants’ methods and data differ from the standard MT approach. They should make clear which tools were used, and which training sets were used.
You may participate in any or all of the language pairs.
--
You received this message because you are subscribed to the Google Groups "Workshop on Statistical Machine Translation" group.
To unsubscribe from this group and stop receiving emails from it, send an email to wmt-tasks+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/wmt-tasks/4cdc7ea7-9252-4142-a8e5-ccd9f2b8ee7an%40googlegroups.com.
Regards,
Dr. T.Bergmanis
NLProc researcher @ Tilde MT
To view this discussion on the web visit https://groups.google.com/d/msgid/wmt-tasks/CAAFADDD3Qp%2Bf7NUrfAVnjguvALwC4cK1nPoGsK%3DqAxRvFikEnA%40mail.gmail.com.
You received this message because you are subscribed to a topic in the Google Groups "Workshop on Statistical Machine Translation" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/wmt-tasks/aCMC0M5X_R4/unsubscribe.
To unsubscribe from this group and all its topics, send an email to wmt-tasks+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/wmt-tasks/32b54372-0445-479c-b123-4628928a05c3n%40googlegroups.com.