We are pleased to announce the release of DEAST, a new dataset introduced in our paper titled “DEAST: A Dataset for English–Arabic Scientific Translation and Vice Versa.”
DEAST is a high-quality bidirectional Arabic–English parallel dataset that focuses on scientific texts. It is intended to support research in domain-specific MT and cross-lingual applications involving Arabic and English.We hope DEAST will be a useful resource for Arabic language technology. We welcome feedback and encourage researchers to explore and use the dataset in their work.
📄 Paper and dataset access:
https://www.sciencedirect.com/science/article/pii/S2352340925010947
Direct link to dataset: