Dear SIGARAB Team,
I hope you are doing well.
I am currently looking for a reliable English-to-Arabic translation tool capable of handling a large volume of data. The goal is to obtain accurate translations that preserve both linguistic quality and semantic meaning.
Given your expertise in Arabic NLP, I would greatly appreciate your recommendations on tools or services that you consider effective for this task, whether open-source or commercial.
Thank you very much for your time and support.
Best regards.
[CAUTION: Non-UBC Email] |
You could try NLLB-200-distilled-600M, pretrained multilingual MT model, it gives good results for English–Arabic translation.
Also, take a look at FineWeb-EDU-Ar (arXiv:2411.06402) — it compares several models for this task.
I also have a quick question for Ahmed: could you kindly let me know which tokenizer you used in your experiments?
Best regards,
Meriem Sellami
To view this discussion visit https://groups.google.com/d/msgid/sigarab/CABpC1GX9aG25JkQm_z%3DvCbGeJfqmmpu4PNaxHHFUbGZazj-yrg%40mail.gmail.com.