Hello Ishtiyaq,
I understand that you might be worried about the performance of your model. Because Arabic and English are quite different languages, it might be difficult to convey in both languages the exact same meaning and context even for a human expert. Also, please bear in mind that the way in which humans understand the language is different to how computers do so because of the way in which words are represented (as vectors of numbers).
Nonetheless, before using your model in production, Google Cloud Platform provides you with the Bleu Score [1] so you can know beforehand how good your model is. There are two kinds of models you can use in GCP depending on whether you are going to use everyday language or if, instead, the scope of your app will focus on a certain domain.
The Natural Language API [2] is a general model that is not specialised in any specific field of knowledge. Nevertheless, better results can be obtained by providing your own datasets and labels for a specific domain (for example IT, biology, jargon language, etc). If that was your case, I would suggest you also check the AutoML Natural Language product in GCP [3] if you want to provide your own domain-specific keywords or phrases.
For troubleshooting further issues that might arise about this topic, please also consider using IssueTracker [4] or signing up for one of Google Support plans [5].
Have a nice day!
[1]: https://cloud.google.com/translate/automl/docs/evaluate#bleu
[2]: https://cloud.google.com/natural-language/docs
[3]: https://cloud.google.com/translate/automl/docs
[4]: https://developers.google.com/issue-tracker
[5]: https://cloud.google.com/support