allCountries.txt contains a field for alternative names.
That field holds names in all kinds of languages and scripts, among them Hebrew and Arabic.
These two run right to left and are hard to handle when processing the alternative names for a user interface,
at least for someone who, like me, is not used to dealing with such scripts.
Is there a way to remove these two languages from the data?
Occasionally, but not systematically, the Unicode character U+200E LEFT-TO-RIGHT MARK also appears in this field; it should be removed together with the Hebrew and Arabic names.
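
To make the request concrete, here is a minimal Python sketch of the kind of filtering I have in mind. It rests on several assumptions of mine: that the file is tab-separated, that the alternative names sit in the fourth column as a comma-separated list, and that "Hebrew and Arabic" can be approximated by Unicode block ranges. The function names and the column index are illustrative, not part of any existing tool.

```python
import re

# Assumption: Hebrew and Arabic names are detected by Unicode block,
# not by language, since the field carries no language tags.
RTL_SCRIPT = re.compile(
    "[\u0590-\u05FF"   # Hebrew
    "\u0600-\u06FF"    # Arabic
    "\u0750-\u077F"    # Arabic Supplement
    "\u08A0-\u08FF"    # Arabic Extended-A
    "\uFB1D-\uFB4F"    # Hebrew presentation forms
    "\uFB50-\uFDFF"    # Arabic Presentation Forms-A
    "\uFE70-\uFEFC]"   # Arabic Presentation Forms-B
)
LRM = "\u200e"  # U+200E LEFT-TO-RIGHT MARK


def clean_alternate_names(field):
    """Drop names containing Hebrew/Arabic characters and strip U+200E."""
    # Remove the mark first, then trim any whitespace it leaves behind.
    names = (name.replace(LRM, "").strip() for name in field.split(","))
    return ",".join(n for n in names if n and not RTL_SCRIPT.search(n))


def clean_line(line, column=3):
    """Clean one tab-separated record; column=3 is an assumed position."""
    cols = line.rstrip("\n").split("\t")
    if len(cols) > column:
        cols[column] = clean_alternate_names(cols[column])
    return "\t".join(cols)
```

Applied line by line, `clean_line` would leave every other column untouched. Because the check is by script rather than by language, it would also drop names in other languages written in Arabic script (Persian or Urdu, for example), which is acceptable for my purpose but may not be for others.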