First, what do you mean by "no break space". Do you mean "faasele-ye majaazi", which is names U+200C ZERO-WIDTH NON-JOINER (ZWNJ) in Unicode? The thing is that there is no U+00A0 NBSP character in the attached text file.
Anyway, the commands I noted may be used to work with utf-8 files, as long as you pass them the "escaped utf-8 sequence" of the characters from bash. You may want to try them out with a visible character like U+0627 ARABIC LETTER ALEF (
http://www.fileformat.info/info/unicode/char/0627/index.htm ), then use it with ZWNJ, NBSP or BOM.
Best,
-Behnam