Just curious, do you think we must uses UTF8 with or without BOM at the beginning
https://en.wikipedia.org/wiki/Byte_order_mark
http://www.prelude.me/index.php/2011/01/15/utf-8-avec-ou-sans-bom/
When I asked S. Paumier (a lot of time ago) about utf8 in java portion, he tell me UTF16 is better for file mapped when editor open very very big file (because in UTF8, we known character N is at position N*2 in byte, and in UTF8 we must read the file).
But pehaps this is not used now
Regards
Gilles
De : unitex-...@googlegroups.com [mailto:unitex-...@googlegroups.com] De la part de eric.laporte
Envoyé : Thursday, November 12, 2015 10:18 AM
À : Unitex-GramLab
Cc : denis....@univ-tours.fr
Objet : [Unitex-GramLab] Re: UTF-8
Dear all,
I agree with Denis: UTF-8 is more standard and more compact for some languages. Thanks for pointing this out.
Eric
--
You received this message because you are subscribed to the Google Groups "Unitex-GramLab" group.
To unsubscribe from this group and stop receiving emails from it, send an email to unitex-gramla...@googlegroups.com.
To post to this group, send email to unitex-...@googlegroups.com.
Visit this group at http://groups.google.com/group/unitex-gramlab.
To view this discussion on the web visit https://groups.google.com/d/msgid/unitex-gramlab/74e73df3-8a42-4fa2-911d-53bcf2c9dcb6%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.