Thanks in advance.
Wayne
Most developers I have talked to are using UTF-8.
--
Phil Frisbie, Jr.
Hawk Software
http://www.hawksoft.com
Eugene
That's probably because most developers are in US and there UTF-8 is just
good ol' ASCII and so is quite efficient for communications. For the far
east countries UTF-8 is less efficient than other alternatives but who cares
<sigh>
Eugene
You have convinced me that this is more complicated than
I thought. I did not know that anybody uses 4 bytes for
unicode. Can you direct me to anyplace with good
documentation on how to share unicode strings across a
network?
Thanks for your help.
Wayne
>.
>
Eugene
In this context we are talking about general network transmission. It is up to
the developer to decide if it is better to use UCS-2 or UCS-4 instead of UTF-8.
If using UCS-2 and UCS-4 the byte order can be negotiated at connection time or
hard coded into the application.
> Eugene
> Eugene,
>
> You have convinced me that this is more complicated than
> I thought. I did not know that anybody uses 4 bytes for
> unicode. Can you direct me to anyplace with good
> documentation on how to share unicode strings across a
> network?
I like this reference: http://www.cl.cam.ac.uk/~mgk25/unicode.html
> Thanks for your help.
>
> Wayne
--
Both UCS encodings are ill-suited for network transmissions because of the
8-bit bytes and byte order issues. UTF-16 could be a better alternative to
UTF-8 though.
Some interesting reading:
http://www.faqs.org/rfcs/rfc2277.html
http://www.faqs.org/rfcs/rfc2781.html
Eugene