
Locale to Unicode Codepoint mapping?


Peter Olcott

Dec 23, 2009, 3:56:17 PM
I need to know the set of Unicode codepoints associated with
every regional dialect of a human language. How can I go
about finding this information?

It does not seem to be anywhere in the Unicode Consortium
online documentation.


Mihai N.

Dec 23, 2009, 9:41:58 PM

It is part of CLDR (Unicode Common Locale Data Repository)
http://cldr.unicode.org/

But take it with a grain of salt.
It is more of a guideline than a set of hard rules, so don't try to
validate input against it, for instance.

Are accented characters ok in English?
At first glance, no. But then you think of Résumé
(and a few other words) and discover that they are probably ok
(http://en.wikipedia.org/wiki/Acute_accent#Use_in_English)
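
As a rough illustration of the "guideline, not a hard rule" point: CLDR lists exemplar characters per locale in sets such as "main" and "auxiliary". The Python sketch below uses made-up sample strings written in that style (they are not copied from CLDR) and a parser that only understands single characters and simple x-y ranges, which is far narrower than the real UnicodeSet syntax. The names SAMPLE_EXEMPLARS, parse_simple_set and classify are hypothetical.

```python
import unicodedata

# Hypothetical sample data in the style of CLDR exemplar sets.
# Illustrative only: NOT copied from an actual CLDR release.
SAMPLE_EXEMPLARS = {
    "main": "[a-z]",
    "auxiliary": "[á à é è ñ ö]",
}


def parse_simple_set(pattern):
    """Parse a tiny subset of UnicodeSet syntax into a set of codepoints.

    Assumption: tokens are separated by spaces and are either a single
    character or a range written as x-y. Anything else is rejected.
    """
    body = pattern.strip()
    if not (body.startswith("[") and body.endswith("]")):
        raise ValueError("expected a pattern of the form [...]")
    codepoints = set()
    for token in body[1:-1].split():
        if len(token) == 3 and token[1] == "-":
            first, last = ord(token[0]), ord(token[2])
            codepoints.update(range(first, last + 1))
        elif len(token) == 1:
            codepoints.add(ord(token))
        else:
            raise ValueError("unsupported token: %r" % token)
    return codepoints


def classify(text, exemplars):
    """Label each letter of text as 'main', 'auxiliary' or 'other'."""
    main = parse_simple_set(exemplars["main"])
    auxiliary = parse_simple_set(exemplars["auxiliary"])
    labels = {}
    # NFC so that e + combining acute is compared as a single codepoint.
    for ch in unicodedata.normalize("NFC", text):
        if not ch.isalpha():
            continue
        lowered = ch.lower()
        key = lowered if len(lowered) == 1 else ch
        if ord(key) in main:
            labels[key] = "main"
        elif ord(key) in auxiliary:
            labels[key] = "auxiliary"
        else:
            labels[key] = "other"
    return labels


if __name__ == "__main__":
    for letter, label in classify("Résumé", SAMPLE_EXEMPLARS).items():
        print("U+%04X %s: %s" % (ord(letter), letter, label))
```

A letter that lands in "auxiliary" or "other" is not an error. It only means the character is less typical for that locale, which is why this kind of data is a poor basis for rejecting input.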


--
Mihai Nita [Microsoft MVP, Visual C++]
http://www.mihai-nita.net
------------------------------------------
Replace _year_ with _ to get the real email

green

Dec 23, 2009, 10:46:49 PM
Thank you Peter, Merry Christmas and a Happy New Year too.


"Peter Olcott" <NoS...@SeeScreen.com> wrote in message news:0MSdnWi0HZ5sHq_W...@giganews.com...
