What do conflicting aliases mean?


Kurosaka, Teruhiko

Jun 4, 2003, 12:07:31 PM
to ICU-Charsets (E-mail), Kurosaka, Teruhiko
There are aliases in ICU that map to more than one converter.
For example, Shift_JIS is an alias for both ibm-943_P14A-1999
and ibm-943_P130-1999, according to these pages:
http://oss.software.ibm.com/cgi-bin/icu/convexp?conv=ibm-943_P14A-1999&s=ALL
http://oss.software.ibm.com/cgi-bin/icu/convexp?conv=ibm-943_P130-1999&s=ALL

Which converter is actually used when Shift_JIS is specified?
How do ibm-943_P14A-1999 and ibm-943_P130-1999 differ?
What does the middle element in these internal encoding names mean?

Thank you in advance.

T. "Kuro" Kurosaka
Internationalization Architect
teruhiko...@iona.com
-------------------------------------------------------
IONA Technologies
2350 Mission College Blvd. Suite 650
Santa Clara, CA 95054
Tel: (408) 350 9684/9500
Fax: (408) 350 9501
-------------------------------------------------------
Making Software Work Together TM

Markus Scherer

Jun 4, 2003, 2:54:03 PM
to Kurosaka, Teruhiko, icu-ch...@www-126.southbury.usf.ibm.com
Most charset names are extremely unreliable. For an example of some of the
problems with legacy Japanese charsets see
http://www.w3.org/TR/japanese-xml/

The easiest way to find out how two converters differ is to compare their
.ucm or .xml conversion table files.
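As a rough illustration of such a comparison, here is a minimal Python sketch that diffs the mapping lines of two .ucm files. It assumes the usual .ucm layout, where mappings sit between CHARMAP and END CHARMAP as lines like "<U005C> \x5C |0". The function names parse_ucm and diff_ucm are hypothetical, and the sample tables are made up for the example; they are not the real ibm-943 data.

```python
import re

# One mapping line inside CHARMAP, e.g. "<U005C> \x5C |0".
# The trailing "|n" precision flag is treated as optional (assumed 0 if absent).
_MAPPING = re.compile(
    r"^<U([0-9A-Fa-f]{4,6})>\s+((?:\\x[0-9A-Fa-f]{2})+)\s*(?:\|(\d))?"
)


def parse_ucm(text):
    """Return {(code_point, byte_sequence): flag} for the CHARMAP section."""
    mappings = {}
    in_charmap = False
    for line in text.splitlines():
        line = line.strip()
        if line == "CHARMAP":
            in_charmap = True
        elif line == "END CHARMAP":
            in_charmap = False
        elif in_charmap:
            m = _MAPPING.match(line)
            if m:
                code_point = int(m.group(1), 16)
                raw = bytes.fromhex(m.group(2).replace("\\x", ""))
                mappings[(code_point, raw)] = int(m.group(3) or 0)
    return mappings


def diff_ucm(a_text, b_text):
    """Return (only_in_a, only_in_b, flag_changed) as sorted lists of keys."""
    a, b = parse_ucm(a_text), parse_ucm(b_text)
    only_a = sorted(k for k in a if k not in b)
    only_b = sorted(k for k in b if k not in a)
    changed = sorted(k for k in a if k in b and a[k] != b[k])
    return only_a, only_b, changed


# Made-up sample tables, for illustration only.
SAMPLE_A = r"""
CHARMAP
<U005C> \x5C |0
<U00A5> \x5C |1
END CHARMAP
"""

SAMPLE_B = r"""
CHARMAP
<U005C> \x5C |1
<U00A5> \x5C |0
<U203E> \x7E |1
END CHARMAP
"""

if __name__ == "__main__":
    only_a, only_b, changed = diff_ucm(SAMPLE_A, SAMPLE_B)
    for label, keys in (("only in A", only_a), ("only in B", only_b),
                        ("flag differs", changed)):
        for code_point, raw in keys:
            print(f"{label}: U+{code_point:04X} <-> {raw.hex().upper()}")
```

To compare real tables, read the two .ucm files into strings and pass them to diff_ucm. The sketch ignores header fields and any extension sections.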

The names are constructed as Unicode Technical Report #22 suggests, see
http://www.unicode.org/reports/tr22/
For IBM charsets, the number is the IBM CCSID, and the suffix after it is
a variant indicator generated from IBM's conversion table filenames. See
the ICU User Guide:
http://oss.software.ibm.com/icu/userguide/conversion-data.html
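To make the naming scheme concrete, here is a small Python sketch that splits an IBM-style converter name into the parts described above. It assumes the name has the shape "ibm-<number>_<suffix>" and treats everything after the underscore as the variant suffix without interpreting it further. The helper name split_ibm_name is hypothetical.

```python
def split_ibm_name(name):
    """Split e.g. 'ibm-943_P14A-1999' into (CCSID, variant suffix)."""
    source, _, rest = name.partition("-")
    number, _, suffix = rest.partition("_")
    if source != "ibm" or not number.isdigit():
        raise ValueError(f"not an ibm-<number>_<suffix> name: {name!r}")
    return int(number), suffix


if __name__ == "__main__":
    for name in ("ibm-943_P14A-1999", "ibm-943_P130-1999"):
        ccsid, suffix = split_ibm_name(name)
        print(f"{name}: CCSID {ccsid}, variant suffix {suffix}")
```

Both names in the question share CCSID 943 and differ only in the suffix, which is why they count as variants of the same charset.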

markus

Markus Scherer マルクス IBM GCoC-Unicode/ICU San José, CA
markus....@us.ibm.com




