A few good articles on Unicode
From: "Charles O Nutter" <head...@headius.com>
Date: Fri, 16 Jun 2006 01:15:30 +0900
Local: Thurs, Jun 15 2006 12:15 pm
Subject: A few good articles on Unicode

To add a little fuel to the discussion (and to help dispel some rumors,
myths, and legends about Unicode) I present you with Tim Bray's 4-part
trilogy of articles on Unicode, why it's important, and why you should use
it. The first article provides a nice overview, even mentioning some of the
political and technical difficulties of CJK languages and Unicode (as well
as the previously-mentioned gaiji). The second article discusses character
strings in general. The third, perhaps most relevant to the Ruby Unicode
discussion, is an exploration of characters versus bytes, and how the various
encodings work. The fourth article discusses Java's use of UTF-16
internally, and why that may be a good or bad thing.
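
As a rough illustration of the characters-versus-bytes distinction, here is a
minimal sketch. It assumes a modern Ruby (1.9 or later, which has
encoding-aware strings); the sample strings are made up for the example and
do not come from the articles.

```ruby
# One string, measured three ways: code points, UTF-8 bytes, UTF-16 bytes.
s = "na\u00EFve \u65E5\u672C"   # "naive" with a diaeresis, a space, two kanji

chars       = s.chars.length                  # number of code points
utf8_bytes  = s.encode("UTF-8").bytesize      # 1 to 3 bytes per character here
utf16_bytes = s.encode("UTF-16BE").bytesize   # 2 bytes per character here

puts "characters:   #{chars}"        # 8
puts "UTF-8 bytes:  #{utf8_bytes}"   # 13
puts "UTF-16 bytes: #{utf16_bytes}"  # 16

# A character outside the Basic Multilingual Plane (U+1D11E, a musical
# G clef) needs two 16-bit code units in UTF-16, a surrogate pair.
clef       = "\u{1D11E}"
clef_units = clef.encode("UTF-16BE").unpack("n*").length

puts "clef code points:      #{clef.length}"    # 1
puts "clef UTF-8 bytes:      #{clef.bytesize}"  # 4
puts "clef UTF-16 code units: #{clef_units}"    # 2
```

The last case is why a UTF-16 code unit (such as a Java char) is not always
a whole character.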

At any rate, they're entertaining to read and cleared up a number of my own
questions about Unicode. Perhaps they will help the rest of us in the Ruby
community to understand Unicode as well.

Part 1: On the Goodness of Unicode -
http://www.tbray.org/ongoing/When/200x/2003/04/06/Unicode
Part 2: On Character Strings -
http://www.tbray.org/ongoing/When/200x/2003/04/13/Strings
Part 3: Characters vs. Bytes -
http://www.tbray.org/ongoing/When/200x/2003/04/26/UTF
Part 4: Programming Languages and Text -
http://www.tbray.org/ongoing/When/200x/2003/04/30/JavaStrings

And while not directly related, Tim also experimented with a UTF-8 string
class in Java with full Unicode support and many of the typical C string
operations (strcpy, strstr, ...). Some of the logic he uses to treat a byte
vector as a Unicode string might be applicable to Ruby as well:

Yooster (Ustr): http://www.tbray.org/ongoing/When/200x/2003/05/17/Yooster
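
To give a feel for the byte-vector idea, here is a small sketch in Ruby. It
is not taken from Ustr: the helper names utf8_length and utf8_index are
hypothetical, and the code assumes its input is valid UTF-8 held as an array
of byte values.

```ruby
# Count characters by skipping UTF-8 continuation bytes (bit pattern 10xxxxxx).
def utf8_length(bytes)
  bytes.count { |b| (b & 0xC0) != 0x80 }
end

# strstr-style search over raw bytes; returns a byte offset or nil.
# Because UTF-8 is self-synchronizing, a valid needle can only match
# at a character boundary, so no decoding is needed.
def utf8_index(haystack, needle)
  return 0 if needle.empty?
  (0..haystack.length - needle.length).each do |i|
    return i if haystack[i, needle.length] == needle
  end
  nil
end

bytes = "\u65E5\u672C\u8A9E text".bytes   # three kanji, a space, "text"

puts bytes.length                          # 14 bytes
puts utf8_length(bytes)                    # 8 characters
puts utf8_index(bytes, "text".bytes)       # byte offset 10
puts utf8_index(bytes, "\u8A9E".bytes)     # byte offset 6
```

Note that the search returns byte offsets, not character offsets; converting
between the two means walking the bytes, which is the usual trade-off of a
UTF-8 backed string.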

--
Charles Oliver Nutter @ headius.blogspot.com
JRuby Developer @ jruby.sourceforge.net
Application Architect @ www.ventera.com


From: Dae San Hwang <dae...@gmail.com>
Date: Fri, 16 Jun 2006 01:43:34 +0900
Local: Thurs, Jun 15 2006 12:43 pm
Subject: Re: A few good articles on Unicode

On Jun 16, 2006, at 1:15 AM, Charles O Nutter wrote:

> The fourth article discusses Java's use of UTF-16
> internally, and why that may be a good or bad thing.

Excellent! I'm particularly interested in learning more about the pros and
cons of using UTF-16 internally for all strings (Java) versus being able
to specify a different encoding for each string object (Ruby 2.0).
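
A minimal sketch of what per-string encodings look like in practice, assuming
the encoding-aware String API that shipped in Ruby 1.9 and later (it was not
available when this thread was written). The sample string is made up.

```ruby
# Each string object carries its own encoding tag.
utf8  = "caf\u00E9"                  # tagged UTF-8
latin = utf8.encode("ISO-8859-1")    # same text, transcoded to Latin-1

puts utf8.encoding     # UTF-8
puts latin.encoding    # ISO-8859-1
puts utf8.bytesize     # 5
puts latin.bytesize    # 4

# Same characters, different bytes: the two are not equal until transcoded.
same       = (utf8 == latin)
round_trip = (latin.encode("UTF-8") == utf8)
puts same              # false
puts round_trip        # true

# Mixing non-ASCII strings with different encodings raises an error,
# a cost that a single internal encoding (as in Java) avoids.
mixed_error = begin
  utf8 + latin
  nil
rescue Encoding::CompatibilityError => e
  e.class
end
puts mixed_error       # Encoding::CompatibilityError
```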

Thanks for sharing,

Daesan

Dae San Hwang
dae...@gmail.com


From: Christian Neukirchen <chneukirc...@gmail.com>
Date: Fri, 16 Jun 2006 04:50:25 +0900
Local: Thurs, Jun 15 2006 3:50 pm
Subject: Re: A few good articles on Unicode
"Charles O Nutter" <head...@headius.com> writes:

While we are at it, also see
"The Absolute Minimum Every Software Developer Absolutely, Positively
Must Know About Unicode and Character Sets (No Excuses!)"

http://www.joelonsoftware.com/articles/Unicode.html

--
Christian Neukirchen  <chneukirc...@gmail.com>  http://chneukirchen.org


From: Keith Fahlgren <ke...@oreilly.com>
Date: Fri, 16 Jun 2006 22:10:04 +0900
Local: Fri, Jun 16 2006 9:10 am
Subject: Re: A few good articles on Unicode
On Thursday 15 June 2006 3:50 pm, Christian Neukirchen wrote:

> While we are at it, also see

And it's probably worth mentioning that O'Reilly has a 678-page book on
Unicode coming to bookstores by the end of the month:

http://www.oreilly.com/catalog/unicode/index.html

HTH,
Keith

