Account Options

  1. Sign in
The old Google Groups will be going away soon, but your browser is incompatible with the new version.
Google Groups Home
« Groups Home
Message from discussion Unicode conversion bug?

Received: by 10.114.192.1 with SMTP id p1mr6721503waf.14.1205404313616;
        Thu, 13 Mar 2008 03:31:53 -0700 (PDT)
Return-Path: <pin...@progiciels-bpi.ca>
Received: from phenix.progiciels-bpi.ca (206-248-137-202.dsl.teksavvy.com [206.248.137.202])
        by mx.google.com with ESMTP id k36si3000484waf.1.2008.03.13.03.31.49;
        Thu, 13 Mar 2008 03:31:53 -0700 (PDT)
Received-SPF: neutral (google.com: 206.248.137.202 is neither permitted nor denied by best guess record for domain of pin...@progiciels-bpi.ca) client-ip=206.248.137.202;
Authentication-Results: mx.google.com; spf=neutral (google.com: 206.248.137.202 is neither permitted nor denied by best guess record for domain of pin...@progiciels-bpi.ca) smtp.mail=pin...@progiciels-bpi.ca
Received: by phenix.progiciels-bpi.ca (Postfix, from userid 2001)
	id 574DE96D4A; Thu, 13 Mar 2008 06:31:48 -0400 (EDT)
Date: Thu, 13 Mar 2008 06:31:48 -0400
From: =?utf-8?B?RnJhbsOnb2lz?= Pinard <pin...@iro.umontreal.ca>
To: vim_multibyte@googlegroups.com
Subject: Re: Unicode conversion bug?
Message-ID: <20080313103148.GA5474@phenix.progiciels-bpi.ca>
References: <672e42ee-97d1-4979-83b4-402f2e30cbc0@e23g2000prf.googlegroups.com> <47D78E76.2040408@gmail.com> <47D79E02.4040402@hkstar.com> <47D7A455.9020905@gmail.com> <47D8943B.5080409@hkstar.com> <47D8A215.4020504@gmail.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Disposition: inline
Content-Transfer-Encoding: 8bit
In-Reply-To: <47D8A215.4020504@gmail.com>
User-Agent: Mutt/1.5.16 (2007-06-09)

[Tony Mechelynck]

>[...] and since that Unicode encoding can represent anything [...]

This is a common misconception.  Unicode can represent many things, not 
anything.  On one side, the W3C consortium has dispositions against 
attributing, in the future, single code points where combination 
characters would do, while keeping in Unicode what has already been 
lobbied by richer countries.  That is, Unicode is meant to be easier to 
use for some than for others.  Unicode is also set for supporting "main" 
scripts, not necessarily all of them.  It means that poorer nations have 
less chance to get their script well represented in Unicode, if at all.

-- 
François Pinard   http://pinard.progiciels-bpi.ca