Unicode conversion bug?
Date: Thu, 13 Mar 2008 06:31:48 -0400
From: François Pinard <pin...@iro.umontreal.ca>
To: vim_multibyte@googlegroups.com
Subject: Re: Unicode conversion bug?
Message-ID: <20080313103148.GA5474@phenix.progiciels-bpi.ca>
References: <672e42ee-97d1-4979-83b4-402f2e30cbc0@e23g2000prf.googlegroups.com> <47D78E76.2040408@gmail.com> <47D79E02.4040402@hkstar.com> <47D7A455.9020905@gmail.com> <47D8943B.5080409@hkstar.com> <47D8A215.4020504@gmail.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Disposition: inline
Content-Transfer-Encoding: 8bit
In-Reply-To: <47D8A215.4020504@gmail.com>
User-Agent: Mutt/1.5.16 (2007-06-09)
[Tony Mechelynck]
>[...] and since that Unicode encoding can represent anything [...]
This is a common misconception. Unicode can represent many things, not
anything. For one thing, the W3C has taken a position against assigning,
in the future, single code points where sequences of combining
characters would do, while Unicode keeps the precomposed characters
that richer countries have already lobbied for. That is, Unicode is
meant to be easier to use for some than for others. Unicode is also set
up to support "main" scripts, not necessarily all of them. It means that
poorer nations have less chance of getting their scripts well
represented in Unicode, if at all.
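
To make the point about single code points versus combining sequences
concrete, here is a minimal sketch in Python using the standard
unicodedata module. The two sample letters are my own choice for
illustration: "é" has a precomposed code point, while "ẹ́" (as written in
Yoruba, for example) does not, as far as I know.

```python
import unicodedata

# "e" + COMBINING ACUTE ACCENT: a precomposed form exists (U+00E9),
# so NFC normalization folds the pair into a single code point.
e_acute = unicodedata.normalize("NFC", "e\u0301")

# "e" + COMBINING DOT BELOW + COMBINING ACUTE ACCENT: only "e with dot
# below" (U+1EB9) is precomposed, so the acute accent has to stay as a
# separate combining character after normalization.
e_dot_acute = unicodedata.normalize("NFC", "e\u0323\u0301")

print(len(e_acute), [hex(ord(c)) for c in e_acute])
print(len(e_dot_acute), [hex(ord(c)) for c in e_dot_acute])
```

Both strings display as one letter, yet the second one still takes two
code points, so any software that counts, slices or compares code points
has to work harder for it.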
--
François Pinard http://pinard.progiciels-bpi.ca