Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

typing is ok but copy-paste does not encode

13 views
Skip to first unread message

nick...@yahoo.gr

unread,
Aug 13, 2007, 8:32:56 PM8/13/07
to
similar topic = "Help! Unicode cut/paste on Unix"

OS = No problem is Windows XP , but in ubuntu 7.04
application = aMSN 0.97RC1
tcl/tk version = 8.5a6

The problem is not typing in greek, but pasting greek characters from
another program.
example:
If I select with mouse from terminal the greek word γειά and paste the
text in the message window of aMSN , I get the following:
\u03b3\u03b5\u03b9\u03ac

This is my first message here , please ask me for additional
information.
I am a newbie and I apologies if the solution is somewhere that I
didn't see...

nick...@yahoo.gr

unread,
Aug 13, 2007, 9:13:23 PM8/13/07
to
A better definition of the problem:
copy-paste(Ctrl-v) UTF-8 characters to a tcl/tk application on a Unix-
like OS

The only sure , is that there is no official solution:
http://www.cl.cam.ac.uk/~mgk25/unicode.html#x11

Donal K. Fellows

unread,
Aug 14, 2007, 9:53:17 AM8/14/07
to
nick_m...@yahoo.gr wrote:
> copy-paste(Ctrl-v) UTF-8 characters to a tcl/tk application on a Unix-
> like OS
>
> The only sure , is that there is no official solution:http://www.cl.cam.ac.uk/~mgk25/unicode.html#x11

Well, yes and no. The solution that has been fairly widely adopted was
to define a new format for data transfer on Unix that sent the data as
utf-8. Tk supports that format, and has done since 8.4 (a fair few
years now). The problem only comes when the other end of the transfer
*doesn't* support the format and instead sends random 8-bit data in an
unknown encoding through the STRING type. There's not much Tk can do
in that case except guess, and it may well not guess right. (I don't
remember if it uses the system encoding or if it guesses ISO 8859-1.)
Working around bugs in other applications is a mug's game...

Donal.

nick...@yahoo.gr

unread,
Aug 14, 2007, 9:36:47 PM8/14/07
to
I am a newbie so I stop here.I informed the aMSN developers and I hope
they will understand what to do.

Thank you for your time!

nick...@yahoo.gr

unread,
Aug 14, 2007, 9:38:10 PM8/14/07
to
I am a newbie ,so I stop here.I informed the aMSN developers and I

nick...@yahoo.gr

unread,
Aug 14, 2007, 9:38:49 PM8/14/07
to

Andreas Leitgeb

unread,
Aug 17, 2007, 7:47:51 AM8/17/07
to
Donal K. Fellows <donal.k...@man.ac.uk> wrote:
> nick_m...@yahoo.gr wrote:
>> copy-paste(Ctrl-v) UTF-8 characters to a tcl/tk application on a Unix-
>> like OS
>> The only sure , is that there is no official solution:http://www.cl.cam.ac.uk/~mgk25/unicode.html#x11
>
> unknown encoding through the STRING type. There's not much Tk can do
> in that case except guess, and it may well not guess right. (I don't
> remember if it uses the system encoding or if it guesses ISO 8859-1.)
> Working around bugs in other applications is a mug's game...

I don't think this is the right answer, since nick_meet wrote
that he got the string "\u03b3\u03b5\u03b9\u03ac", which looks
like the message has been correctly "understood" by aMSN, but
its displaying failed.
I wonder if each of these \u... things behave like single
characters, (which would indicate a font-problem) or if they
behaved like separate '\' 'u' '0' characters... (which would
hint towards some explicit malakia happening in amsn)

nick...@yahoo.gr

unread,
Aug 24, 2007, 6:56:07 AM8/24/07
to
Problem fixed in latest SVN version(r8963). They didn't save/retreive
data from the clipboard using UTF8 (undocumented). Thanks to dkf from
#tcl channel who pointed that out.

0 new messages