Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

Unicode characters

1 view
Skip to first unread message

GameboyHippo

unread,
Feb 1, 2008, 10:52:09 AM2/1/08
to
I've googled around for this, but I can't seem to find the answer. I
want to insert into a database the following characters: руб These
are cryllic characters, by the way. All of my tools screw up when I
paste in these symbols. So is there a way I can tell mysql that I
want to insert in U+0440, U+0443, U+0431 without having to paste in
the actual characters or figure out how they are encoded in UTF-8?

GameboyHippo

unread,
Feb 1, 2008, 10:56:10 AM2/1/08
to
On Feb 1, 9:52 am, GameboyHippo <jasonwthomp...@gmail.com> wrote:
> I've googled around for this, but I can't seem to find the answer.  I
> want to insert into  a database the following characters: ÒÕÂ These

> are cryllic characters, by the way.  All of my tools screw up when I
> paste in these symbols.  So is there a way I can tell mysql that I
> want to insert in U+0440, U+0443, U+0431 without having to paste in
> the actual characters or figure out how they are encoded in UTF-8?

Sorry to double post, but as you can see, my characters got screwed up
on this post.

Peter H. Coffin

unread,
Feb 1, 2008, 11:12:50 AM2/1/08
to

Sounds like you need better tools. Which ones are you using? We'll not
recommend those.

For general "figuring out characters" tools, I've found

http://software.hixie.ch/utilities/cgi/unicode-decoder/utf8-decoder

fairly useful, though perhaps not for this purpose specifically.

--
Random Wisdom: Let a pizza fresh from the oven stand for several minutes
before cutting; it will reduce the amount of sauce running
out, and help prevent mouth burns.

GameboyHippo

unread,
Feb 1, 2008, 12:10:05 PM2/1/08
to
On Feb 1, 10:12 am, "Peter H. Coffin" <hell...@ninehells.com> wrote:
> On Fri, 1 Feb 2008 07:52:09 -0800 (PST), GameboyHippo wrote:
> > I've googled around for this, but I can't seem to find the answer.  I
> > want to insert into  a database the following characters: ÒÕÂ These

> > are cryllic characters, by the way.  All of my tools screw up when I
> > paste in these symbols.  So is there a way I can tell mysql that I
> > want to insert in U+0440, U+0443, U+0431 without having to paste in
> > the actual characters or figure out how they are encoded in UTF-8?
>
> Sounds like you need better tools. Which ones are you using? We'll not
> recommend those.
>
> For general "figuring out characters" tools, I've found
>
> http://software.hixie.ch/utilities/cgi/unicode-decoder/utf8-decoder
>
> fairly useful, though perhaps not for this purpose specifically.
>
> --
> Random Wisdom: Let a pizza fresh from the oven stand for several minutes
>                before cutting; it will reduce the amount of sauce running
>                out, and help prevent mouth burns.

I'm using mysql query browser. And I've tried doing it from the
command line via putty.

Peter H. Coffin

unread,
Feb 2, 2008, 11:20:48 AM2/2/08
to
On Fri, 1 Feb 2008 09:10:05 -0800 (PST), GameboyHippo wrote:
> On Feb 1, 10:12 am, "Peter H. Coffin" <hell...@ninehells.com> wrote:
>> On Fri, 1 Feb 2008 07:52:09 -0800 (PST), GameboyHippo wrote:
>> > I've googled around for this, but I can't seem to find the answer.  I
>> > want to insert into  a database the following characters: ÒÕÂ These
>> > are cryllic characters, by the way.  All of my tools screw up when I
>> > paste in these symbols.  So is there a way I can tell mysql that I
>> > want to insert in U+0440, U+0443, U+0431 without having to paste in
>> > the actual characters or figure out how they are encoded in UTF-8?
>>
>> Sounds like you need better tools. Which ones are you using? We'll not
>> recommend those.
>>
>> For general "figuring out characters" tools, I've found
>>
>> http://software.hixie.ch/utilities/cgi/unicode-decoder/utf8-decoder
>>
>> fairly useful, though perhaps not for this purpose specifically.
>
> I'm using mysql query browser. And I've tried doing it from the
> command line via putty.

I did some looking around and couldn't find any indication as to why it
SHOULDN'T be working correctly, but maybe the tables aren't set up
correctly for the data involved. What charset are the tables created in?
If they've been created with latin1, for example, trying to stick koi8r
data into the tables would get broken on the way into the table and show
a lot of substitution characters, and if the charset is utf8, koi8r data
could be broken either going in or coming out of the table.

The intent, given everything being set up correctly, is that character
conversion becomes very smooth, reliable, and almost transparent. When
data are not labelled correctly, or charset are not properly assigned is
when things become difficult and programming becomes a struggle against
the database and data, and people end up wanting to do things like
"figure out how they are encoded in UTF-8" as you say above. That sounds
to my ear like the table is in perhaps latin1, your output mechanism
(perhaps a browser) is being told that it is utf8, and nothing is being
easy for you.

The usual place to start with understanding all of this from a MySQL
perspective is

http://dev.mysql.com/doc/refman/5.0/en/charset.html

It's a quite good discussion of the topic in general, and (clearly) the
most important part of making everything work smoothly inside the
database.


--
100. Finally, to keep my subjects permanently locked in a mindless trance, I
will provide each of them with free unlimited Internet access.
--Peter Anspach's list of things to do as an Evil Overlord

Nikolaj Lazic

unread,
Feb 18, 2008, 12:31:38 PM2/18/08
to
Dana Fri, 1 Feb 2008 09:10:05 -0800 (PST), GameboyHippo <jasonwt...@gmail.com> napis'o:

> I'm using mysql query browser. And I've tried doing it from the
> command line via putty.

Is your translation in putty also set up to use utf-8?

0 new messages