what's a "conversion error" and how do I correct it?

eNG1Ne

unread,

May 22, 2011, 6:52:52 AM5/22/11

to vim_use

Working on a Linux box (Ubuntu 10.4), I've successfully copy/pasted a
block of text from a Planmaker spreadsheet into a vim file. The text
includes U+2012 dashes, which are correctly displayed in vim … but
when I try to save the vim file, I get the message "conversion error".

Probably related, but when I used :dig to try and find the code for
the U+2012 dash (so I could use search/replace) I couldn't spot one.
Just out of curiosity, what do the numbers in the digraph reference
page refer to?

Thanks in advance for helping me out.

Christian Brabandt

unread,

May 22, 2011, 7:00:57 AM5/22/11

to vim_use

Hi eNG1Ne!

On So, 22 Mai 2011, eNG1Ne wrote:

> Working on a Linux box (Ubuntu 10.4), I've successfully copy/pasted a
> block of text from a Planmaker spreadsheet into a vim file. The text
> includes U+2012 dashes, which are correctly displayed in vim … but
> when I try to save the vim file, I get the message "conversion error".

I guess, your fileencodings setting (notice the plural) does not include
utf-8, so Vim does not try to save it with that encoding. It probably
either tries to save it using plain old ASCII encoding or something like
latin1, which does not include this char and therefore conversion fails.

You should fix your 'fencs' setting to something like
ucs-bom,utf-8,default,latin1 or you can force Vim to save it in utf-8
encoding, by issuing :w ++enc=utf8 filename. (It might be, that this
needs the +multi_byte feature, which is only enabled, when compiling at
least a big version of Vim).

> Probably related, but when I used :dig to try and find the code for
> the U+2012 dash (so I could use search/replace) I couldn't spot one.
> Just out of curiosity, what do the numbers in the digraph reference
> page refer to?

The decimal number for that unicode char.

regards,
Christian

Ben Fritz

unread,

May 22, 2011, 2:55:14 PM5/22/11

to vim_use

On May 22, 6:00 am, Christian Brabandt <cbli...@256bit.org> wrote:
> Hi eNG1Ne!
>
> On So, 22 Mai 2011, eNG1Ne wrote:
>
> > Working on a Linux box (Ubuntu 10.4), I've successfully copy/pasted a
> > block of text from a Planmaker spreadsheet into a vim file. The text
> > includes U+2012 dashes, which are correctly displayed in vim … but
> > when I try to save the vim file, I get the message "conversion error".
>
> I guess, your fileencodings setting (notice the plural) does not include
> utf-8, so Vim does not try to save it with that encoding. It probably
> either tries to save it using plain old ASCII encoding or something like
> latin1, which does not include this char and therefore conversion fails.
>
> You should fix your 'fencs' setting to something like
> ucs-bom,utf-8,default,latin1

The 'fileencodings' option is what Vim uses to detect file encoding
when *reading* a file. When writing, Vim uses the current setting of
'fileencoding' (*without* the 's' at the end) as the encoding in which
to write the file.

If the file did not originally have Unicode characters in it, quite
possibly it was not detected as Unicode, so 'fileencoding' will be set
to something else (from the 'filencodings' option) or it will be
empty. If 'fileencoding' is empty, the value of 'encoding' is used
instead.

At least, this is how I understand it. Is there some situation I'm
missing in which 'filencodings' (with the 's') is relevant to a write
operation?

> or you can force Vim to save it in utf-8
> encoding, by issuing :w ++enc=utf8 filename.

This is true, but you can also do a

:setlocal fileencoding=utf-8

before saving.

> (It might be, that this
> needs the +multi_byte feature, which is only enabled, when compiling at
> least a big version of Vim).
>

Yes, I'm pretty sure it does require +multi_byte.

See our current featured tip on working with Unicode in Vim:

http://vim.wikia.com/wiki/Working_with_Unicode

Christian Brabandt

unread,

May 23, 2011, 1:47:14 AM5/23/11

to vim_use

Hi Ben!

On So, 22 Mai 2011, Ben Fritz wrote:

> > You should fix your 'fencs' setting to something like
> > ucs-bom,utf-8,default,latin1
>
> The 'fileencodings' option is what Vim uses to detect file encoding
> when *reading* a file. When writing, Vim uses the current setting of
> 'fileencoding' (*without* the 's' at the end) as the encoding in which
> to write the file.

Oh yes true. I got confused.

regards,
Christian

Erik Christiansen

unread,

May 23, 2011, 4:59:13 AM5/23/11

to vim...@googlegroups.com

On 22.05.11 11:55, Ben Fritz wrote:
> On May 22, 6:00�am, Christian Brabandt <cbli...@256bit.org> wrote:
> > or you can force Vim to save it in utf-8
> > encoding, by issuing :w ++enc=utf8 filename.
>
> This is true, but you can also do a
>
> :setlocal fileencoding=utf-8

Thank you both! I've also been irritated by occasional write failure due
to conversion error, after pasting text to vim. A quick overwrite of the
offending characters in vim has always cured the problem. Yesterday it
was a weird minus sign. I see now that the file is latin1.

Presumably the old and new encodings sync after setting fileencoding=utf-8,
so I wouldn't still have two kinds of '-' ? (I have no idea how many
different minus signs and hyphens are included amongst utf-8 multibyte
characters.)

Erik

--
Forum moderator: Did you take the [End of the world on 21.05.11] doomsday prediction seriously?
Contributor: Bugger, now I have to go to work tomorrow.
- Seen on ABC website on 22.05.11

Ben Fritz

unread,

May 23, 2011, 10:48:27 AM5/23/11

to vim_use

On May 23, 3:59 am, Erik Christiansen <dva...@internode.on.net> wrote:
> On 22.05.11 11:55, Ben Fritz wrote:
>
> > On May 22, 6:00 am, Christian Brabandt <cbli...@256bit.org> wrote:
> > > or you can force Vim to save it in utf-8
> > > encoding, by issuing :w ++enc=utf8 filename.
>
> > This is true, but you can also do a
>
> > :setlocal fileencoding=utf-8
>
> Thank you both! I've also been irritated by occasional write failure due
> to conversion error, after pasting text to vim. A quick overwrite of the
> offending characters in vim has always cured the problem. Yesterday it
> was a weird minus sign. I see now that the file is latin1.
>
> Presumably the old and new encodings sync after setting fileencoding=utf-8,
> so I wouldn't still have two kinds of '-' ? (I have no idea how many
> different minus signs and hyphens are included amongst utf-8 multibyte
> characters.)
>

They will "sync" only if your 'fileencodings' and 'encoding' options
are set in a way that the proper encoding is detected when reading the
file.

See the help for each option, and also http://vim.wikia.com/wiki/Working_with_Unicode
as mentioned before.

Something I've also found useful for those times Vim cannot properly
recognize the encoding by itself, is the AutoFenc plugin:

http://www.vim.org/scripts/script.php?script_id=2721

I only use it for those cases where the encoding is specified in the
file text (like in many HTML documents), but I know there's also an
option to use an external tool to determine encoding based on file
content.

eNG1Ne

unread,

Oct 6, 2011, 6:11:20 AM10/6/11

to v...@vim.org

Thanks! just the information I needed. Now to try and remember it ...

--
View this message in context: http://vim.1045645.n5.nabble.com/what-s-a-conversion-error-and-how-do-I-correct-it-tp4416508p4875850.html
Sent from the Vim - General mailing list archive at Nabble.com.

Tony Mechelynck

unread,

Oct 8, 2011, 12:59:02 AM10/8/11

to vim_use

On 22/05/11 13:00, Christian Brabandt wrote:
> Hi eNG1Ne!
>
> On So, 22 Mai 2011, eNG1Ne wrote:
>
>> Working on a Linux box (Ubuntu 10.4), I've successfully copy/pasted a
>> block of text from a Planmaker spreadsheet into a vim file. The text

>> includes U+2012 dashes, which are correctly displayed in vim ï¿½ but

>> when I try to save the vim file, I get the message "conversion error".
>
> I guess, your fileencodings setting (notice the plural) does not include
> utf-8, so Vim does not try to save it with that encoding. It probably
> either tries to save it using plain old ASCII encoding or something like
> latin1, which does not include this char and therefore conversion fails.
>
> You should fix your 'fencs' setting to something like
> ucs-bom,utf-8,default,latin1 or you can force Vim to save it in utf-8
> encoding, by issuing :w ++enc=utf8 filename. (It might be, that this
> needs the +multi_byte feature, which is only enabled, when compiling at
> least a big version of Vim).
>
>> Probably related, but when I used :dig to try and find the code for
>> the U+2012 dash (so I could use search/replace) I couldn't spot one.
>> Just out of curiosity, what do the numbers in the digraph reference
>> page refer to?
>
> The decimal number for that unicode char.
>
> regards,
> Christian
>

Conversely, when reading, Vim will not give a "conversion error" message
if there is an 8-bit encoding at the end of your 'fileencodings', but it
can still fail to recognize the actual encoding used: for instance, with
:set fencs=ucs-bom,utf-8,latin1 which is a "good" setting for people in
"Western" countries like I am, try to read a file containing Japanese
encoded in Shift-JIS, Traditional Chinese in Big5, Simplified Chinese in
GB18030, or even Russian in KOI8-R, and the text will look like
gibberish, because Vim saw no BOM, saw correctly that the text wasn't
UTF-8 on disk, and fell back on Latin1. In that case you need to tell
Vim the actual encoding of the file (and if you don't know it, maybe try
several possible ones, proceeding by trial and error), by adding a ++enc
modifier to your :e or :view command, for instance

:view ++enc=sjis example.txt

and (assuming that 'encoding' is already set to utf-8) Vim will then (if
compiled with +iconv, or with +iconv/dyn and the iconv or libiconv
library is available) happily translate the shift-JIS into the UTF-8
used internally.

About digraphs: 0x2012 == 8210 and I see no digraph for that, but you
could use Ctrl-V u 2012 (without the spaces, see :help i_CTRL-V_digit
which also applies in command-line mode) or make your own digraph (but
try to use something which is not already in use).

Best regards,
Tony.
--
hundred-and-one symptoms of being an internet addict:
195. Your cat has its own home page.

Reply all

Reply to author

Forward