Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

Probelm with paste text from web pages

29 views
Skip to first unread message

twiki

unread,
Sep 14, 2012, 4:05:41 AM9/14/12
to
When i paste text from Opera/Firefox in a buffer
and i want to save this buffer...
i have always to solve this question:

[...]
These default coding systems were tried to encode text
in the buffer `test.txt':
(iso-latin-1-dos (72 . 8592) (143 . 8217) (222 . 8217) (251 . 8217)
(318 . 8217) (355 . 8217) (458 . 8217) (685 . 8217) (755 . 8217)
(960 . 8217) (1021 . 8230))
However, each of them encountered characters it couldn't encode:
iso-latin-1-dos cannot encode these: ...
[...]

How to paste text with apostrophes, accents, etc. for
to save in iso-latin1-dos?

THANX in ADVANCE!

Eli Zaretskii

unread,
Sep 14, 2012, 5:04:35 AM9/14/12
to help-gn...@gnu.org
> Date: Fri, 14 Sep 2012 01:05:41 -0700 (PDT)
> From: twiki <lap...@yahoo.it>
> Injection-Date: Fri, 14 Sep 2012 08:05:41 +0000
You can't, not without changing those characters to their equivalents,
and deleting those that don't have equivalents. These characters
simply don't exist in the Latin-1 character set.

Why do you want to save them in iso-latin1-dos? What's wrong with
utf-8-dos?

twiki

unread,
Sep 14, 2012, 6:31:29 AM9/14/12
to help-gn...@gnu.org
>
> Why do you want to save them in iso-latin1-dos? What's wrong with
>
> utf-8-dos?

Default coding system (for new files) in my Emacs installation is:

iso-latin-1-dos

maybe for my local-setup... moreover i've most files in iso-latin-1-dos!

twiki

unread,
Sep 14, 2012, 6:31:29 AM9/14/12
to gnu.ema...@googlegroups.com, help-gn...@gnu.org
>
> Why do you want to save them in iso-latin1-dos? What's wrong with
>
> utf-8-dos?

Peter Dyballa

unread,
Sep 14, 2012, 3:49:49 PM9/14/12
to twiki, help-gnu-emacs@gnu.org List

Am 14.09.2012 um 12:31 schrieb twiki:

> maybe for my local-setup... moreover i've most files in iso-latin-1-dos!

Then try to tell your browser to display the pages in ISO Latin-1! It might also be possible to tell your windowing system to convert everything copied to ISO Latin-1. I've heard (or rather read) of a so-called registry…

In using UTF-8 instead of ISO Latin-1 your saved files on disk will contain the same bytes for the characters in the US-ASCII range and two bytes for the 8-bit characters, for example accented characters. The typographical quotes you copy will be encoded by three bytes each.

It's worth to upgrade to UTF-8. We're not living in some limited country but in a global village.

--
Greetings

Pete

To most people solutions mean finding the answers. But to chemists solutions
are things that are still all mixed up.


0 new messages