just wondering ...
Contrary to a common misconception ASCII is not really sufficient even
for English, which is why TeX has:
resum\'e n\'ee
fa\c cade
No\"el
belov\`ed
medi\ae val [a bit old-fashioned]
r\^ole
`` '' [should be single characters really]
Other languages with a Latin alphabet in which diacritics play a similarly
minor r\^ole are Indonezian, Dutch and a lot of African languages.
How about Latin itself? More-or-less dead, but natural nonetheless.
--
Peter Castine | Dr. Who quote du jour:
pcas...@prz.tu-berlin.de | Have you noticed the way people's
===========================| intelligence capabilities decline sharply
``Have Mac, will travel'' | the minute they start waving guns around?
: just wondering ...
I just asked my collegues, and they said dutch, italian and maybe portugese!
Stephan
--
We don't inherit the earth from our parents,
we borrow it from our children!
+-----------------------------------------------------------------------+
| Stephan Hoffmann Mail: Werner-von-Siemens-Str. 60 D-91052 Erlangen |
| ANL A433SI Voice: +49 9131 7 43192 Fax: +49 9131 7 43303 |
| Siemens AG EMail: stephan....@erlm.siemens.de |
+-----------------------------------------------------------------------+
I'm not sure what the standards in the UK are for these words, but I
know that in the US it's acceptable in most mediums to leave off accents
and ligatures, and the `` and '' characters can both be represented by
the same character, i.e. ". All the examples you gave are generally
excused in anything but formal writing and bookprinting (which is what
TeX is great for, in addition the scientific and mathematical texts).
I think that the thrust of the original post was that the author,
pprk...@ecx.tuwien.ac.at (numath), was perhaps a little upset by the
fact that ASCII is the standard in even countries whose languages
require much more.
But is this what this group is all about? I'd love to know....
--
Randall Nortman (aka Bob the WonderClown)
University of Chicago
rcno...@midway.uchicago.edu
My opinions are just that - my opinions.
There are lots of accents in Italian and portuguese (plus the tilde in
the latter case).
Pierre
--
Pierre Jelenc * A day can really slip by when you're deliberately *
rc...@panix.com * avoiding what you're supposed to do. =Calvin= *
...I'd thought the ASCII adoption was due to portability... it is
a skene between '0' and '1' strings and, as japanese refer to it, romanji.
The inclusion, or better, exclusion of accents and ligatures is an
interesting issue however: how many words are spelled(sic) with a 'j' or
'q' in Italian? In German?
I have seen chinese terminals... but given the tremendous
number of "characters" in Chinese... is that similarly limited?
I'd say this whole discussion is natural. Appropriate? Yup.
--
O0O0O0O0O0O0O0O0O0O0O0O0O0O0O0O0O0O0O0O00O0O0O0O0O0O0O0O0O0O0O0O0O0O0O0O0OO0
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
S T O N E David Carrigan 0 ver...@panix.com
is there any language using latin derived letters that can _not_ do
with the LATIN1(=ISO8852-1 = MS-windows(?)) character set without transcribing?
>just wondering ...
mee too ...
Anders H
ps. esperanto is one. Or is it one to one hyphen(latin1) <--> hat(esperanto)?
: >is there any other language than English and equals that can do
: >with the US-character-set and without transcribing (like "u or ue for
: >the umlaut u) ?
: is there any language using latin derived letters that can _not_ do
: with the LATIN1(=ISO8852-1 = MS-windows(?)) character set without transcribing?
: ps. esperanto is one. Or is it one to one hyphen(latin1) <--> hat(esperanto)?
ISO Latin-1 (that is *ISO-8859-1*. You got the number wrong.) Is
currently the standard for internet news and mail, although it is
mostly used by people in foreign newsgroups because there are a lot of
outdated machines on the American internet (for example, DEC's
sendmail has a problem with this). ISO-8859-1 was designed for Western
European languages. A lot of eastern european languages use
characters that are not in Latin-1 (e.g. Polish. the native tongue of
the inventor of Esperanto).
As a side note, I've seen the Russian one, (Latin-7 ??) appear on
soc.russian.
Personally I think that Unicode would be best, but there aren't very
many editors for unicode out there. Unicode has fonts for all
languages, and it is IPA to keep linguists happy...
Steve Dunham
dun...@gdl.msu.edu
: But is this what this group is all about? I'd love to know....
hmmm, still hits the topic better than "what is while() ?"
Yes, indeed, the slashed L is not there! But why didn't they
get that in place of all the Donald Duck characters 162-190(decimal)
i find on my machine?
AH
>is there any language using latin derived letters that can _not_ do
>with the LATIN1(=ISO8852-1 = MS-windows(?)) character set without transcribing?
>>just wondering ...
>mee too ...
Vietnamese.
Bambang N. Prastowo (pras...@qucis.queensu.ca, grad. student)
Software Technology Laboratory, Dept. Computing & Info. Sci.
Queen's Univ., Kingston, Ontario, Canada K7L 3N6
: Steve Dunham
: dun...@gdl.msu.edu
: As a side note, I've seen the Russian one, (Latin-7 ??) appear on
: soc.russian.
As a side note, here is written Chinese using ZWDOS:
~{DcCG:C#?~} But I live and work in French, and functions like
toupper() just don't work well on the French language because of the
accents.
Russell Holmes
*******************************************************************************
| I would add a snazzy line here, but it's all been said :( |
*******************************************************************************
SD> A lot of eastern european
SD> languages use characters that are not in Latin-1 (e.g. Polish. the
SD> native tongue of the inventor of Esperanto).
Most of the eastern european characters are in latin-2 (ISO 8859-2),
and Esperanto characters are in latin-3.
SD> As a side note, I've seen the Russian one, (Latin-7 ??) appear on
SD> soc.russian.
Cyrillic characters (including Russian) are in latin-5.
--
TAKAHASHI Naoto
Electrotechnical Laboratory, Japan
ntak...@etl.go.jp
they also don't work well because of the french convention of omitting
the accent on capital letters. this practice is a more serious problem
for an upper-to-lower conversion routine that the actual writing of the
code.
supposedly, it is now becoming acceptable to put accents on upper-case
letters.
-gary
That is the fault of your C-compiler. (What are you using?) You
might want to look into how your compiler implements toupper, many
(including, I believe, GCC) use a macro and a 256 byte array , which
you could change.
Steve Dunham
dun...@gdl.msu.edu
: SD> A lot of eastern european
: SD> languages use characters that are not in Latin-1 (e.g. Polish. the
: SD> native tongue of the inventor of Esperanto).
: Most of the eastern european characters are in latin-2 (ISO 8859-2),
: and Esperanto characters are in latin-3.
: SD> As a side note, I've seen the Russian one, (Latin-7 ??) appear on
: SD> soc.russian.
: Cyrillic characters (including Russian) are in latin-5.
I've been looking around for the specification of all of the iso-latin
fonts, but I can't find it. I think I saw it in an RFC once... I've
also seen mention of escape codes that are supposed to signal
switching from one font to another, does anyone know anything about
this?
Steve
dun...@gdl.msu.edu
Cymraeg (Welsh) is not fully supported by LATIN1
--
=============================================================================
Sean V. Kelley ________ __o B'aite liom fe/in bheith ar
Lockheed Corp. _____ _`\<,_ thaoibh mhalai/ shle/ibhe,
kel...@ede.sanders.lockheed.com ___ (*)/ (*) Agus caili/n gaelach
a bheith 'mo chomhair
Even in Dutch there are quite a number of words requiring a so-called 'trema' (typographically comparable with an umlaut): e.g. financi"ele.
Miek Dekeyser.