Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss
Groups keyboard shortcuts have been updated
Dismiss
See shortcuts

Understanding ASCII encoding across platforms?

96 views
Skip to first unread message

TRS-90

unread,
Dec 5, 2023, 9:09:55 PM12/5/23
to
Could anyone help me understand a text file? Like the most basic that works on every system since the 1960s? I realize that might not be possible and that's why ASCII was invented. I've read about UTF-8 and there are certainly more since then. I honestly don't even know what encoding is used to make this message on an Apple IIgs readable on Usenet. Is ASCII the most platform-independent? On modern systems I use VSCode, but I find even that program adds characters that show up as ? marks if I send the file to a IIgs for example.

Thank you for reading.

fadden

unread,
Dec 6, 2023, 12:19:27 PM12/6/23
to
On Tuesday, December 5, 2023 at 6:09:55 PM UTC-8, TRS-90 wrote:
> Could anyone help me understand a text file?

Run "iconv -l" to get a brief list of character encodings. (I see about 1100 on Ubuntu Linux.)

If you stick to ASCII, your text will be readable everywhere, but most non-English languages can't be represented with the ASCII character set. Modern systems use Unicode, often with UTF-8 encoding, which was designed so that ASCII text "just works".

The Apple IIgs uses a custom locale-specific character set, often Mac OS Roman. It is based on ASCII, but has additional characters for common Latin-derived languages, plus some math symbols.

TRS-90

unread,
Dec 6, 2023, 7:15:36 PM12/6/23
to
>"fadden" wrote:

> If you stick to ASCII, your text will be readable everywhere, but most
> non-English languages can't be represented with the ASCII character set.
> Modern systems use Unicode, often with UTF-8 encoding, which was
> designed so that ASCII text "just works".

Thank you, I've been re-typing historical articles from 1800s newspapers about the area I live in. Doing it on a IIgs. I didn't realize there are so many different encodings. Your reply was helpful.

mmphosis

unread,
Dec 7, 2023, 7:32:07 PM12/7/23
to
That iconv command is super helpful. Thank you!


My serial card is currently in slot 3 on the Apple II:

]IN#3
]0 get a$ : ? a$; : if a$ <> chr$(4) goto
]RUN


In the Terminal, on the Linux platform:

./mistral-7b-instruct-v0.1-Q4_K_M-main.llamafile --temp 0.7 -r '\n' -p
'Display the euro symbol.' | tee /dev/tty | iconv -f UTF-8 -t
ASCII//TRANSLIT | tr [:lower:] [:upper:] | tr '\n' '\r' > /dev/ttyUSB0

Display the euro symbol.
Answer: €


On the Apple II:

DISPLAY THE EURO SYMBOL.
ANSWER: EUR


Colin Leroy-Mira

unread,
Feb 1, 2024, 4:35:58 PM2/1/24
to
Hi,

>./mistral-7b-instruct-v0.1-Q4_K_M-main.llamafile --temp 0.7 -r '\n' -p
>'Display the euro symbol.' | tee /dev/tty | iconv -f UTF-8 -t
>ASCII//TRANSLIT | tr [:lower:] [:upper:] | tr '\n' '\r' > /dev/ttyUSB0
>
>Display the euro symbol.
>Answer: €
>
>
>On the Apple II:
>
>DISPLAY THE EURO SYMBOL.
>ANSWER: EUR

On a related note about iconv and Apple II,

1) For international Apple IIs, the charset are:
French: ISO646-FR1
Spanish: ISO646-ES
Italian: ISO646-IT
German: ISO646-DE

You can use iconv -f UTF-8 -t ISO646-FR1//TRANSLIT in the same manner.

2) By the way, glibc 2.39, released yesterday, contains a little patch
of mine that translits (some) emojis to ASCII:

root@a2proxy:~# echo "😉" | iconv -f UTF-8 -t ASCII//TRANSLIT
;-)

I wrote it so that my Mastodon client, which relies on a proxy for
network access, json parsing and charset change, could display common
emojis!
--
Colin
https://www.colino.net/

0 new messages