In comp.infosystems.www.authoring.html message <jg97gm$7rs$1@dont-
email.me>, Tue, 31 Jan 2012 19:14:00, Jukka K. Korpela
<
jkor...@cs.tut.fi> posted:
>Years ago, I wrote a page about using national and special characters
>in HTML. At that time, the main problems were with character encodings.
>Things have changed a lot, and instead of updating the old document, I
>wrote a new one that focuses on font problems:
>
>
http://www.cs.tut.fi/~jkorpela/html/characters.html
>
>It's largely based on discussions in this group. Comments welcome.
cited.
"Entering characters"
It could be useful to state the range of character encodings. It is
fairly well-known that "ASCII or ANSI" goes up to "127 or 255", and that
Unicode foes up to 65535 = U+FFFF. But in fact Unicode goes higher, so
do browsers actually understand 𒍁 or $#x123412;, regardless of
the question of whether any installed font has them ?
JavaScript characters are restricted to \u0000 to \uFFFF; unless there's
something I've missed in ECMA 262 5; I doubt whether Unicode over U+FFFF
can be represented in current JavaScript, and I have no idea about
VBScript.
A link to the UTF-8 standard, and to a good description, might help.
You could add that if it is essential to use a character (whether or not
in Unicode) that is liable not to be displayed suitably as such by
readers' browsers, a small image styled for height in ex units can be
used.
I have a page which requires one Old French beta. That is in Unicode,
and below U+FFFF; and my present computer system displays it nicely. On
Old French Greek, it no doubt is seen as an Old French beta. But in an
English sentence, even in quotes, it looks too much like a 6, no actual
6 being nearby for comparison.
The printer of Euler's Collected Works had the same problem; his cases
held β but not ϐ. In E.327, the ϐ characters may
actually be individually hand-drawn; they lack sharpness of edge, and
seem to vary more than printed ones do. So I chose a nice one and, via
copy'n'paste and Windows Paste, made a little graphic of it. It looks
as much at home in the browser display as in the PDF image of the Works.
So styled, it matches its HTML neighbours in Zoom and Zoom Text Only.
IMHO, it's normally better to have a character which certainly looks to
be an instance of that character at about the right size than to have
one which may be perfect for style and size but is quite likely not to
show as that character at all.
"...such as \0000e9." - I know nothing of that format; but I only use
CSS1, since the CSS2 document is too big.
"Line spacing problems"
The first diagram needs more words. Generally, 29" is shown with a
normal line height, since ordinary people (British, Finns, etc.) get
those characters by pressing 2 9 Shift-2 and get ASCII 50 57 34, with
ordinary line height.
Evidently CSS font designation needs grammar for "anything but Cambria
Math".
An index at the top, linking to H2 elements, would help.
Since you wrote on ISO 8601, you may care to know that I have something
resembling MSDOS DIR but with options for yyyy-mm-dd, yyyy-Www-d, yyyy-
ddd, and time_t in milliseconds, days GMT, and days LCT. Google for
SEAKFYLE.
I write Web pages in 7-bit ASCII, characters HT, LF, CR, 20-126.
--
(c) John Stockton, nr London, UK. ?@
merlyn.demon.co.uk Turnpike 6.05 WinXP.
Web <
http://www.merlyn.demon.co.uk/> - FAQ-type topics, acronyms, and links.
Command-prompt MiniTrue is useful for viewing/searching/altering files. Free,
DOS/Win/UNIX now 2.0.6; see <URL:
http://www.merlyn.demon.co.uk/pc-links.htm>.