does anybody know where i find a list with things like that?
thx!!!
niko
The %20 stuff is ascii, and is only used in URLs for characters that are
forbidden, like spaces, slashes, etc. The number is the value of the ascii
character in hex. You should have no trouble finding all the values in a
net search on the term ascii.
The accented characters are different, since they do not exist in true
ascii. You need to create these as entities. The most common entitity is
which produces a non-breaking space. Other entities are:
ä or ä for the a-umlaut
ö or ö for the o-umlaut and
ß or szlig; for the german double s
I have a book listing all these, if you can't find a chart on the net, make
one yourself, using the codes from 32 to 255 for the most common entities.
(I find the numbers work on more browsers than the alphabetic ones anyway).
There are additional entities at 402, 913 to 982, 8226 to 9830 (with a lot
of missing mumbers in the range.
Don McCahill
Internet Graphic Design
Westervelt College
London, Canada
--
Posted via CNET Help.com
http://www.help.com/
>for a space you can write %20
In a Uniform Resource Locator, or URL for short, yes. And indeed I
_must_, in order to comply with specifications (and to make things
work on different browsers). See
http://www.hut.fi/u/jkorpela/HTML3.2/3.5.html#urlenc
for a summary of the encoding rules.
>but what can you write for a ä or a ö or a
The URL syntax does not allow ä or ö at all, unencoded or encoded.
When naming resources to be put onto the Web, don't use such
characters. But if you have to refer to resources with non-ASCII
characters in their names, the best shot is probably to use a method
analogous with URL encoding, using ISO 8859-1 instead of ASCII. That
would mean =E4 for ä (a umlaut) and =F6 for ö (o umlaut). Whether this
works depends on both the server and the browser. I'm a bit surprised
at having seen that it works rather often. I haven't conducted
systematic tests to see how widely the unencoded characters work too.
>does anybody know where i find a list with things like that?
The URL encoding is simple: %hh where hh is the ASCII code of a
character, in two hexadecimal digits. There are lots of references
which give you the codes, though you need to be cautious - not all
people consult the ultimate references (standards) before making their
own "reference" documents available to the public.
My http://www.hut.fi/u/jkorpela/latin1/2.html lists first the numeric
codes for ASCII characters (up to 126 decimal, 7E hexadecimal), then
the codes for the non-ASCII characters in ISO 8859-1.
--
Yucca, http://www.hut.fi/u/jkorpela/ *** Happy 19100 to all! ***