On 19/12/17 15:34, Ralf Goertz wrote:
> Am Tue, 19 Dec 2017 11:09:46 +0100
> schrieb David Brown <
david...@hesbynett.no>:
>
>> On 19/12/17 09:22, Ralf Goertz wrote:
>>>
>>> You say that probably because for your native language there is no
>>> difference between ASCII and UTF-8. For those of us less fortunate
>>> (or, to put it differently, with a richer character set in their
>>> language) it is a huge relief to have an encoding where you don't
>>> have to bother about code pages and stuff like that. And after
>>> getting used to it I found myself playing around with those other
>>> characters like „…“ (ellipsis in german style quotation marks) just
>>> because I can and it is so much nicer. Spoiled by using LaTeX, I
>>> guess.
>>
>> There is no problem in using UTF-8 where it is /useful/. The issue is
>> with /gratuitous/ use of extra characters that makes it needlessly
>> difficult for some people. Not everyone is lucky enough to have Linux
>> with good fonts, and some people have much more limited systems.
>
> Okay, but are you suggesting that those harmless characters are not
> available in other systems than linux? I don't talk about e.g. asian
> alphabets (which /I/ can't read anyway) but typographic characters.
> Can't we assume that in a computer oriented newsgroup like this people
> know how to have UTF-8 displayed correctly after so many years that it
> is around?
I have no idea what sort of proportions of users have trouble displaying
such typographic UTF-8 characters. But I do know that older systems can
have challenges. A lot of fonts have common Western European characters
(like Latin-9 characters), such as accented letters - but relatively few
have extra typographical characters. Modern gui software can do a fair
job of font substitution so that even if you are normally using a font
without these characters, they can show them from other fonts.
However, there are people out there with older systems, or older
newsreaders, or with particular font settings that might cause
limitations. Linux has supported UTF-8 and associated fonts for ages -
in the Windows world, it is more recent (Windows has supported a variant
of UTF-16 for nearly two decades, but good UTF-8 support and good fonts
are not as old. Remember, Windows XP is the third most popular OS in
the world!).
Is it going to bother anyone if you use UTF-8 typographical symbols? I
don't know - probably not, or at least not many people. Maybe I'm just
old-fashioned. Maybe the percentage of people that can't see these
UTF-8 symbols is so small by now that it is irrelevant.
>
>> The rule I suggest (and this is merely my opinion) is to use ASCII
>> when it suffices, and use UTF-8 for other cases. So if your surname
>> is actually spelt Görtz, then by all means write it that way - it is
>> your name, and should appear the way you want it to.
>
> No, it is actually spelt Goertz which itself is a consequence of not
> having around a "UTF-8 capable" typewriter when my father was born in
> the 1930s outside of Germany. He regretted that very much all his life
> whereas I was happy with it because it is easier when communicating with
> non-German people. But the ö is a good example. You need UTF-8 for this
> anyway. Okay there is iso*, but i you use non-ASCII characters anyway
> one should use UTF-8. Seeing how those characters get messed up when
> they are quoted by people not capable of handling them correctly makes
> me said. How must Öö feel.
The difference here is that you /don't/ need a large UTF-8 font to show
a ö character. You just need a program that understands UTF-8 encoding,
and you need a font with that has that symbol. And lots of fonts do -
fonts for Western European languages, that have been around since the
days of MS-DOS, have it. So someone who is running XP and likes "MS
sans serif" will have no problem with ö, but will have difficulty with
directional quotation marks.
>
>> However, using … instead of ... or « » instead of " " or << >> adds
>> nothing to the information in a post (unless the thread is about
>> typography and characters).
>
> To my mind it is useful to have distinct characters to open and close a
> quote. It helps clarity. It's not needed but it helps.
If it helps clarity and reading, then fair enough - that is a good
reason to use them. But make sure it really helps. For example,
Stefan's use of » « quotations does not, IMHO, help clarity at all - it
hinders it. Along with his non-standard indentation and quoting, and
stunningly ugly coding style, his ideas of what makes a post clear mean
that I rarely bother reading them.
> Much like your
> usage of two spaces after a sentence. This has always looked wrong for
> me. Try using \frenchspacing. ;-)
I learned to type on a typewriter - a mechanical one! (I am not /that/
old, and had been programming before I decided to teach myself
touch-typing properly.) Double spaces at the end of a sentence look
good to me, even though I know they are a bit old fashioned. (There is
some controversy as to whether "French spacing" refers to single or
double spacing at the end of a sentence. But we know that Knuth is
always right :-) )
> By the way using << and >> instead of
> the (french?) quotation marks feels even more wrong (is that the right
> comparative?). I associate those with much smaller/bigger.
I think you have missed a bit of your sentence here...
>
>> It is a different matter for documents where appearance is more
>> relevant. You don't want to use simple quotation marks in a LaTeX
>> document - people might think you wrote the document in Word, and that
>> would be awful! (That's not sarcasm - I really am that snobby about
>> typography and document appearance. But everything in its place.)
>
> I agree with you. Completely. But I think this /is/ the place.
>
Well, maybe we should end this off-topic sub-thread and start making
more use of UTF-8 characters in C++ posts. If it adds to readability
and people are happy viewing it, then I am quite ready to change my
mind. If people complain about funny symbols or question marks turning
up in the posts, then we will know that the c.l.c++ world is not yet
ready for UTF-8 !