On 2013/Nov/06 13:18, rusi wrote:
> On Tuesday, November 5, 2013 11:47:23 AM UTC+5:30, Lobachevsky wrote:
>
> <snip>
>
>> I know enough to run APL on my computer why things work is beyond
>> me. Phil Chastney has suggested there mat be a problem with the
>> encoding on the idiom page. As it does not seem to stop it working
>> for me I feel no incentive to spend time changing anything.
>
> Phil Chastney said:
>
>> according to my browser, that page is "encoded" in UTF-8
>>
>> that being so, the idiom should use codepoint U+2227 (rather than the
>> circumflwx accent), which would then display properly
>>
>> this is probably a minor editing error on that page
>
> If the encoding as UTF-8 is the error then what should it be??
> I believe he is saying that there is an editing error.
> Whether that is the case I can hardly comment since I am hardly an apl expert
> Basically Dyalog-apl and Gnu-apl dont seem to agree.
> Here a session with Gnu-apl
>
> 0 ∧ 1
> 0
> 0 ^ 1
> SYNTAX ERROR
>
> From what you are saying the second one works for you.
> Should this be so? I am not qualified to comment
there is nothing wrong with UTF-8, and there is nothing wrong with using
UTF-8 as an "encoding" -- UTF-8 is one of the great inventions of the
20th century
UTF-8 is actually a "transformation format", which maps Unicode values
in the range 0 to 2*20 or 2*21 or thereabouts, onto a succession of
8-bit values
this means a sequence of Unicode values can be mapped onto an existing
8-bit communication system, so that the original Unicode values can be
reconstructed at the receiving end
that being so, the next question is "what Unicode values should be used
to represent the APL character set?"
the big problem is that word "should" -- there is no standard
the nearest approach to a standard is still, I guess,
http://std.dkuug.dk/jtc1/sc22/open/n3067.pdf
and, personally, I would have expected APLWiki to work to that, but
they're under no obligation to do so
among the vendors, there is a broad consensus of how APL maps into
Unicode, but the consensus is by no means complete
one thing they all seem to agree on, however, is that input should be
tolerant, so that pasting A^B (as code!) into a session, should be
interpreted as A∧B (which should read: A and B )
I believe copying from APLWiki and pasting into Dyalog works OK, but YMMV
The Unicode Standard itself used to have a brief entry on Compliance, to
the effect that any string intended for onward transmission, should be
passed on unchanged -- principally, this meant comms kit shouldn't go
round stripping nulls the way they used to
the word "compliance" no longer appears in the standard's index, but
there's a whole chapter on Conformance -- basically, you can do what
you like with a string of Unicode characters, but if your software
includes routines for BiDi, line-breaking, collation, etc, you have to
follow Unicode's specs if you want to claim your software "Conforms" --
clearly, custom collation routines will be installed all over the
place, but nobody claims they're conformant, so that's all OK
it's just possible that using a spacing version of an accent as
equivalent to a mathematical operator may infringe the standard's rules
on preserving Character Identity -- but who cares? none of this stuff
is enforceable anyway
in brief, if APLWiki chooses to use circumflex for "and", they are free
to do so -- maybe C&P from APLWiki into your favourite interpreter
will work, and maybe not
that may be inconvenient, it may be undesirable, but it's not "wrong"
-- there is no _requirement_ for suppliers to accept incoming APLWiki code
well, that's the way I see it . . . /phil