Hi Mike,
On Tue, Dec 14, 2021 at 7:50 PM Mike Dowd <
mike...@gmail.com> wrote:
>
> For the moment I am focusing on the British words that have an American equivalent in the dictionary, until I have more experience. But I'm taking notes on the words that don't have the American counterpart in the dictionary. As well as other discoveries (e.g. a noun in the dictionary is categorized uncountable, but there is also a countable definition for the word as well).
Yes, it's an adventure.
The distinctions between countable and uncountable nouns are as you
can guess: For example:
countable noun:
"the tree"
"a tree"
mass noun:
"the sand"
*"a sand"
There are some nouns which are both (I can't think of any off the top
of my head). A conventional dictionary will assign these different
word-senses, and explain why they're different.
The link-grammar subscripts (the .n .n-u .s) are enforced debugging
aids. If you list the same word twice in the LG dictionary, it will
complain. This is useful for avoiding mistakes, as otherwise it can be
very hard to understand why some parse is coming out the way it is.
Yet, often, one *does* want to list the same word in multiple places,
and the subscripts provide the mechanism to do so.
The use of subscripts in the LG dictionary is haphazard: they do not
play any grammatical role whatsoever, they are only debugging aids.
Thus, there aren't really any rules for how they should be used, other
than they shouldn't be misleading (don't use the .v subscript for
nouns.)
Note: it is easy to get the impression that the subscripts identify
"parts of speech". This is incorrect. (They are debugging aids). The
actual part-of-speech is done with connectors/links:
sand: Dmu- & ...;
tree: Ds**c- & ...;
trees: Dp- &..
The part of speech is encoded in those connectors: verbs never have
"D". Mass nouns get an "m". singular get "s", plural get "p", and
these rules are strict and uniformly applied: they determine the
grammar; the parser ensures that the link types are satisfied.
I mentioned "conventional dictionaries" and "word senses". The
connectors/links on words correlate strongly with word-senses. So
strongly, that, in fact, you can fish out word senses simply by
looking at the links on a word, and get pretty decent accuracy. I
tried this with a combo of LG+WordNet a decade ago, and hit 70%
accuracy right out the gate. Anyway, the lesson here is that if you
want to know about the "part of speech" of a word, do NOT look at the
subscripts; look at the linkages instead.
- Linas