Hi there,
sorry for the delay. To provide some concrete examples of some of the
problems, Maja prepared a spreadsheet with the mapping of MULTEXT-EAST
to GOLD, and suggestions for Croatian translations of the terminology.
You will find a HTML document and an OpenOffice spreadsheet at the
following URLs:
http://web.mac.com/dcavar/GOLD/multext-gold.html
http://web.mac.com/dcavar/GOLD/multext-gold.ods
The notation: we used for / in the GOLD column to describe the path
from the root to some concept. The sheet uses UTF8 encoding. It is
obvious that some of the missing direct counterparts can be
constructed as sets of GOLD concepts, but also some are missing as
"straight" and well-motivated concepts. For example the notion of
"Neuter" is not obvious in GOLD. It might be that the authors intended
to use some other type of discrimination for that. Maybe the
GenderProperty needs some review in general?
Then, is there some notion of Definiteness and Specificity defined in
GOLD?
The notion "Comparative" is not just a sub-property of adjectives, but
also other categories, e.g. adverbs. It should be an elementary
concept that can serve as a sub-feature of other concepts.
This is just a draft of the MULTEXT-EAST mapping. The MULTEXT-EAST
tagset is still missing in the columns, and some features we need are
not part of the tagset (partially part of GOLD though). We'll be
working on the tables and sheets, and place them on the web and post
here soon.
Wrt. Helen's recent post, are there any ideas about what the process
might look like, wrt. suggestions, changes, extensions etc.?
Maja & Damir