Thanks Stuart for the reply to Fryderyk.
The question and response highlight a slightly messy situation with
the multilingual version of JMdict.
My original intention was to have the glosses for all the languages in
the one relevant sense, and each tagged with the relevant attribute
(xml:lang="ger", etc.). Since the database is compiled by aggregating
the entries from a heap of bilingual sources, many of which did not
have multiple senses or did not follow the sense structure of the
Japanese-English JMdict, I applied an expedient of putting the glosses
for each language in one or more distinct senses. The only exception
(originally) was the Japanese-French file, which started off
sense-aligned with the JE version. Soon they began to drift apart, so
I changed the Japanese-French handling to be the same as the other
languages.
Ideally, I would like things to be as originally planned, but the task
of aligning the glosses is huge and is unlikely to happen any time
soon.
That brings me to language tags associated with the <info> and <s_inf> elements.
It would be great to cater for the possible implementation of the
original scheme for multilingual glosses. In such a situation, it's
quite possible that there would be <info> and <s_inf> elements
associated with entries in more than one language. I think the most
appropriate approach would be to have a separate element for each
language with the relevant tagging.
In terms of the DTD, I think this means we'd need something like:
<!ELEMENT info (#PCDATA)>
<!ATTLIST info xml:lang CDATA "eng">
and
<!ELEMENT s_inf (#PCDATA)>
<!ATTLIST s_inf xml:lang CDATA "eng">
The raw entry data might look a little odd, as the <info> and <s_inf>
elements could be some distance from the glosses they relate to, but
that's an issue for the apps and systems using the data.
HTH
Jim
> --
> You received this message because you are subscribed to the Google Groups "EDICT-JMdict" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to
edict-jmdict...@googlegroups.com.
> To view this discussion visit
https://groups.google.com/d/msgid/edict-jmdict/62b213ba-c188-4829-87a7-2cbce6f28bd6%40mtneva.com.