JMdict furigana issues


Jim Breen

Jun 15, 2021, 7:15:16 PM
to edict-...@googlegroups.com
I just started a new thread for this topic. The archiver software
(MHonArc) cunningly grouped it with the Examples thread, so if you are
looking at the thread index at
https://www.edrdg.org/jmdict_edict_list/2021/threads.html you'll need
to scroll up a bit.

Jim

--
Jim Breen
Adjunct Snr Research Fellow, Japanese Studies Centre, Monash University
http://www.jimbreen.org/
http://nihongo.monash.edu/

AbleMind

Jul 6, 2021, 3:30:31 PM
to edict-...@googlegroups.com
In reference to the conversation on adding JMdict ids to entries in the JmdictFurigana output:

Doublevil (author of JmdictFurigana) chimed in on the issue created by Ahmed. He points out that adding the ids should be trivial and outlines how you would go about determining ids for any given entry. Since the process is so straightforward, he questions the need to include the ids at all.

I haven't used the output file in any of my projects (though I am going to test it out soon) so I can't say for myself, but there may be a use case for someone here?

-
Cameron Chambers


Jim Breen

Jul 6, 2021, 11:11:55 PM
to edict-...@googlegroups.com
Doublevil didn't actually say the task was trivial. I'd classify it as
straightforward for about 99% of the entries, and messy for the other
1%.

He/she/it also said: "but I'm not sure why they would be needed in the
first place".
My interest was to add the derived furigana as an option to the
WWWJDIC server. To do this I'd need to align the dictionary entries
with the JmdictFurigana data. The alignment would be simple if the
sequence numbers were in both streams. Otherwise, I'd have to do a
(daily) kanji/reading alignment as I use the latest dictionary
version. Not impossible, but messy when many entries have multiple
kanji forms and readings.
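
To make the difference concrete, here is a rough Python sketch of the two
alignment strategies. It is only an illustration under assumptions: the
record layouts, the field names ("seq", "kanji", "readings", "text",
"reading"), the placeholder sequence numbers and both function names are
made up for this example, and the "seq" field on the furigana rows is
exactly the addition being discussed, not something the current
JmdictFurigana output is known to contain.

```python
from collections import defaultdict

# Hypothetical sample data. The sequence numbers are placeholders, and the
# "seq" key on the furigana rows is the proposed (not existing) field.
ENTRIES = [
    {"seq": 1, "kanji": ["明日"], "readings": ["あした", "あす"]},
    {"seq": 2, "kanji": ["食べる", "喰べる"], "readings": ["たべる"]},
]
FURIGANA = [
    {"seq": 1, "text": "明日", "reading": "あした"},
    {"seq": 1, "text": "明日", "reading": "あす"},
    {"seq": 2, "text": "食べる", "reading": "たべる"},
    {"seq": 2, "text": "喰べる", "reading": "たべる"},
]


def align_by_seq(entries, furigana):
    """Join on a shared sequence number: one grouping pass, one lookup per entry."""
    grouped = defaultdict(list)
    for row in furigana:
        grouped[row["seq"]].append(row)
    return {e["seq"]: grouped.get(e["seq"], []) for e in entries}


def align_by_pair(entries, furigana):
    """Join without ids: try every kanji/reading combination of each entry."""
    by_pair = defaultdict(list)
    for row in furigana:
        by_pair[(row["text"], row["reading"])].append(row)
    aligned = {}
    for e in entries:
        rows = []
        for kanji in e["kanji"]:
            for reading in e["readings"]:
                # A pair shared by two different entries would match both here.
                rows.extend(by_pair.get((kanji, reading), []))
        aligned[e["seq"]] = rows
    return aligned
```

On this small sample both functions return the same grouping. The
pair-based version only stays correct while each kanji/reading pair belongs
to a single entry and every combination is valid, which is presumably where
the messy cases would come from.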

Jim