WWWJDIC and names with macrons


Jim Breen

Jul 9, 2021, 3:45:30 AM
to edict-...@googlegroups.com
I've implemented a fiddle with WWWJDIC and the names dictionary so
those name entries containing vowels with macrons in the "English"
part can be searched using keys without macrons.

For example, an entry such as:
あたご町 【あたごちょう】 (p) Atagochō
can be found using either Atagochō or Atagocho as the search key.
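
A minimal sketch of how this kind of macron-insensitive lookup might work. It is not WWWJDIC's actual code, and the function names `strip_macrons` and `matches` are hypothetical. It assumes the macronned vowels can be decomposed with Unicode normalization, so that the combining macron (U+0304) can be dropped:

```python
import unicodedata

COMBINING_MACRON = "\u0304"

def strip_macrons(text: str) -> str:
    """Return text with macrons removed, e.g. 'ō' becomes 'o'."""
    # NFD splits 'ō' into 'o' followed by the combining macron.
    decomposed = unicodedata.normalize("NFD", text)
    without = "".join(ch for ch in decomposed if ch != COMBINING_MACRON)
    # Recompose so other characters (e.g. voiced kana) are left as they were.
    return unicodedata.normalize("NFC", without)

def matches(key: str, romanized: str) -> bool:
    """True if key equals the romanized form with or without its macrons."""
    return key == romanized or key == strip_macrons(romanized)

print(matches("Atagocho", "Atagochō"))
print(matches("Atagochō", "Atagochō"))
```

Because the stripping happens only at lookup time, the stored entry keeps its macrons.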

Note that the contents of the distributed JMnedict/enamdict files have
not been changed. This fiddle is local to WWWJDIC. Other sites and
apps can do their own things.

I'm planning to progressively replace the old "ou" and "uu"
romanizations with ō and ū as appropriate.

Jim

--
Jim Breen
Adjunct Snr Research Fellow, Japanese Studies Centre, Monash University
http://www.jimbreen.org/
http://nihongo.monash.edu/

Marcella M. MARIOTTI

Jul 13, 2021, 10:09:43 AM
to edict-...@googlegroups.com, Mantelli Alessandro, Nicolò Di Domenico, Giuseppe Garassino
Dear Jim
this is great news! Could you do it automatically?
We are still working on it...
Ciao,
Marcella


Jim Breen

Jul 13, 2021, 9:02:43 PM
to edict-...@googlegroups.com, Mantelli Alessandro, Nicolò Di Domenico, Giuseppe Garassino
Hi Marcella,

> this is great news! Could you do it automatically?
> We are still working on it...

I'm guessing you are referring to my comment "I'm planning to
progressively replace the old "ou" and "uu" romanizations with ō and
ū as appropriate."?

This is happening only in the names dictionary; not in JMdict/EDICT,
where there are relatively few romanized terms and they need to be
handled on a case-by-case basis.

I'm converting the name romanizations in batches, doing the easiest
ones first. For example, I just changed the ~峠/~touge ones to ~tōge -
there were about 2,500 of them. I have some scripts for doing this.
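
The scripts themselves are not shown in this thread. As a rough sketch only, a batch rule of the ~峠/~touge kind could look like the following. It assumes the one-line entry layout shown earlier (headword, reading in 【】, then the romanized part); the pattern, the function name `convert_touge`, and the sample line are illustrative and not taken from the actual files:

```python
import re

# Hypothetical rule: only touch entries whose headword ends in 峠 and whose
# reading ends in とうげ, so an unrelated "touge" elsewhere is left alone.
LINE_RE = re.compile(r"^(?P<head>\S*峠) 【(?P<reading>\S*とうげ)】 (?P<rest>.*)$")

def convert_touge(line: str) -> str:
    """Rewrite a word-final 'touge' as 'tōge' in matching entries."""
    m = LINE_RE.match(line)
    if m is None:
        return line  # not a ~峠 entry: leave unchanged
    rest = re.sub(r"touge\b", "tōge", m.group("rest"))
    return f"{m.group('head')} 【{m.group('reading')}】 {rest}"

# Illustrative sample line, not copied from the distributed files.
print(convert_touge("碓氷峠 【うすいとうげ】 (p) Usuitouge"))
```

Tying the rule to both the kanji and the kana reading is one way to keep a batch "easy": entries that do not fit the pattern are skipped and can be reviewed by hand later.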

Cheers