On 8 November 2013 19:06, Darren Cook <dar...@dcook.org> wrote:
> Ping! Is this group still alive? (Is there now a better place to discuss
> Japanese NLP questions, in English?)
It never really took off as a group/list. What little discussion there is in
English seems to happen on the Corpora list.
> Question of the day: is MeCab still the NLP tool of choice? (The MeCab
> site appears to be alive, and has a comparison with ChaSen, JUMAN and
> KAKASI, but the link to Chasen goes to a page last updated in 2007, the
> JUMAN link is a 404, and the KAKASI linked-to page was last updated in 2004,
> so I think that comparison table hasn't been touched in 6 years...)
I would like to think so, since I use MeCab daily. It has been updated several
times in the last 12 months. It is also keeping in step with the latest
versions of UniDic, which I think is the lexicon of choice for morphological
analysis.
There seems to have been a new version of JUMAN recently, and I have heard
a comment that it's good, but I haven't explored it much.
> Any other Japanese NLP tools that are being developed that are worth a look?
Have a look at http://cl.naist.jp/~eric-n/ubuntu-nlp/ if you haven't already.
Things like yamcha and cabocha may be useful. There are similar things in the
JUMAN camp.
There's also kuromoji, which has been getting interest because it's written in
Java and can run on Android phones and the like. I think it uses a compressed
version of IPADIC, which puts me off it.
Cheers
Jim
--
Jim Breen
Adjunct Snr Research Fellow, Japanese Studies Centre, Monash University