At 14:47, Sun 2012-01-29, Alex Rudnick wrote:
> Hey Morten,
Hi! Sorry that I didn't remember to say “thank you” right away =)
> Morphological analysis is kind of a hard problem for many languages!
> You may have to find a language-specific tool in a lot of cases, and
> many of them may not be in Python.
Indeed, and lots of the tools are closed source and/or have rotten code
bases.
> But if you want to do Spanish, Mike Gasser (my advisor) has some
> Python 3 software that works pretty well for Spanish verbs. In many
> cases (don't know the precision/recall), it will find the infinitive
> form of a verb, given the conjugated form. There's also morphological
> analyzers for a few other languages here:
In case anyone cares, I ended up writing a small wrapper around
TreeTagger. It's giving me good results.
http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/DecisionT... --
Morten Minde Neergaard