Hi all,
This could be a really dumb question, but is there a way of filtering
words based on language?
i.e., let's say I have a sentence (or a set of words) with words from
multiple languages, like "the quick brown fox jumped over the lazy
sleeping dog le rapide goupil brun sauta par dessus le chien paresseux
sommeil el zorro marrón rápido saltó sobre el perro que duerme
perezoso" - it's just the first sentence in English followed by the
corresponding translations in French and Spanish (courtesy of Google
Translate).
Is there a way to pull out just the English words? I tried loading the
Brown corpus and getting out only the words which appear in it but I
wanted to know if there's something more elegant out there.
Thanks in advance.
A
--
You received this message because you are subscribed to the Google Groups "nltk-users" group.
To post to this group, send email to
nltk-...@googlegroups.com.
To unsubscribe from this group, send email to
nltk-users+...@googlegroups.com.
For more options, visit this group at
http://groups.google.com/group/nltk-users?hl=en.