Hello everyone,
I have trained an averaged perceptron model for part-of-speech tagging of Russian texts and would like to incorporate it into nltk. My goal is to alter pos_tag and pos_tag_sents methods from tag module so that they work not only for English texts but also for Russian.
I used methods of PerceptronTagger class from tag module for training and storing the model file.
The average accuracy of the model measured by 10-fold cross-validation is 99%.
You can find more detailed description in the attachement.
Tsolak Ghukasyan