CESS corpora POS tags: can't find any info


Reuben

May 28, 2014, 8:47:17 AM
to nltk-...@googlegroups.com
Hi,

I am using the cess_esp corpus for a project and have trained a tagger on it. The problem is that I can't find out what its tags mean. The project page seems to have originally been http://www.lsi.upc.edu/~mbertran/cess-ece/publications, but that page has since been removed from the university's site. The only thing I could find in the Internet Archive is https://web.archive.org/web/20081219152200/http://clic.ub.edu/cessece/, which has some documentation, but the tagset there seems different from the one in NLTK. Does anyone know where the project's new home is? If there is one, shouldn't the cess readme in nltk_data be updated to point to it? If there is no new home, does anyone have a list of what the tags mean?

Any info would be much appreciated, thanks!

Reuben

May 28, 2014, 5:08:09 PM
to nltk-...@googlegroups.com

I contacted someone who wrote a related blog post, and they replied with this link: http://nlp.lsi.upc.edu/freeling/doc/tagsets/tagset-es.html
I think this, along with the Catalan version, should replace the old link in the NLTK readme.
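
In case it helps anyone else reading the tags, here is a rough sketch of how the first character of a tag could be mapped to a coarse category. The mapping is my assumption, based on the EAGLES-style layout that the page above describes, and should be checked against that page. The names `COARSE_CATEGORIES`, `coarse_category` and `summarize` are made up for illustration and are not part of NLTK.

```python
from collections import Counter

# Assumed mapping from the first character of an EAGLES-style tag to a
# coarse category. Verify against the FreeLing tagset page before relying on it.
COARSE_CATEGORIES = {
    "a": "adjective",
    "c": "conjunction",
    "d": "determiner",
    "f": "punctuation",
    "i": "interjection",
    "n": "noun",
    "p": "pronoun",
    "r": "adverb",
    "s": "adposition",
    "v": "verb",
    "w": "date",
    "z": "number",
}


def coarse_category(tag):
    """Return the assumed coarse category for a tag, or "unknown"."""
    if not tag:
        return "unknown"
    # Only the first character is inspected; case is ignored.
    return COARSE_CATEGORIES.get(tag[0].lower(), "unknown")


def summarize(tagged_words):
    """Count coarse categories over an iterable of (word, tag) pairs."""
    return dict(Counter(coarse_category(tag) for _, tag in tagged_words))
```

Something like `summarize(cess_esp.tagged_words())`, with `from nltk.corpus import cess_esp`, should then give a quick overview of the corpus, assuming the corpus has been downloaded. The remaining characters of each tag encode finer attributes, which this sketch ignores.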