TagAnt, tags and the COCA corpus


Marie Juanchich

May 29, 2015, 11:10:22 AM
to ant...@googlegroups.com
Hello all,

I am a psychologist attempting my first corpus analysis (on the way people express their uncertainty), so please bear with me!

I am trying to use TagAnt to tag some of the untagged files of the COCA corpus, but it crashes every time. I tried with a single file in case the full set was too much to process at once, but that does not work either. I also have the tagged version of the COCA corpus, but I do not understand how to separate the text from the tags and lemmas, because the tags and lemmas are not delimited with _ or <>.

Example of the tagged text from COCA:
It    it    pph1
was    be    vbdz
n't    n't    xx
the    the    at
usual    usual    jj

Any tips on how to handle tags that are not marked as tags or how to use TagAnt? That would be very much appreciated!

Regards,

Marie

Laurence Anthony

May 29, 2015, 12:15:44 PM
to ant...@googlegroups.com
Hi Marie,

When you say TagAnt crashes, do you mean it closes unexpectedly, or is it just thinking? Does it work with a small 1000-line file? TagAnt should be fine with data like this.

The raw data looks to be word SPACE lemma SPACE tag.

So, you just need to replace the spaces with underscores and then delete the middle _LEMMA_ part to get a tagged version (word_TAG). Maybe not so easy without a bit of scripting.
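
As a rough illustration of that conversion, here is a minimal Python sketch. It assumes each line holds exactly three whitespace-separated columns (word, lemma, tag), as in the sample above. The real COCA files may contain extra columns or header lines, so treat this as a starting point only. The function names convert_line and convert_text are made up for this example, and lines that do not have three columns are silently skipped.

```python
def convert_line(line, sep="_"):
    """Turn one 'word lemma tag' line into 'word_tag'.

    Returns None for lines that do not have exactly three columns
    (an assumption about the file layout, not a COCA guarantee).
    """
    fields = line.split()  # splits on any run of spaces or tabs
    if len(fields) != 3:
        return None
    word, _lemma, tag = fields  # the lemma column is dropped
    return f"{word}{sep}{tag}"


def convert_text(text, sep="_"):
    """Convert a block of 'word lemma tag' lines into one tagged string."""
    tokens = (convert_line(line, sep) for line in text.splitlines())
    return " ".join(t for t in tokens if t is not None)


sample = (
    "It    it    pph1\n"
    "was    be    vbdz\n"
    "n't    n't    xx\n"
    "the    the    at\n"
    "usual    usual    jj\n"
)

print(convert_text(sample))
# prints: It_pph1 was_vbdz n't_xx the_at usual_jj
```

To process a whole file, read it with open(path, encoding="utf-8").read() and pass the result to convert_text. The encoding is also an assumption and may need adjusting.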

Laurence.



###############################################################
Laurence ANTHONY, Ph.D.
Professor
Center for English Language Education in Science and Engineering (CELESE)
Faculty of Science and Engineering
Waseda University
3-4-1 Okubo, Shinjuku-ku, Tokyo 169-8555, Japan
E-mail: antho...@gmail.com
WWW: http://www.laurenceanthony.net/
###############################################################
