Matthias Stirner
unread,Jul 11, 2010, 8:56:54 AM7/11/10Sign in to reply to author
Sign in to forward
You do not have permission to delete messages in this group
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to AntWordProfiler-discussion
Hi Laurence,
I'm fairly new to AntWordProfiler and corpus linguistics in general,
so this might be a bit of a stupid question, but I haven't found any
info explaining this. I wondered why we need three or more seperate
level lists, and why exactly Paul Nation chose to put words and word
families into one or the other.
Trying to find out myself, I came up with this explanation:
- "nation_baseword_1.txt" contains the most common words. These might
show up in any kind of text.
- "nation_baseword_2.txt" contains less common words. If one or more
of these are found, we can make assumptions about the texts topic.
- "nation_baseword_3.txt" contains very specific words. If one or more
of these are found, we can make conclusions about the texts topic.
Is this correct? Thanks for your time reading this!
Kind regards,
Matthias