Hi Yedo,
With the default settings, "don't" will be treated as "don" and "t" in the list. I recommend either of the following:
1) Use a tokenization tool (e.g. my TagAnt tool) to properly segment the raw texts. Then, load the texts into AntConc using the Corpus Manager with the "simple word, tag, headword" indexer turned on. This will then treat all tokenized words separately. I really should add this functionality directly into AntConc, but the models needed to tokenize different languages are quite big, so it would increase the size of the app considerably.
2) Edit the default token definition in AntConc to include the apostrophes.
1) is the better option because it is the most complete. But, 2) is simple and easy to do and the results are very transparent.
I hope that helps!
Laurence.
###############################################################
Laurence ANTHONY, Ph.D.
Professor of Applied LinguisticsFaculty of Science and Engineering
Waseda University
3-4-1 Okubo, Shinjuku-ku, Tokyo 169-8555, Japan
E-mail:
antho...@gmail.comWWW:
http://www.laurenceanthony.net/###############################################################