Hi everybody!
I write my master thesis on physiotherapy master abstracts and have collected a corpus of such abstracts. To evaluate keyness etc. I wanted to use antconc. After I realized that it is expensive to get the data from coca I looked for alternatives. I found the 5k lemma list from Word frequency data (
https://www.wordfrequency.info/free.asp). I copied the data in excel, changed the columns in order to get the right ordering of rank, frequency and word. Then I copied the data out of the table into word and once more into wordpad to get the txt file. Then I tried the Notepad++ and saved it with ticking the UTF-8-BOM coding. In the end, I did get some results, but I would say weird ones...see attached files.
Then I thought I would use the word list suggested through Mr. Anthony on his homepage and, therefore, I took the written part of the BNC. However, I obtained another error message:
On or more of the rank column values is not a number. First error on line: 0 Value: #word types: 334660
Finally, I tried the AntFileConverter on my data with the same weird results.
I'm quite confused by now...
Any ideas?
Best regards,
Theresa