Dear all,
I would like to have a measure of vocabulary diversity in number of transcripts of children's narratives. VOCD seems like the most reliable choice for that and I would really like to use it. However, narratives are rather small. Most of them is between 80 and 170 tokens, but some of them are lower than 50. I know that default minimum value for calculating VOCD is 50 tokens, but it can also be set lower. Also, I believe that using the option of replacements would give me some results. However, I do not know how reliable such results would be and which of the two mentioned methods should give me more appropriate measures.
I am sorry if I'm posting this to the wrong group. Perhaps it is more of a methodological than a technical question. I feel it it somewhere between two worlds:). But if it would be more appropriate for info-childes, I will place it there.
I would really appreciate your help on this one.