--
You received this message because you are subscribed to the Google Groups "Suite of automatic linguistic analysis tools" group.
To unsubscribe from this group and stop receiving emails from it, send an email to linguistic-analysi...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/linguistic-analysis-tools/0c70c76f-43e2-4c69-a271-82a8e9dbb054n%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Thank you for your reply! It helps a lot. I have read the index description spreadsheet, which resolved some of my problems. However, I am still confused about three indices: WN_Mean_Accuracy, BNC_Written_Trigram_Freq_Normed_Log, and Aoe_inverse_linear_regression_slope.
#TAALES #Interpretations of some indices
Thanks for everything!
Kyle, K., Crossley, S., & Berger, C. (2018). The tool for the automatic analysis of lexical sophistication (TAALES): Version 2.0. Behavior Research Methods, 50(3), 1030–1046. https://doi.org/10.3758/s13428-017-0924-4
- For WN_Mean_Accuracy, the description in the spreadsheet is “average naming accuracy of all participants for this word” and no equation is provided. As I understand it, a higher score on this index means the words in the text elicit more accurate naming responses, so a text with a higher score is lexically less sophisticated. I’m not sure whether my understanding is right.
- For BNC_Written_Trigram_Freq_Normed_Log, the description in the spreadsheet is “Mean frequency score” and the equation is “sum logged trigram frequency score/number of trigrams in text with frequency score”. For my self-built corpus, the values of this index are negative. Can I still interpret this index like the other n-gram frequency indices, that is, a higher value means the trigrams in the text are more frequent?
- For Aoe_inverse_linear_regression_slope, the description in the spreadsheet is “incremental Age of exposure (AOE) for words across 13 grade level using LDA modeling” and the equation is “1/slope of linear regression based on LDA cosine values”. After reading the spreadsheet, I’m still not clear on how to interpret this index. In their article on TAALES 2.0, Kyle et al. (2018) found that aoe_inverse_linear_regression_slope “explained a small amount of the variance (0.3%) in lexical proficiency scores. The results indicate that texts including words that have lower co-occurrence patterns at later grade level tended to earn higher scores”. First, does “scores” in “earn higher scores” refer to the index scores or the lexical proficiency scores? Second, does this mean that a higher index score suggests the words in the text are first encountered later, and that lexical proficiency is accordingly higher? I still feel confused about this index.
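
To make the two equations quoted above concrete, here is a minimal Python sketch of how I currently read them. It is not TAALES's actual code: the function names, the base-10 logarithm, the grade numbering from 1 to 13, and all frequency and cosine values are assumptions made up purely for illustration.

```python
import math


def mean_log_trigram_freq(text_trigrams, normed_freq):
    """Sum of logged frequency scores / number of trigrams that have a score.

    `normed_freq` is a hypothetical lookup of normed trigram frequencies;
    trigrams missing from it (or with a score of 0) are skipped.
    The log base (10) is an assumption.
    """
    logged = [
        math.log10(normed_freq[t])
        for t in text_trigrams
        if normed_freq.get(t, 0) > 0
    ]
    if not logged:
        return None  # no trigram in the text has a frequency score
    return sum(logged) / len(logged)


def inverse_slope(cosines):
    """1 / slope of a least-squares line through (grade, cosine) points.

    `cosines` holds one made-up LDA cosine value per grade level,
    with grades assumed to be numbered 1, 2, ..., len(cosines).
    A perfectly flat line (slope 0) raises ZeroDivisionError.
    """
    n = len(cosines)
    grades = range(1, n + 1)
    mean_x = sum(grades) / n
    mean_y = sum(cosines) / n
    sxx = sum((x - mean_x) ** 2 for x in grades)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(grades, cosines))
    return 1 / (sxy / sxx)


# Invented normed frequencies, all below 1, so every log is negative.
freqs = {
    ("one", "of", "the"): 0.5,
    ("at", "the", "end"): 0.5,
    ("in", "stark", "contrast"): 0.05,
    ("a", "plethora", "of"): 0.05,
}
frequent_text = [("one", "of", "the"), ("at", "the", "end")]
rare_text = [("in", "stark", "contrast"), ("a", "plethora", "of")]
frequent_score = mean_log_trigram_freq(frequent_text, freqs)
rare_score = mean_log_trigram_freq(rare_text, freqs)

# Invented cosine trajectories over 13 grade levels.
fast_word = [0.10 * g for g in range(1, 14)]  # cosine rises quickly
slow_word = [0.02 * g for g in range(1, 14)]  # cosine rises slowly
fast_inverse = inverse_slope(fast_word)
slow_inverse = inverse_slope(slow_word)

print(frequent_score, rare_score)
print(fast_inverse, slow_inverse)
```

Under these assumptions, any normed frequency below 1 has a negative logarithm, so a negative mean would not by itself indicate a problem, and a less negative mean would still correspond to more frequent trigrams. Likewise, a word whose cosine values rise slowly across grades has a small slope and therefore a large inverse slope. Whether TAALES computes the indices this way is exactly what I would like to confirm.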