I am currently analyzing data on teacher-child conversations in prekindergarten classrooms within underserved communities, specifically examining linguistic features such as clausal density, lexical diversity, and lexical sophistication.
For my analysis, I have transcribed book reading conversations and processed them through the TAALES and TAALED software. I am using the subordination index (SI; SALT) to assess clausal density, MATTR for lexical diversity, and I am exploring the COCA Academic frequency measure (all words, log transformation) to capture lexical sophistication. However, I am having difficulty understanding the output of TAALES. The frequency counts surpass the total word count. In looking at the articles posted and the spreadsheet, it seems that words are ranked by their frequency, with less frequent words on the academic word list receiving a higher rank, and the sum is then divided by the total word count. Thus, higher scores would indicate higher use of academic words. Am I interpreting this process correctly?
Additionally, do you consider this measure appropriate for assessing lexical sophistication in spoken conversations with children? My preliminary findings suggest that SI, MATTR, and the COCA frequency measure capture distinct dimensions, each potentially influencing children's vocabulary growth in different ways.
Thank you very much for any feedback!
JeanMarie