Thank you very much Leonid! I still have some questions:
1. I also wonder if I can use a similar command with "-b" to calculate English MLU in words (instead of morphemes). For example, may I use the command to calculate MLU in words for the transcript attached?
mor +t*CHI MEV001_E_retell.cha +1
post +t*CHI MEV001_E_retell.cha +1
mlu +t*CHI +s“[+ G]” +s"[+ U]" +k -b MEV001_E_retell.cha > 001_E_retell_mlu.cha
2. Moreover, I want to distinguish between regular verbs (e.g., work - worked) and irregular verbs (e.g., find - found) when calculating number of different words (NDW).
I understand +s@r-*,o-% will find all stems and erase all other markers. But I want to treat the irregular verbs (e.g., found) as a separate lexical from the stem (e.g., find), but count the regular verbs (e.g., worked) and the stem (e.g., work) as one entry.
But with the following two commands, I got results treating the irregular and regular verbs as the same (either reserving the suffix in regular verbs or keeping only the stem):
freq +t*CHI +s"[+ G]" +s"[+ U]" +k MEV001_E_retell.cha > 001_E_retell_NDW_different.cha
freq +t*CHI +s"[+ G]" +s"[+ U]" +t%mor -t* +s"@r-*,o-%" +k MEV001_E_retell.cha > 001_E_retell_NDW_same.cha
3. If I am going to use "+d3" to output type/token information in Excel format, is it possible to also output MLU, total number of utterances, and even other information (e.g., code calculation) in the same spreadsheet?
Thanks a lot!!!
Sincerely,
Ying