Hi,
Sorry for being opaque. I DID get frequency for each lemma, but the frequencies are based on each file, so I got output like what I've pasted below. But what I'm asking is how to get the frequency ACROSS all the speakers/files. So, for example, there are 3 tokens of verb stem abridged in the first file and then 1 token from the child, but from a later file. Imagine that for all the files I'm looking at there's a total of 50 tokens of abri verb stem. Is there a way to just automatically extract that number (for each verb stem) without having to manually go through and count how many abridged stems there are for each file (i.e. 3+1+ ...). In other words, what I want is the TOTAL number of tokens of verb stem abri -- including all speakers and including all the Spanish files that have %mor tiers - a few hundred files since there are often more than one file per child (I've put them all in one folder).
I hope this clarifies the question.
Thanks!
-Naomi
small portion of current output:
From file <diegoU030614a.cha>
Speaker: *MOT:
3 v|abri
5 v|cabe
3 v|cerra
1 v|coge
1 v|da
1 v|dormi
1 v|empuja
1 v|entra
1 v|falta
6 v|gusta
4 v|habe
2 v|hace
12 v|i
1 v|importa
2 v|junta
1 v|marcha
3 v|mira
1 v|monta
2 v|move
1 v|necesita
2 v|parece
8 v|pode
4 v|pone
1 v|prepara
4 v|quere
1 v|regala
3 v|sabe
2 v|saca
1 v|sali
16 v|tene
1 v|tira
1 v|toca
4 v|trae
3 v|ve
1 v|veni
------------------------------
35 Total number of different item types used
104 Total number of items (tokens)
0.337 Type/Token ratio
Speaker: *CHI:
1 v|abri
1 v|aparca
3 v|baja
6 v|cabe
5 v|cerra
1 v|coge
3 v|come
1 v|deja
1 v|desperta
1 v|entra
3 v|espera
7 v|habe
2 v|hace
31 v|i
1 v|mira
2 v|oí
1 v|parece
3 v|pode
2 v|pone
2 v|queda
1 v|queja
1 v|sabe
1 v|saca
2 v|senta
4 v|tene
1 v|tira
1 v|toca
1 v|trae
2 v|vale
1 v|ve
------------------------------
30 Total number of different item types used
92 Total number of items (tokens)
0.326 Type/Token ratio
From file <diegoU030614b.cha>
Speaker: *CHI:
2 v|dispara
1 v|escapa
2 v|espera
2 v|habe
16 v|i
1 v|lanza
1 v|mete
4 v|mira
2 v|oí
4 v|parece
14 v|pode
4 v|pone
1 v|quere
1 v|saca
1 v|senta
1 v|tira
------------------------------
16 Total number of different item types used
57 Total number of items (tokens)
0.281 Type/Token ratio
Speaker: *GUI:
4 v|apreta
5 v|da
1 v|deja
1 v|echa
3 v|espera
1 v|explica
2 v|habe
2 v|hace
7 v|i
4 v|mira
1 v|oí
1 v|pode
6 v|pone
1 v|sali
1 v|sujeta
2 v|tene
2 v|tira
2 v|veni