Get frequencies of a list of words for each transcript

Lulu

Jan 14, 2015, 4:47:39 PM
to chib...@googlegroups.com
Happy New Year to the CLAN Team!

I have a list of word stems and would like to 1) find the frequency of each stem in the list in each transcript and 2) output these frequencies to an Excel data file. My transcripts already have the MOR tier. I can search for a specific word stem using the following command:

freq +s"@|-v,|-part,r-go" *.cha > go

I'm wondering whether there is a way to perform the first task other than substituting each word in turn for "go". Also, is there a way to extract the frequencies from the output file and put them into an Excel file (or a plain-text file that Excel can read)?

Thank you very much!

All best,
Lulu

Brian MacWhinney

Jan 14, 2015, 5:14:45 PM
to ChiBolts
Dear Lulu,
FREQ has several options for writing results in Excel format. Type FREQ by itself and you will see some of them, such as +d2 and +d3. As for referring to a list, there are various ways to do this, and a lot depends on further details about what you want. The simple approach is +s@F, where F is the name of the file containing your words. But you seem to want something more complex, so I will leave it to Leonid to answer in further detail.
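
For illustration, the word file for +s@F could be generated from a stem list with a short script. This is only a rough sketch in Python, not a confirmed recipe: the file name stems.cut and the stem list are hypothetical, the one-pattern-per-line layout is an assumption, and the pattern is simply copied from the command in the original question, so whether FREQ accepts it inside a word file should be checked against your CLAN version.

```python
from pathlib import Path

# Pattern template copied from the command in the original question;
# "{stem}" marks the position where "go" appeared.
PATTERN_TEMPLATE = "@|-v,|-part,r-{stem}"


def build_patterns(stems):
    """Return one search pattern per non-empty stem, in input order."""
    return [PATTERN_TEMPLATE.format(stem=s.strip()) for s in stems if s.strip()]


def write_word_file(stems, path):
    """Write the patterns to `path`, one per line, and return the +s@ option."""
    patterns = build_patterns(stems)
    Path(path).write_text("\n".join(patterns) + "\n", encoding="utf-8")
    return f"+s@{Path(path).name}"


if __name__ == "__main__":
    # Hypothetical stem list and file name, for illustration only.
    option = write_word_file(["go", "come", "run"], "stems.cut")
    # Print the command to paste into the CLAN Commands window.
    print(f"freq {option} +d2 *.cha")
```

The script only prepares the file and prints a command line; it does not run CLAN itself.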

—Brian MacWhinney

Leonid Spektor

Jan 14, 2015, 6:00:57 PM
to chib...@googlegroups.com
Lulu,

I don't fully understand what you are trying to achieve, so I am going to take my best guess. Try the following command:

freq +sm|*,;*,o% +d2 *.cha

If you are using an older version of CLAN, then the command would be:

freq +s"@|*,r-*,o%" +d2 *.cha


It will create an Excel-formatted output file, "stat.frq.xls". If this is not what you want, then please email me directly with a more detailed description or an example of what you are looking for and the form of output you want to get.

Leonid.



Lulu

Jan 14, 2015, 10:41:07 PM
to chib...@googlegroups.com
Thanks, Brian, for the prompt reply. Your answer is very helpful. I will just write to Leonid with more detail to confirm what I'm doing is right.

Best,
Lulu