Finding verb stems in bilingual transcripts

20 views
Skip to first unread message

Lulu

unread,
Jan 29, 2014, 12:19:23 PM1/29/14
to chib...@googlegroups.com
Hi,

Following up on my previous post - I'm able to find all the verb stems now but would like to separate the results into English and Spanish words, as my transcripts are bilingual. I have 3 sets of files: English only transcripts, English dominant transcripts with [- spa] lines (which may contain @s for English words occurring in Spanish lines), and Spanish dominant transcripts with [- eng] lines (also with @s words). Is there any way to generate a list of all English verb stems and a list of all Spanish verb stems from these transcripts?

Thanks in advance!

Lulu

Brian MacWhinney

unread,
Jan 29, 2014, 2:45:56 PM1/29/14
to ChiBolts
Dear Lulu,
     You can get results for English by excluding lines marked with [- spa] using -s”[- spa]” and you can get Spanish by using +s”[- spa], but words marked at second language only by having the @s symbol on them will not be automatically excluded.  You would have to take a look at the output in each case to see which additional words are marked as @s.
   The general point is that maximizing use of the [- spa] method for coding gives the best results.  

—Brian MacWhinney

--
You received this message because you are subscribed to the Google Groups "chibolts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+u...@googlegroups.com.
To post to this group, send email to chib...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/3f1a7366-17ac-488e-94c7-4eab251b9727%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Lulu

unread,
Feb 4, 2014, 4:00:51 PM2/4/14
to chib...@googlegroups.com
I see. Thank you very much!

Best,
Lulu
Reply all
Reply to author
Forward
0 new messages