CLAN’s FLO program does most of this. Alternatively, you could grab all the <w> tags from the XML version of the database.
What kind of NLP do you want to use? You could apply Universal Dependencies directly.
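For the XML route, a minimal sketch of pulling out the <w> elements might look like the following. It uses only Python's standard library and matches on the local tag name, so it makes no assumption about the exact namespace URI in your files. The function names and the sample transcript are made up for illustration, and the sketch keeps only the text that comes before any nested markup inside a <w>, so words with embedded elements would need extra handling.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip a '{namespace}' prefix from an element tag, if present."""
    return tag.rsplit("}", 1)[-1]


def extract_words(xml_text):
    """Return the leading text of every <w> element, in document order.

    Assumption: the word form is the text directly inside <w>, before
    any child elements. Nested annotations are ignored.
    """
    root = ET.fromstring(xml_text)
    words = []
    for elem in root.iter():
        if local_name(elem.tag) == "w":
            text = (elem.text or "").strip()
            if text:
                words.append(text)
    return words


# Hypothetical, heavily simplified transcript for illustration only.
sample = (
    '<CHAT xmlns="http://www.talkbank.org/ns/talkbank">'
    '<u who="CHI"><w>more</w><w>cookie</w></u>'
    "</CHAT>"
)
print(extract_words(sample))  # ['more', 'cookie']
```

For a whole corpus you would read each file's contents and pass them to extract_words, or swap ET.fromstring for ET.parse(path).getroot().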
— Brian MacWhinney
Teresa Heinz Professor of Cognitive Psychology,
Language Technologies and Modern Languages, CMU