Streamcorpus_dump tool for getting data from .sc files

41 views
Skip to first unread message

Kanika Parashar

unread,
Jun 27, 2015, 9:50:06 PM6/27/15
to tre...@googlegroups.com
I have installed streamcorpus using pip. I need to deserialize the data, i.e get the data from the .sc files.
I tried using the streamcorpus_dump tool with the following command -
streamcorpus_dump --component clean_visible input.sc>input.txt
However it outputs an empty file.
Pls help me figure this out.. or if anyone can suggest an alternative that would help too.

Richard McCreadie

unread,
Jul 7, 2015, 1:41:51 PM7/7/15
to tre...@googlegroups.com
I would suggest taking this question to the stream corpus mailing list, as we do not maintain the stream corpus software.

I personally use my own java implementation to process the corpus, compiled from the thrift definition.

RichardM
Reply all
Reply to author
Forward
0 new messages