extracting stream items from the TREC-TS-2014F dataset

95 views
Skip to first unread message

kita3...@gmail.com

unread,
Jul 21, 2014, 9:31:50 PM7/21/14
to stream...@googlegroups.com
Hello,

I am one of participants in the TREC 2014 Temporal Summarization (TS) track.
I use the TS-specific corpus subset (http://s3.amazonaws.com/aws-publicdatasets/trec/ts/index.html) and have some problems.
Would you give me any advice?


1.
We were extracting stream items from the TS-specific corpus subset with the attached java program.
However, we cannot find "lingpipe" component in stream items from 2013-02-03-00 to 2013-04-20-23 or get sentences.
Would you tell me about that solution methods?


2.
Our atattied program doesn't output any sentences, when we extract stream items from some chunk files.
(e.g.

What does this case mean?
May I ignore these chunk files?


Thank you for your consideration.
Sayaka

ReadThriftKOBE.java
Reply all
Reply to author
Forward
0 new messages