We have released the BC3 corpus and annotation framework that was
announced at the Email 2008 Workshop at AAAI.
It is available at: http://cs.ubc.ca/labs/lci/bc3.html
The corpus contains 40 email threads, averaging 6 emails each. Each
thread is annotated with extract summaries and with human-written
abstract summaries that link back to the original sentences. The email
sentences are also labeled for speech acts, meta sentences, and
subjective sentences.
The annotation software is a web application written in Ruby on Rails that
lets researchers import and manage an email corpus. It also lets users
annotate email threads with summaries and label email features. The
software is released as open source.
Best,
Jan Ulrich