Hi Francisco and Paolo,
this one is for you.
Martin
---------- Forwarded message ----------
From: James Andrew Marquardt <jam...@uw.edu>
Date: Wed, Apr 2, 2014 at 6:06 AM
Subject: PAN Author profiling task - blog corpus
To: Martin Potthast <martin....@uni-weimar.de>
Cc: Golnoosh Farnadi <Golnoosh...@ugent.be>, gayathri Vasudevan
<gv...@uw.edu>, Martine De Cock <Martine...@ugent.be>
Mr. Potthast,
I have a question that has just come up today regarding the blog
corpus for the author profiling task.
It appears in the blog posts that have been provided are incomplete
posts, in that the text is cut off after a certain point in each post.
The full text can be viewed by following the url in the document xml
tag.
My question is whether the full text will be made available for the
blogs for the final submission or if we should anticipate needing to
retrieve the full text ourselves.
Thank you,
-James Marquardt
--
Martin Potthast
Bauhaus-Universität Weimar
www.webis.de --- www.netspeak.org