Dear everyone,
with some delay, I am happy to announce that the oracle for the
plagiarism source retrieval task is now ready.
Please have a look at the documentation:
http://webis15.medien.uni-weimar.de/pan/#clueweb-usage
There, you'll find that the JSON response upon downloading a document
now includes the document as HTML as well as text, the oracle decision
about this document, and in case the oracle decides a document is an
actual source for a given suspicious document, also the plagiarized
text.
In order to make up for the delay, we extend the early bird submission
deadline for those working on source retrieval until Tuesday, March 3.
Apologies for the inconveniences.
Should you find any errors, please let us know as soon as possible at
p...@webis.de.
Best,
Martin
--
Martin Potthast
Bauhaus-Universität Weimar
www.webis.de ---
www.netspeak.org