Deadline Extension & Corpus Update

31 views
Skip to first unread message

Martin Potthast

unread,
Apr 9, 2014, 2:20:06 PM4/9/14
to pan-workshop-series
Dear everyone,

as many of you are hard at work with getting ready for the final
software submission deadline, we have uncovered a couple of problems
and inconsistencies that needed fixing. We hope that fixing these
problems at your end will not give you a hard time, however, in order
not to add to the stress, we decided to extend the final software
submission deadline 2 weeks in order to give everyone time to breathe.

The new final software submission deadline is on March 1, 2014.


For those of you who work on author profiling, please take not of the
most recent update of the training data which have been uploaded to
the task web page today:
http://www.webis.de/research/corpora/corpus-pan-labs-09-today/pan-14/pan14-data/pan14-author-profiling-training-corpus-2014-04-09.zip

The changes to the corpus include the unification of the 65+-label for
which there have been a couple of different versions in the corpus.

You can download the new corpus version, and you will also find also
mounted into your virtual machines right now.

If you find any further problems, please let us know as early as possible.

Martin

--
Martin Potthast
Bauhaus-Universität Weimar
www.webis.de --- www.netspeak.org

Martin Potthast

unread,
Apr 9, 2014, 2:58:26 PM4/9/14
to pan-workshop-series
> The new final software submission deadline is on March 1, 2014.

That should be May 1, 2014!

We'll update from virtual machines to time machines in next year's PAN. :-)

kiviel...@gmail.com

unread,
Apr 9, 2014, 2:59:40 PM4/9/14
to Martin Potthast, martin....@gmail.com, pan-workshop-series
Hi Martin,
I assume you mean *May* 1st, right? :) Any news on the early birds?
Kind regards
Victoria
------Mensaje original------
De: Martin Potthast
Remitente: martin....@gmail.com
Para: pan-workshop-series
Asunto: Deadline Extension & Corpus Update
Enviado: 9 de abr de 2014 15:20

Dear everyone,

as many of you are hard at work with getting ready for the final
software submission deadline, we have uncovered a couple of problems
and inconsistencies that needed fixing. We hope that fixing these
problems at your end will not give you a hard time, however, in order
not to add to the stress, we decided to extend the final software
submission deadline 2 weeks in order to give everyone time to breathe.

The new final software submission deadline is on March 1, 2014.


For those of you who work on author profiling, please take not of the
most recent update of the training data which have been uploaded to
the task web page today:
http://www.webis.de/research/corpora/corpus-pan-labs-09-today/pan-14/pan14-data/pan14-author-profiling-training-corpus-2014-04-09.zip

The changes to the corpus include the unification of the 65+-label for
which there have been a couple of different versions in the corpus.

You can download the new corpus version, and you will also find also
mounted into your virtual machines right now.

If you find any further problems, please let us know as early as possible.

Robert Layton

unread,
Apr 9, 2014, 9:37:42 PM4/9/14
to pan-works...@googlegroups.com, Martin Potthast, Martin Potthast
Hi Martin,

The authorship verification corpus in /media/pan14-training-corpora are outdated, specifically they don't contain the new json style information.
Can these please be updated? I believe all you'd need to do is unzip the new data into that folder.

Thanks,

Robert



--
--
You received this message because you are subscribed to the Google Group "PAN".
Visit this group at http://groups.google.com/group/pan-workshop-series
To unsubscribe send email to pan-workshop-se...@googlegroups.com.
---
You received this message because you are subscribed to the Google Groups "PAN Workshop Series. Uncovering Plagiarism, Authorship, and Social Software Misuse." group.
To unsubscribe from this group and stop receiving emails from it, send an email to pan-workshop-se...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.



--
Dr. Robert Layton
Research Fellow
w: Website   e: r.la...@icsl.com.au   t: @robertlayton
Internet Commerce Security Laboratory
Federation University Australia

Some recent publications:
1) Authorship Attribution of IRC Messages Using Inverse Author Frequency  Link
2) Unsupervised authorship analysis of phishing webpages  PDF
3) Recentred local profiles for authorship attribution  Link
Coming soon -- a book chapter on attribution!

Martin Potthast

unread,
Apr 10, 2014, 2:49:04 AM4/10/14
to Robert Layton, pan-workshop-series
> The authorship verification corpus in /media/pan14-training-corpora are
> outdated, specifically they don't contain the new json style information.
> Can these please be updated? I believe all you'd need to do is unzip the new
> data into that folder.

Good catch, Robert! I've added the most recent version to the mount
point on everyone's virtual machines.

Robert Layton

unread,
Apr 13, 2014, 8:30:48 PM4/13/14
to Martin Potthast, pan-workshop-series
Thanks for that. All working now!
Reply all
Reply to author
Forward
0 new messages