Final Results of 1st International Competition on Plagiarism Detection

Martin Potthast

unread,

Jun 15, 2009, 3:45:04 PM6/15/09

to pan09-co...@googlegroups.com

Dear all,

I have the honour to announce the final results of the 1st International Competition on Plagiarism Detection.

In total, we received submissions from 13 out of 21 registered participants. There were 10 submissions for the external plagiarism analysis task and 4 for the intrinsic plagiarism analysis task (1 participant submitted results for both tasks). The competition corpus contains 46,946 plagiarism cases, 36,475 of them in the corpus for the external analysis task, and the remaining 10,471 in the corpus for the intrinsic analysis task.

The competition Web page has been updated and it now lists the participant's performances (mind your browser cache).
Click on this link to jump there directly:
http://www.uni-weimar.de/medien/webis/research/workshopseries/pan-09/competition.html#results

Congratulations to our winners!

I would also like to take the chance and thank you all for participating in the competition. It is you who made the competition a success! All of you did a great job in so short a time, and we are looking forward to meeting all of you at the PAN workshop in September to discuss your ideas, and to celebrate.

Best regards,
Martin

PS: The full corpus as well as the submission evaluation source code will be released soon. Also, remember the submission deadline for your papers describing your approach is July 1, 2009.

--
Martin Potthast
Bauhaus-Universität Weimar
www.webis.de - netspeak.webis.de

If you do things right, people won't be sure you've done anything at all.

Jan Kasprzak

unread,

Jun 15, 2009, 4:00:03 PM6/15/09

to pan09-co...@googlegroups.com

Martin Potthast wrote:
: http://www.uni-weimar.de/medien/webis/research/workshopseries/pan-09/competition.html#results

Congratulations to the winners!

To my surprise, our rather simple approach turned out to be
pretty good. Also, by evaluating the development corpus, I have discovered
several ways how to improve our existing system. So thanks also to the
competition organizers!

-Jan Kasprzak

--
| Jan "Yenya" Kasprzak <kas at {fi.muni.cz - work | yenya.net - private}> |
| GPG: ID 1024/D3498839 Fingerprint 0D99A7FB206605D7 8B35FCDE05B18A5E |
| http://www.fi.muni.cz/~kas/ Journal: http://www.fi.muni.cz/~kas/blog/ |
>> If we wanted to trade simplicity and kewl design for usability I think <<
>> we all know the URL of the Apple Store. --jmorris42 @LWN <<

vladislav....@gmail.com

unread,

Jun 15, 2009, 4:25:51 PM6/15/09

to PAN'09 Competition on Plagiarism Detection

Our congratulations to the winners as well! Well done :)

Martin and other guys who organized this, thank you, that was very
interesting challenge!

Martin, do I understand it right, that we still can submit a paper,
even though we took only 6th place?

2009/6/15 Martin Potthast <martin....@uni-weimar.de>:

> Dear all,
>
> I have the honour to announce the final results of the 1st International
> Competition on Plagiarism Detection.
>

--
WBR, Vlad A. Shcherbinin
http://siberiasoft.info

On Jun 15, 9:00 pm, Jan Kasprzak <k...@fi.muni.cz> wrote:
> Martin Potthast wrote:
>
> :http://www.uni-weimar.de/medien/webis/research/workshopseries/pan-09/...
>
> Congratulations to the winners!
>

Martin Potthast

unread,

Jun 15, 2009, 6:20:46 PM6/15/09

to pan09-co...@googlegroups.com

To my surprise, our rather simple approach turned out to be
pretty good. Also, by evaluating the development corpus, I have discovered
several ways how to improve our existing system. So thanks also to the
competition organizers!

You're welcome! We are eager to hear about your approach.

Best,
Martin

Martin Potthast

unread,

Jun 15, 2009, 6:22:07 PM6/15/09

to pan09-co...@googlegroups.com

Martin and other guys who organized this, thank you, that was very
interesting challenge!

Thank you, too!

Martin, do I understand it right, that we still can submit a paper,
even though we took only 6th place?

Why, yes, certainly!

Everyone participant is entitled to publish at least a short paper.

Best regards,
Martin

Emanuele Caglioti

unread,

Jun 17, 2009, 9:54:08 AM6/17/09

to pan09-co...@googlegroups.com

Congratulations to the winner and to the organizers!
It is not so easy to organize the 1st competition (in whatever field)!
Best,
Emanuele

Martin Potthast

unread,

Jun 17, 2009, 12:48:02 PM6/17/09

to pan09-co...@googlegroups.com

Thank you very much for your kind words, Emanuele!

Best,
Martin

J.A. Palkovskii Plagiarism-Detector Project Leading Programmer

unread,

Jun 29, 2009, 3:48:30 PM6/29/09

to PAN'09 Competition on Plagiarism Detection

Hello, Martin!
Hello, Dear All!

I'm awfully sorry I'm sending this post late - as personal
matters prevented me from replying earlier.

I would like to say my word of gratitude to the people who worked hard
for the Plagiarism
Detection Competition, and Martin in particular!

You have not only developed a profound model of Plagiarism detection
and
effectivenes estimation, but you've managed to overcome all the
difficulties that
arose on your way. The developed framework (both corpuses) and the
resulting
system have no maches (I've been working on Plagiarism detection quite
a long time,
but to the best of my knowlege this is the first time I've so
impressed by the results)

It's so good to see that scientific research can solve really
interesting and hot problems today!

Being one of the developers I feel an enourmous impulse to continue
polishing
the detection algorithms and make our results better during the next
competition.

I've seen a number of commersial solutions participating in the
competition -
and I think that it is great, when a really objective comparison is
done.

I hope this will become a good scientific annual occasion!

On 17 июн, 19:48, Martin Potthast <martin.potth...@uni-weimar.de>
wrote:

> Thank you very much for your kind words, Emanuele!
>
> Best,
> Martin
>
> On Wed, Jun 17, 2009 at 3:54 PM, Emanuele Caglioti <
>
>
>
> emanuele.cagli...@gmail.com> wrote:
> > Congratulations to the winner and to the organizers!
> > It is not so easy to organize the 1st competition (in whatever field)!
> > Best,
> > Emanuele
>
> > On Tue, Jun 16, 2009 at 12:20 AM, Martin Potthast <
> > martin.potth...@uni-weimar.de> wrote:
>
> >> To my surprise, our rather simple approach turned out to be
> >>> pretty good. Also, by evaluating the development corpus, I have
> >>> discovered
> >>> several ways how to improve our existing system. So thanks also to the
> >>> competition organizers!
>
> >> You're welcome! We are eager to hear about your approach.
>
> >> Best,
> >> Martin
>
> >> --
> >> Martin Potthast
> >> Bauhaus-Universität Weimar

> >>www.webis.de- netspeak.webis.de

>
> >> If you do things right, people won't be sure you've done anything at all.
>
> --
> Martin Potthast

> Bauhaus-Universität Weimarwww.webis.de- netspeak.webis.de

Martin Potthast

unread,

Jun 30, 2009, 4:01:23 AM6/30/09

to pan09-co...@googlegroups.com

Hi Yurii,

I would like to say my word of gratitude to the people who worked hard
for the Plagiarism
Detection Competition, and Martin in particular!

Thank you very much!

You have not only developed a profound model of Plagiarism detection
and
effectivenes estimation, but you've managed to overcome all the
difficulties that
arose on your way. The developed framework (both corpuses) and the
resulting
system have no maches (I've been working on Plagiarism detection quite
a long time,
but to the best of my knowlege this is the first time I've so
impressed by the results)

We are also happy how things turned out in the end. In particular, the quality measures carry some interesting and fresh ideas, and we also hope that the corpus is sufficient to be used in the future. It will be released soon.

It's so good to see that scientific research can solve really
interesting and hot problems today!

You're right! It is very rewarding to work on solutions for hard practical problems.

Being one of the developers I feel an enourmous impulse to continue
polishing
the detection algorithms and make our results better during the next
competition.

I've seen a number of commersial solutions participating in the
competition -
and I think that it is great, when a really objective comparison is
done.

I hope this will become a good scientific annual occasion!

We are already planning the next competition, and, we are also open to suggestions and criticism of any kind. What would you wish for next time?

Best,
Martin

--
Martin Potthast
Bauhaus-Universität Weimar

www.webis.de - netspeak.webis.de

Jan Kasprzak

unread,

Jun 30, 2009, 10:06:45 AM6/30/09

to pan09-co...@googlegroups.com

Martin Potthast wrote:
: We are already planning the next competition, and, we are also open to

: suggestions and criticism of any kind. What would you wish for next time?

I would definitely like to have a corpus without the large number
of "accidental" similarities, not generated by the machine plagiarist.

Also, it could be interesting to unify the source and suspicious
documents into one base, with "find all the similarities amongst them"
as a competition task.

Reply all

Reply to author

Forward