PAN Track @ FIRE on Cross-Language detection of SOurce COde Re-use (CL-SOCO)

23 views
Skip to first unread message

Paolo Rosso

unread,
May 21, 2015, 2:24:18 PM5/21/15
to pan-workshop-series

----------------------------------------------
Call for Participation
----------------------------------------------

PAN Track on Cross-Language detection of SOurce COde re-use (CL-SOCO)
http://www.dsic.upv.es/grupos/nle/clsoco

held in conjunction with the FIRE 2015 Forum for Information Retrieval
Evaluation
4th - 6th December 2015
DAIICT, Gandhinagar

----------------------------------------------

Nowadays there is a vast amount of resources on the Web such as
repositories, blogs or forums that have become easily accessible for
all type of users. A particular type of available information within
such resources are Source Code files, that can be looked, debugged and
even tested by programmers. Original authors of these programs would
like to protect the use of these resources without their acknowledgment.

However, manually checking for suspicious re-used source code files is
infeasible when searching in big collections or in the Web. Therefore,
there is an urgent need to develop automatic systems for detecting
source code re-use cases.

The academia represents the scenario where most cases of source code
re-use have been reported: students have to solve the same assignments
under the same conditions and they can easily modify the source code in
order to not be detected. In addition, software companies have also a
special interest in preserving their intellectual property. Although
source code re-use traditionally occurs within the same programming
language (i.e., mono-lingual), an interesting and also frequent
practice is when source code re-use occurs between different
programming languages (cross-lingual).

Hence, CL-SOCO provides with source code collections written in
different programming languages, where it is known that re-use has
happened. Therefore, the aim of the task is to identify source code
pairs that are likely to been re-used across programming languages.
Participant teams are allowed to submit up to three runs as a maximum.
The training corpus has been already released.

We cordially invite all researchers and practitioners from all fields
to participate in this year's SOCO edition.

----------------------------------------------
Important Dates
----------------------------------------------

20th May, 2015 Release of training corpus (training period starts)
14th July, 2015 Release of test corpus
7th September, 2015 Submission of runs
23th October, 2015 Results notification
23th November, 2015 Working notes due

----------------------------------------------
Task Coordinators
----------------------------------------------

Enrique Flores, Paolo Rosso, Lidia Moreno
Universitat Politècnica de València, Spain

Esaú Villatoro-Tello
Universidad Autónoma Metropolitana (UAM), Mexico

----------------------------------------------
Contact
----------------------------------------------

E-mail: pan-soco[AT]dsic.upv.es
Track Web page: http://www.dsic.upv.es/grupos/nle/clsoco

----------------------------------------------

Reply all
Reply to author
Forward
0 new messages