Update

1 view
Skip to first unread message

Eneko Agirre

unread,
Mar 4, 2010, 2:21:21 PM3/4/10
to wsd-d...@googlegroups.com

*NEWS:*


* Chinese background texts and wordnets available
* The number of system runs is limited to 3 runs per team
* The dtd has been enriched to accomodate sentence boundaries and,
in the case of Dutch, to properly encode component lemmas of
compounds. You can retrieve the new version here
<http://ixa2.si.ehu.es/eneko/Semeval2010/all-words.dtd>.


Instructions for participation

At test time, the test documents will be tokenized (segmented in the
case of Chinese). And the organizers will clearly mark which are the
units (single word, multiword, component of compounds) that need to
be disambiguated. These will include nouns and verbs.

The sense inventory is based on WordNet and will include all words,
multiwords and senses to be used. Instructions to obtain the relevant
wordnets are available from the website.

The organizers also include some untagged background documents,
which the participants can use to tune their WSD algorithms.

Steps:

1. join the mailing list <http://groups.google.com/group/WSD-domain>
2. register in SemEval website
<http://semeval2.fbk.eu/semeval2.php?location=login_user>
3. download the trial data from SemEval website
<http://semeval2.fbk.eu/semeval2.php?location=data>
4. download background text (see "download area" in task website
<http://xmlgroup.iit.cnr.it/SemEval2010/>)
5. get wordnets of respective languages (see "download area" in task
website <http://xmlgroup.iit.cnr.it/SemEval2010/>)
6. download test data from SemEval website
<http://semeval2.fbk.eu/semeval2.php?location=data>
7. upload results (up to three different system runs) to SemEval
website <http://semeval2.fbk.eu/>
8. write a paper for the workshop


Schedule

* March 26: test data available
* April 2: deadline for uploading results
* April 13: results made available
* April 17: deadlline for paper submission


Reply all
Reply to author
Forward
0 new messages