a doubt regarding LMF format and ids

4 views
Skip to first unread message

Anup Kulkarni

unread,
Apr 1, 2010, 8:27:53 AM4/1/10
to WSD-d...@googlegroups.com
Hi all,

We had some confusion regarding the synset ids to be used for marking
the words. Kindly clarify our query as mentioned below:

Each <LexicalEntry> in the LMF format file provided for the English
Wordnet contains one or more <Sense> tags. Each of these <Sense> tags
has two attrributes, namely, "id" and "sysnet". We wanted to know
whether the value of the "id" attrribute is to be used as the sense
key or whether the value of the "sysnet" attribute is to be used as
the sense key.

For example for the word "environment" the <LexicalEntry> is as follows:

<LexicalEntry>
<Lemma writtenForm="environment" partOfSpeech="n"/>
<Sense id="eng-30-08567235-n_2" synset="eng-30-08567235-n"/>
<Sense id="eng-30-13934596-n_1" synset="eng-30-13934596-n"/>
</LexicalEntry>

So if our system decides that the correct sense is 08567235 then shoud
we output our answer as "eng-30-08567235-n_2" (i.e. the value of the
"id" attribute) or as "eng-30-08567235-n" (i.e. the value of the
"synset" attribute).

--
thanks and regards,
Anup Kulkarni
MTech II year,
CSE, IIT-B
http://www.cse.iitb.ac.in/~anup

Oier Lopez de Lacalle

unread,
Apr 1, 2010, 9:06:39 AM4/1/10
to wsd-d...@googlegroups.com, anup...@gmail.com

Hi Anup,

You have to focus on the synset value. That is, following your example,
you should return "eng-30-08567235-n" as your system's answer.

I hope I clarify your question.

Best,

Oier

Anup Kulkarni(e)k dio:

> http://www.cse.iitb.ac.in/~anup <http://www.cse.iitb.ac.in/%7Eanup>
> --
> You received this message because you are subscribed to the Google
> Groups "SemEval-2010 task 17: all-words WSD on a specific domain" group.
> To post to this group, send email to WSD-d...@googlegroups.com.
> To unsubscribe from this group, send email to
> WSD-domain+...@googlegroups.com.
> For more options, visit this group at
> http://groups.google.com/group/WSD-domain?hl=en.

Radu ION

unread,
Apr 1, 2010, 2:57:47 PM4/1/10
to wsd-d...@googlegroups.com, Dan Stefanescu
Hello all,

We have already submitted some of our results using WordNet 3.0 sense keys (as in the last SemEval) as opposed to synset identifiers because in the trial data there is a document specifying format answers this way (valid for SemEval 2010). Unfortunately I have to leave tomorrow and I won't be able the change the output format ...

What will happen in this case ?

Thank you,
Radu Ion
RACAI team

Radu ION

unread,
Apr 1, 2010, 3:58:04 PM4/1/10
to wsd-d...@googlegroups.com, Dan Stefanescu
Ok, I have taken care of this (have converted the outputs). No need to worry.

Thank you,
Radu

Oier Lopez De Lacalle Lekuona

unread,
Apr 2, 2010, 12:47:43 PM4/2/10
to wsd-d...@googlegroups.com

Radu,

Sorry for not replying before, I've been out of the office the whole
day. Thanks you for converting the output to the correct format.

If the other case had happened, I think that we'd have tried to give
some more time to convert the data.

Best,

Oier

Radu ION <mush...@gmail.com> erabiltzaileak idatzi du:

>>>> WSD-domain+...@googlegroups.com<WSD-domain%2Bunsu...@googlegroups.com>


>>>> .
>>>> For more options, visit this group at
>>>> http://groups.google.com/group/WSD-domain?hl=en.
>>>>
>>>
>>> --
>>> You received this message because you are subscribed to the Google Groups
>>> "SemEval-2010 task 17: all-words WSD on a specific domain" group.
>>> To post to this group, send email to WSD-d...@googlegroups.com.
>>> To unsubscribe from this group, send email to

>>> WSD-domain+...@googlegroups.com<WSD-domain%2Bunsu...@googlegroups.com>

Reply all
Reply to author
Forward
0 new messages