Dear Linas,

Thanks for developing WordNet::SenseKey. It is exactly the program I was looking for. Thanks also to Siddharth and Ted for pointing me to SenseKey.

However, the output from WordNet::SenseKey seems to be a bit different from the corresponding value shown by the WordNet web interface. For example, for the sense number "distinct#a#1", WordNet::SenseKey shows the sense key as "distinct%5:00:00:different:00", whereas the WordNet web interface shows the sense key as "distinct%3:00:00:different:00". I apologise if I am missing something here. More examples follow.

Output from WordNet::SenseKey.pm

****distinct#a#1 [distinct%5:00:00:different:00]
****distinct#a#2 [distinct%3:00:00::]
****distinct#a#3 [distinct%5:00:00:separate:00]
****distinct#a#4 [distinct%5:00:00:definite:00]
****distinct#a#5 [distinct%5:00:00:clear:00]

Output from WordNet web interface http://wordnetweb.princeton.edu/perl/webwn

distinct#1 (distinct%3:00:00:different:00),
distinct#2 (distinct%3:00:00::)
distinct#3 (distinct%3:00:00:separate:00)
distinct#4 (distinct%3:00:00:definite:00)
distinct#5 (distinct%3:00:00:clear:00),

Regards,
Ambikesh Jayal.
School of IS, Computing & Maths,
Brunel University, Uxbridge, UB8 3PH, United Kingdom.
Email: ambikes...@brunel.ac.uk

--- On Mon, 5/4/10, Siddharth Patwardhan <si...@cs.utah.edu> wrote:

The web interface is incorrect in the cases you list. Internally, the
Perl library we used at the lab for all of our projects was somewhat
stupid about the 'adjective'/'satellite' distinction. ('Stupid' in the
sense that it conflated 3 and 5 -- I gave it an option called
conflate35, which defaulted to true.)
The good news is that the 3/5 distinction is redundant with the
:'head-word':'head-sense' trailing portion of a sense key. You can just
do:
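    sub recover_sense35 {
        local $_ = shift;
        # A key ending in '::' has no head word, so it is a true
        # adjective (3); any other key is a satellite and needs a 5.
        /::$/ or s/%3/%5/;
        $_
    }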
to get the correct senses. I'm kind of surprised that the web interface
is wrong -- I thought I'd corrected that at some point -- but maybe it
got reverted when the site moved. (And I'm no longer working at WordNet
anyway.)
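
For what it's worth, here is a minimal sketch of that check applied to the five keys the web interface returned for 'distinct', as quoted earlier in this thread. It relies on the WordNet convention that only adjective satellites (ss_type 5) fill in the head-word and head-sense fields, so a key that ends in '::' is left as a plain adjective (3). The name fix_ss_type is made up for this example; it is not part of WordNet::SenseKey or any other module.

```perl
use strict;
use warnings;

# Hypothetical helper, named for this example only.
# Rewrites the ss_type digit of an adjective sense key to 5 (satellite)
# when the key names a head word, i.e. when it does not end in '::'.
sub fix_ss_type {
    my ($key) = @_;
    $key =~ s/%3:/%5:/ unless $key =~ /::$/;
    return $key;
}

# Keys reported by the web interface for 'distinct' (from this thread).
my @web_keys = qw(
    distinct%3:00:00:different:00
    distinct%3:00:00::
    distinct%3:00:00:separate:00
    distinct%3:00:00:definite:00
    distinct%3:00:00:clear:00
);

my @fixed = map { fix_ss_type($_) } @web_keys;
print "$_\n" for @fixed;
```

With those five inputs, only the second key (the one ending in '::') should keep its 3, which agrees with the keys WordNet::SenseKey printed.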
Best,
Ben
> Hi Ben,
> >>However the output from WordNet::SenseKey seems to be a bit different
> >>from the corresponding value shown by the WordNet web interface.
> >>For example for the sense number "distinct#a#1", the
> >>WordNet::SenseKey shows sensekey as "distinct%5:00:00:different:00"
> >>whereas the WordNet web interface shows sense key as
> >>"distinct%3:00:00:different:00".
>
> >The web interface is incorrect in the cases you list.
>
> You mean the web interface should have
> shown "distinct%3:00:00:different:00" for the sense
> number "distinct#a#1"?
No. The correct sense key has a 5 if it lists a headword. Any key that
*doesn't* end with '::' (which denotes two empty colon-separated fields)
should have a 5.
distinct#a#2 is the only sense of 'distinct' that should be a '3'
(adjective) rather than a '5' (satellite). (Assuming the sense-ordering
is correct in the web interface.) So, most of the adjectives appear to
be wrong. I'm fairly certain that's to do with the server change (which
happened long ago now), because we used sense keys internally for many
things. (Since they're relatively stable, as opposed to sense numbers
which are less stable, or file-offsets which are almost entirely
unstable).
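
To make the rule concrete, here is a rough sketch that splits a sense key into its fields and checks whether the ss_type digit agrees with the presence of a head word. The field names follow the WordNet sense index documentation (lemma%ss_type:lex_filenum:lex_id:head_word:head_id). The two subroutine names are hypothetical and exist only for this illustration.

```perl
use strict;
use warnings;

# Hypothetical helper for illustration; not part of any WordNet module.
# Splits a sense key of the form
#   lemma%ss_type:lex_filenum:lex_id:head_word:head_id
# into named fields. Returns nothing if the key does not match.
sub parse_sense_key {
    my ($key) = @_;
    my ($lemma, $rest) = split /%/, $key, 2;
    return unless defined $rest;
    # The -1 limit keeps the trailing empty fields of keys ending in '::'.
    my @f = split /:/, $rest, -1;
    return unless @f == 5;
    my %k = (lemma => $lemma);
    @k{qw(ss_type lex_filenum lex_id head_word head_id)} = @f;
    return \%k;
}

# True when the ss_type digit agrees with the head-word rule:
# 5 (satellite) needs a head word, every other type must not have one.
sub ss_type_consistent {
    my ($k) = @_;
    my $has_head = length($k->{head_word}) ? 1 : 0;
    return $k->{ss_type} == 5 ? $has_head : !$has_head;
}

for my $key ('distinct%3:00:00::', 'distinct%3:00:00:different:00') {
    my $k = parse_sense_key($key);
    printf "%-32s %s\n", $key, ss_type_consistent($k) ? 'ok' : 'inconsistent';
}
```

Under those assumptions, the first key passes and the second is flagged, because it names the head word 'different' while still carrying a 3.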
Best,
Ben