Hmmm...this also seems to be an issue for
http://search.cpan.org/dist/WordNet-Similarity/
Note in the example below that when we don't specify the part of
speech, we don't seem to be able to realize that 'are' is a verb, but
that when we specify the part of speech as 'are#v' we find it...
marimba(7):
similarity.pl --type WordNet::Similarity::wup are be
Loading WordNet... done.
Loading Module... done.
Warning (WordNet::Similarity::wup::parseWps()) - are#n and be#v belong
to different parts of speech.
Warning (WordNet::Similarity::wup::parseWps()) - are#n and exist#v
belong to different parts of speech.
Warning (WordNet::Similarity::wup::parseWps()) - are#n and equal#v
belong to different parts of speech.
Warning (WordNet::Similarity::wup::parseWps()) - are#n and
constitute#v belong to different parts of speech.
Warning (WordNet::Similarity::wup::parseWps()) - are#n and embody#v
belong to different parts of speech.
Warning (WordNet::Similarity::wup::parseWps()) - are#n and cost#v
belong to different parts of speech.
are#n#1 be#n#1 0.352941176470588
marimba(8):
similarity.pl --type WordNet::Similarity::wup are#v be#v
Loading WordNet... done.
Loading Module... done.
be#v#1 be#v#1 1
This is pretty inconsistent - I would have hoped that we'd report the
same result in both cases since this command should return the max
similarity regardless of pos (and 1 is clearly greater than .35 so the
first version is missing something important).
So, I think we need to look into this. :) We'll keep you posted!
(Thanks to kido, btw, for noticing this...)
Thanks,
Ted