hi
I have recently begun using the UBY resources, and have found them to be very useful on the whole, and I have made an initial exploration of these, thanks in large part to some very kind help from Silvana Hartmann. However, I have run into issues to do with aligning various components.
I would like to extract enough information to allow me to align the English FrameNet and English Wiktionary; specifically, given a WK entry I would like to find the matching FN lexical unit.
I have downloaded the UBY SQL dump, and I have discovered that the tables "SenseAxis" and "sense", have fields "senseTwoId" and "senseId", respectively, both of which yield values like "WktEN_sense_...". So I have reasoned that by aligning these tables using these fields, I would be able to then extract English FN lexical units matching the actual written forms from the English WK.
I have written the following SQL queries to carry this out:
(1)
SELECT mer.externalReference AS FNLU, sa.senseTwoId AS WKSENSE
FROM SenseAxis AS sa
JOIN MonolingualExternalRef AS mer ON (mer.senseId=sa.senseOneId)
WHERE substring(sa.senseOneId,1,2)="FN" AND
substring(sa.senseTwoId,1,4)="WktE" AND
mer.externalSystem="FrameNet_1.5_eng_lexicalUnit";
(2)
SELECT s.senseId AS WKSENSE,fr_l.writtenForm AS ENGWRITTENFORM
FROM sense as s
JOIN senserelation as sr ON (s.senseId=sr.senseId)
JOIN formRepresentation_lemma AS fr_l ON (sr.formRepresentationId=fr_l.formRepresentationId)
JOIN lexicalEntry as le ON (le.lemmaId=fr_l.lemmaId)
JOIN lexicon as l ON (l.lexiconId=le.lexiconId)
WHERE l.languageIdentifier='eng' AND
l.lexiconName='FrameNet' AND
fr_l.languageIdentifier='eng';
However, after aligning "SenseAxis.senseTwoId" and "sense.senseId", the FN lexical units that are returned are not in fact an appropriate match for the returned WK written form. For example, "activity" is matched to FN lexical unit with id "7900", which is in fact "book" (as in "Arrrest"). So either something has gone wrong with my SQL queries, and/or I have approached this problem from the wrong angle, and/or what I am trying to do is not in fact possible in UBY.
Either way, any help with this would be very much appreciated.
Many thanks
Andrew Gargett