Message from discussion
Different concepts for singular and plural forms
Learner" <cmunell@googlegroups.com>
Date: Wed, 31 Aug 2011 01:36:33 -0700 (PDT)
From: Manny <mwa...@gmail.com>
Reply-To: cmunell@googlegroups.com
To: cmunell@googlegroups.com
Message-ID: <22692920.3294.1314779793559.JavaMail.geo-discussion-forums@yqah42>
In-Reply-To: <alpine.LFD.2.00.1108301530250.6016@linux1.gp.cs.cmu.edu>
References: <8320914.224.1314706705619.JavaMail.geo-discussion-forums@yqic37>
<alpine.LFD.2.00.1108301530250.6016@linux1.gp.cs.cmu.edu>
Subject: Aw: Re: [cmunell] Different concepts for singular and plural forms
MIME-Version: 1.0
Content-Type: multipart/mixed;
boundary="----=_Part_3292_19987337.1314779793558"
------=_Part_3292_19987337.1314779793558
Content-Type: multipart/alternative;
boundary="----=_Part_3293_11304440.1314779793558"
------=_Part_3293_11304440.1314779793558
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 7bit
Hi Bryan,
You are of course right, classes versus instances is always a tricky thing.
But I believe, the slightly simpler case of NELL learning a singular and a
plural form of the same class (as I think is the case with hamster /
hamsters) is a bit easier to tackle. Then again, it might require a
different kind of learner. I do wonder if it would be possible to tap Oxford
Dictionaries for that. Granted, there should also be a few restrictions as
to when singular and plural classes become related (or merged), or Apple
(Computers) could end up getting linked to apples.
By the way, we are in the process of trying to employ NELL to measure
semantic relatedness between terms and use it as one of the knowledge bases
our classifiers use to identify potential co-references within texts. The
reason I brought this up is the following: I did notice that our path length
algorithms could not link "hamster" and "rodent", but were quickly able to
find a short path between "hamsters" and "rodents". But there is no need to
"fix" this within NELL if you are running out of resources; we'll try to
extend our KB import process with a reduction step that attempts to find
such cases and merge concepts as necessary. If you want to, I'll report back
and tell you how that went.
Manuel
------=_Part_3293_11304440.1314779793558
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable
Hi Bryan,<div><br></div><div>You are of course right, classes versus instan=
ces is always a tricky thing. But I believe, the slightly simpler case of N=
ELL learning a singular and a plural form of the same class (as I think is =
the case with hamster / hamsters) is a bit easier to tackle. Then again, it=
might require a different kind of learner. I do wonder if it would be poss=
ible to tap Oxford Dictionaries for that. Granted, there should also be a f=
ew restrictions as to when singular and plural classes become related (or m=
erged), or Apple (Computers) could end up getting linked to apples.</div><d=
iv><br></div><div>By the way, we are in the process of trying to employ NEL=
L to measure semantic relatedness between terms and use it as one of the kn=
owledge bases our classifiers use to identify potential co-references withi=
n texts. The reason I brought this up is the following: I did notice that o=
ur path length algorithms could not link "hamster" and "rodent", but were q=
uickly able to find a short path between "hamsters" and "rodents". But ther=
e is no need to "fix" this within NELL if you are running out of resources;=
we'll try to extend our KB import process with a reduction step that attem=
pts to find such cases and merge concepts as necessary. If you want to, I'l=
l report back and tell you how that went.</div><div><br></div><div>Manuel</=
div>
------=_Part_3293_11304440.1314779793558--
------=_Part_3292_19987337.1314779793558--