Creating custom Name Entity classes
Date: Thu, 8 Nov 2012 16:04:24 +0000
Subject: Re: [nltk-users] Creating custom Name Entity classes
From: Nigel Legg <nigel.l...@gmail.com>
To: nltk-users@googlegroups.com
Take a look at nltk.classify - it has a number of different classifiers
that can be trained on your own labels and features, which should do what
you need.
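
As a rough sketch of what that could look like (not a definitive recipe):
the code below trains nltk.classify.NaiveBayesClassifier on IOB-tagged
tokens, one token at a time. The training sentences, the feature names and
the helper functions (token_features, build_examples, tag_sentence) are all
made up for illustration; only the TOOL / QUANTITY / BMATERIAL classes come
from the question below.

```python
from nltk.classify import NaiveBayesClassifier

# Hypothetical toy data: (token, IOB tag) pairs using the custom classes
# from the question. Real training data would need to be far larger.
TRAIN_SENTS = [
    [("Use", "O"), ("a", "O"), ("hammer", "B-TOOL"),
     ("on", "O"), ("the", "O"), ("brick", "B-BMATERIAL")],
    [("Cut", "O"), ("two", "O"), ("meters", "B-QUANTITY"),
     ("of", "O"), ("wood", "B-BMATERIAL")],
    [("The", "O"), ("axe", "B-TOOL"), ("split", "O"),
     ("the", "O"), ("wood", "B-BMATERIAL")],
    [("Add", "O"), ("one", "O"), ("liter", "B-QUANTITY"),
     ("of", "O"), ("sand", "B-BMATERIAL")],
]


def token_features(tokens, i):
    """Build a feature dict for the token at position i (assumed features)."""
    word = tokens[i].lower()
    return {
        "word": word,
        "suffix2": word[-2:],
        "prev_word": tokens[i - 1].lower() if i > 0 else "<START>",
    }


def build_examples(sents):
    """Turn IOB-tagged sentences into (features, tag) pairs for training."""
    examples = []
    for sent in sents:
        tokens = [tok for tok, _ in sent]
        for i, (_, tag) in enumerate(sent):
            examples.append((token_features(tokens, i), tag))
    return examples


# Any label that appears in the training data becomes a class,
# so the set of entity classes is not fixed in advance.
classifier = NaiveBayesClassifier.train(build_examples(TRAIN_SENTS))


def tag_sentence(tokens):
    """Predict one IOB tag per token, treating each token independently."""
    return [
        (tok, classifier.classify(token_features(tokens, i)))
        for i, tok in enumerate(tokens)
    ]


if __name__ == "__main__":
    print(tag_sentence(["Take", "the", "hammer"]))
```

This ignores the dependency between neighbouring tags, so it is only a
starting point. WordNet-based features, as suggested in the question, could
be added as extra keys in token_features, and a held-out portion of the
tagged data can be scored with nltk.classify.accuracy(classifier, examples).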
On 8 November 2012 15:00, Никола Јелић <nikola.je...@euroicc.com> wrote:
> Hello,
>
> I'm developing an NLP application for extracting named entities. The
> problem with the built-in NLTK named entity module is that the classes
> are fixed.
> For example, suppose I want to create a table of entities like this:
> TOOL - any mechanical implement usable by a human (hammer, axe, etc.)
> QUANTITY - any human measurement (meter, pound, inch, liter, gram, etc.)
> BMATERIAL - any building material (wood, brick, stone, sand, etc.)
> Let's say I have some tagged data in IOB form, which would be split into
> training and testing portions. WordNet would be a nice basis for features.
> Is there a way to create custom classes and their features, and then
> train the algorithm on the tagged data?
> --
> Nikola Jelić
>
> EuroICC d.o.o.
> Tršćanska 21
> 11080 Zemun, Srbija
> tel: +381 11 371 36 65
> fax: +381 11 371 36 66
> e-mail: nikola.je...@euroicc.com
> web: www.euroicc.com
>
-- 
Regards,
Nigel Legg
07722 652866
http://twitter.com/nigellegg
http://uk.linkedin.com/in/nigellegg