Entity missing entity_type attribute value

20 views
Skip to first unread message

marty

unread,
May 23, 2012, 3:50:54 PM5/23/12
to Zemanta Developers
Hi there,

I just noticed an entity being returned by "suggest" that has a
Freebase, a Wikipedia and a DBPedia link associated with it, yet no
"entity_type" (example below). Looking at the DBPedia URL, I can see a
lot of very meaningful type definitions in there, so I would expect
the entity to have an easily extractable entity_type.

Is this a bug or just something I'm missing?

Thanks,
Martin

{'relevance': 0.87091499999999999,
'confidence': 0.84897400000000001,
'entity_type': [],
'target': [{'url': 'http://en.wikipedia.org/wiki/IDA_Ireland', 'type':
'wikipedia', 'title': 'IDA Ireland'},
{'url': 'http://rdf.freebase.com/ns/en/ida_ireland', 'type': 'rdf',
'title': 'IDA Ireland'},
{'url': 'http://dbpedia.org/resource/IDA_Ireland', 'type': 'rdf',
'title': 'IDA Ireland'}],
'anchor': 'IDA Ireland'}

Andraz Tori

unread,
May 23, 2012, 5:06:22 PM5/23/12
to zemanta-d...@googlegroups.com
Yes, we recommend resolving urls and getting your entity types directly
from 'the source' (dbpedia or Freebase).

We've implemented a limited set of entity types which we provide
directly in the API since there was very direct demand for them by some
users.

If you need 'the whole story', by all means just dereference the url! :)

bye
andraz

marty

unread,
May 24, 2012, 6:36:43 AM5/24/12
to Zemanta Developers
I see. Well, to be honest I'm not asking for "the whole story" - in
the example shown above, the system doesn't even return a single type
where there are clearly many available. I guess my confusion is what's
the point in returning the entity_type attribute at all if it's
sometimes extracted and sometimes not..

Thanks

Andraz Tori

unread,
May 24, 2012, 8:45:03 AM5/24/12
to zemanta-d...@googlegroups.com, marty
On 05/24/2012 12:36 PM, marty wrote:
> I see. Well, to be honest I'm not asking for "the whole story" - in
> the example shown above, the system doesn't even return a single type
> where there are clearly many available. I guess my confusion is what's
> the point in returning the entity_type attribute at all if it's
> sometimes extracted and sometimes not..

Agreed,
there are two things at play:
- we only support a small subset of all types available in those two
databases, if it's not one of those types, we ignore it
- when the types are right we try to extract them, but that's not always
done properly and we continuously try to improve on that.

bye
andraz
Reply all
Reply to author
Forward
0 new messages