Definition of terms used in documentation

Robert Dale

Jun 23, 2018, 1:26:28 AM
to Dandelion Support Forum
Hi

I just have a couple of clarificatory questions:

1  Is the Dandelion concept of a 'spot' the same thing as is referred to as a 'mention' in other NER frameworks, or is there a subtle distinction I am missing?

2  What exactly does 'confidence' measure:  confidence that the located string corresponds to the associated Wikipedia page, or something else?

Thanks

R

ber...@dandelion.eu

Jun 28, 2018, 1:04:01 PM
to Dandelion Support Forum
Hi Robert,

1) Yes. A spot is the text segment that can be related to an entity, so it could equally be called a mention. In entity recognition we link each spot to one Wikipedia entity, chosen from a list of candidate entities for that spot (e.g. for the spot "apple" we must choose between the company entity and the fruit entity).

2) The confidence expresses how confident the system is in linking a spot to one specific entity. A confidence of 1 means we are absolutely sure about an annotation (e.g. given the phrase "The US president Donald Trump", for the spot "Donald Trump" we can be 0.99 confident that it refers to the current US president!)
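
To make the two answers concrete, here is a minimal Python sketch of the idea. It is only an illustration and does not show how Dandelion works internally: the `Candidate` and `Annotation` classes, the `link_spot` and `filter_by_confidence` functions, and all the scores are hypothetical. In particular, using the best candidate's score as the confidence is an assumption made for the example, since the thread does not say how confidence is computed.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    entity: str   # hypothetical Wikipedia page title
    score: float  # hypothetical linking score in [0, 1]


@dataclass
class Annotation:
    spot: str          # the text segment (a "mention" in other frameworks)
    entity: str        # the single entity chosen for this spot
    confidence: float  # how sure we are about the spot -> entity link


def link_spot(spot: str, candidates: list[Candidate]) -> Annotation:
    """Pick one entity for a spot from its candidate list.

    Assumption: the highest-scoring candidate wins and its score is
    reported as the confidence. A real system may compute this differently.
    """
    best = max(candidates, key=lambda c: c.score)
    return Annotation(spot=spot, entity=best.entity, confidence=best.score)


def filter_by_confidence(annotations: list[Annotation],
                         min_confidence: float) -> list[Annotation]:
    """Keep only annotations at or above the given confidence threshold."""
    return [a for a in annotations if a.confidence >= min_confidence]


# Made-up candidate lists for two spots.
candidates_by_spot = {
    "apple": [Candidate("Apple Inc.", 0.72), Candidate("Apple", 0.28)],
    "Donald Trump": [Candidate("Donald Trump", 0.99)],
}

annotations = [link_spot(spot, cands)
               for spot, cands in candidates_by_spot.items()]
confident = filter_by_confidence(annotations, min_confidence=0.8)

for a in confident:
    print(a.spot, "->", a.entity, a.confidence)
```

With these made-up scores, the ambiguous spot "apple" is linked to the company entity but falls below the 0.8 threshold, so only the "Donald Trump" annotation is kept.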

Cheers

Giacomo Berardi
Dandelion Team