Aliases for not all Multitoken strings provided in ground truth

22 views
Skip to first unread message

Chitrank G

unread,
Jul 26, 2022, 2:11:45 AM7/26/22
to LM-KBC
Hi organizers

For words like "san francisco" or "goldman sachs", a single-token alias is not provided. Any reason for that? Or it's too much of a leeway for a language model?

Regards 

Simon Razniewski

unread,
Jul 26, 2022, 3:36:12 AM7/26/22
to LM-KBC
Hi,

Yes, not all multi-token names have a reasonable single-token variant, so in some cases we abstained from providing one (about your examples, on second thought, one might actually argue that "SF" and "Goldman" are fair). Anyways, there are very few of those cases, so they shouldn't be decisive.

Cheers,
Simon
Reply all
Reply to author
Forward
0 new messages