Cleaning up of answers from the ML

38 views

Skip to first unread message

Michael Cochez

unread,

Jul 14, 2022, 2:53:23 PM7/14/22

to LM-KBC

Dear organizers,

We notice that when we get answers from the LM we are using that they does sometimes deviate from the expected outputs in the dev set because of minor issues.

For example, lower vs upper case, use of a different name for the same company, use of an abbreviation instead of the full name, etc.

Is it acceptable that we include a component in our system to clean up those name based on a dictionary lookup? This would not itself be a language model.

Kind regards,

Michael

LM-KBC

unread,

Jul 14, 2022, 3:13:37 PM7/14/22

to LM-KBC

Hi Michael,

Yes, basic string cleaning operations and conversion of abbreviations are allowed on top of LM-generated output.