Data freeze in effect


robvanderg

Jul 21, 2021, 5:57:55 AM
to MultiLexNorm
Dear all, 

As of now, the data freeze is in effect. If any serious issues are still found and we decide to update the data after all, I will post about it here.

Thanks to everyone who submitted corrections!

Best, 
Rob

Rob v

Aug 19, 2021, 11:47:11 AM
to MultiLexNorm
Dear all, 

Unfortunately, I found some more issues in the data. I fixed these a minute ago and pushed the fixes to the repo. Two things have changed:

- The Danish dataset became much larger
- Some previously uncaught interjections (of the following 4 types: hmm, haha, eeh, xxx) were normalized. We decided to keep those in their original form throughout the data, so these are now left unchanged.
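
For anyone who wants to check their own copy or system output against the interjection rule, here is a minimal sketch. It assumes one token per line in the form raw<TAB>normalization, with blank lines between sentences. The regular expression is only a guess at what the four types cover (lengthened variants such as "hmmm" or "hahaha" are an assumption), and the function name is hypothetical.

```python
import re

# Assumed patterns for the four interjection types (hmm, haha, eeh, xxx).
# Which lengthened variants actually occur in the data is not specified here.
INTERJECTION = re.compile(r"^(?:h+m+|(?:ha){2,}h?|e+h+|x{3,})$", re.IGNORECASE)


def changed_interjections(lines):
    """Return (line_number, raw, norm) for interjections whose
    normalization differs from the raw token."""
    hits = []
    for number, line in enumerate(lines, start=1):
        line = line.rstrip("\n")
        if not line.strip():
            continue  # blank line: assumed sentence boundary
        parts = line.split("\t")
        if len(parts) != 2:
            continue  # skip lines that do not fit the assumed format
        raw, norm = parts
        if INTERJECTION.match(raw) and raw != norm:
            hits.append((number, raw, norm))
    return hits


# Made-up sample lines, not taken from the actual data.
sample = ["hmm\thmm", "haha\tha", "", "eeh\teh", "xxx\txxx", "u\tyou"]
print(changed_interjections(sample))
# → [(2, 'haha', 'ha'), (4, 'eeh', 'eh')]
```

An empty result means every matched interjection is kept in its original form. To check a real file, pass an open file handle instead of the sample list.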

I strongly suggest re-training at least your Danish model.

Sorry for the inconvenience, 

Rob