![]()
![]()
As you can see, there is no one convention for prediction format. Yes of course, if we are talking about production system for users we should handle these cases user oriented. As I correctly understand the evaluation of result will be done automatically. Evaluation section of task description indicates “Systems will be evaluated using the ERRANT scorer.” This scorer work with Span-based and Token-based matches. In this case, if my system predicts (img 2) "did n't" instead "didn ` t" will it be span and token error?
Thank you in advance,