Hello Jessica,
The scoring program is flexible, and you can simply update it to compute F1 score or another metric.
In order to understand why your submission is failing, you can click on
it and explore the different log files, like "error from scoring step",
etc.
Example of scoring program:
More information about how to update it:
Tutorial for organizers
If you need more help, please open a
Github issue or contact us at
in...@codalab.org. The Google group is meant to receive announcements about competitions, benchmarks, conferences and about the platform.
Best regards,
Adrien Pavão
CodaLab Team