Nice post - many of the issues you brought up are precisely what
motivated us to create MLcomp. Currently, the site is somewhat
active, but we would love to have more people contribute! All the
code is open-sourced and available here:
https://github.com/percyliang/mlcomp
The README should have instructions on how to get set up.
Here are a bunch of bug/feature requests that's on our plate:
https://github.com/percyliang/mlcomp/issues
The site currently is quite functional and general, but getting
widespread adoption remains a challenge. Often ML workshops will have
competitions with standard benchmarks, and a good first step would be
to get them use MLcomp as the platform. We have been talking to
various groups, but there's a lot of inertia: people want to be able
to run their algorithms using their own custom setup (even though
MLcomp allows arbitrary binaries to execute - it just has to be
self-contained). Another thing is that we should probably do is just
seed MLcomp with more standard datasets/algorithms that researchers
are actively working on - once there is critical mass, more people are
likely to use it because it'll be convenient to do comparisons. A lot
of people download datasets/algorithms from the site without running
them there - maybe because they have private data. Please let us know
if you have any other ideas.
Cheers,
-Percy