Well, in theory, it supports also other evaluation measures as well as older datasets, but we are still debugging some inconsistencies in scores in comparison to Matlab toolkit. Until we at least know their cause we cannot officially support them because people would start opening tickets. And since I am working on another project at the moment, the progress is slow. But if anyone is willing to test older datasets and help us with debugging, this would speed up the progress.