Greetings,
I have a question about the highest possible score with the current challenge metric:
Let's say the evaluation dataset has more positive chagas cases that the 5% threshold. To my understanding, the highest possible challenge score would then not be 1, but rather be upperbounded by the ratio of the number of positives inside the threshold to the total number of positives in the dataset. In that case, it would also be possible that a perfect algorithm, that gets every prediction correct, receives the same score as an algorithm that misclassified a number of positive samples, as long as they aren't in the 5% threshold (To my understanding).
Is this behavior of the challenge metric desired? If yes, could it be possible, that the current validation dataset for the official phase has more positive chagas cases that the 5% threshold?
Thank you for your time and help!
Best regards,
Jad Haidamous