Hi,
Just to clarify the terminology about the robustness of a tracker: in VOT there are these 3 terms, which have been used in the paper through the years:
- failures: total number of failures on the whole dataset (it can be found in the robustness table generated by the vot toolkit under the column: pooled) [lower is better]
- failure rate: total number of failures normalized with the number of sequences or frames. In the last years the VOT papers use this metric in the final results table to show how often a tracker fails. It is calculated as: 100 * (total number of failures) / (total number of frames in the dataset) and it can be interpreted as average number of failures on 100 frames. [lower is better]
- robustness: probability of a tracker that it will successfully track a video segment, which is S frames long, without a failure. This measure is normalized to an interval [0, 1]. Robustness is calculated using the following equation: exp(-failure rate) = exp(-(S * (total number of failures) / (total number of frames in dataset))). This measure is used in the A-R plots. [higher is better]
Best,
Alan