In the original MT3 paper, you discuss leave-one-dataset-out / zero-shot experiments.
I'm a bit confused on the results, specifically the reported metric for GutiarSet.
Here is the LODO table (Table 5). Below it is the zero-shot table (Table 6). It looks like most of the entries from Table 6 come from Table 5 for the case when the left-out dataset is equal to the evaluation dataset. Is this correct?
If so, why is GuitarSet reported for the case where MAESTRO is left out? Shouldn't the result be 0.32, where GuitarSet is both the hold-out and the evaluation dataset?
Sorry if this has been pointed out already. If I am misunderstanding the reported results, can you please clarify?