Hi Rowan,
As I understand it, there are 4 values for the Q -> A task and 16 values for the QA -> R task (e.g. rationale_conditioned_on_a0_0...) per question.
But I guess only 4 of the 16 values, the ones corresponding to the predicted answer, are used when calculating QA -> R accuracy. Can I just fill in the remaining 12 values with zeros?
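To make the question concrete, here is a rough sketch of what I have in mind. The helper name `build_rationale_row` is hypothetical, and every column name other than rationale_conditioned_on_a0_0 is my own extrapolation of that pattern, so please correct me if the real format differs.

```python
NUM_CHOICES = 4  # assumed: 4 answer choices and 4 rationale choices per question


def build_rationale_row(predicted_answer, rationale_probs):
    """Build the 16 QA -> R values for one question.

    Only the 4 values conditioned on the predicted answer are filled in;
    the other 12 are set to zero.
    """
    row = {}
    for a in range(NUM_CHOICES):
        for r in range(NUM_CHOICES):
            # Column names extrapolated from rationale_conditioned_on_a0_0 (assumption).
            key = f"rationale_conditioned_on_a{a}_{r}"
            row[key] = float(rationale_probs[r]) if a == predicted_answer else 0.0
    return row


# Example: the model predicted answer 2 and scored the 4 rationales for it.
example_row = build_rationale_row(2, [0.1, 0.2, 0.3, 0.4])
```

If the evaluation also reads the values conditioned on other answers, this zero-filling would presumably hurt the QA -> R score, which is what I would like to confirm.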
Thanks,
Jaemin