Clarification about the four referring texts per video sequence

22 views

Skip to first unread message

渐明

unread,

Jun 20, 2026, 2:01:05 PMJun 20

to VOT Challenge technical support

Dear VOTSr organizers,

Could you please clarify whether the four referring texts provided for each video sequence correspond to the same target as alternative descriptions, or whether they should be treated as independent queries that may refer to different targets/regions?

We noticed that some expressions can be interpreted in multiple ways, so this clarification would be very helpful for our inference design.