I am writing to clarify a few details regarding the Multimodal Semantic Segmentation Challenge. I have successfully downloaded the dataset from the provided link (https://lmi.fe.uni-lj.si/en/MULTIAQUA/) and have begun my preliminary analysis.
Based on my understanding of the dataset structure, I have the following questions regarding the evaluation and submission process:
Label Mapping: Could you confirm if the model’s predicted masks should use the following integer values for each class for submission?
1: Static obstacle
2: Dynamic obstacle
3: Water
4: Sky
Submission Format: Is the correct submission procedure to generate inference masks with the same resolution as the provided annotation images, and then submit a compressed (.zip) folder containing these predicted mask images?
Thank you very much for your time and for organizing this interesting challenge. I look forward to your guidance.