Dear Mohammed,
The Challenge data come from different sources. These data sources have different numbers of records, different sampling frequencies, different numbers of samples, and, importantly, different file formats, among other differences. To make it easier for your code to run on data from different sources, including new data sources, we convert the data from each source to WFDB format, and we run your code on the WFDB versions of the data.
In particular, you should not expect the file format from the SaMiTrop data, and your code will not have access to it. You should not expect that each record will have 4096 samples. You can decide whether to use each lead separately or all of the leads together.
Please see these instructions and commands for how to prepare the public training data for the Challenge; we will prepare the hidden validation and test data, which are from different sources, very similarly:
https://github.com/physionetchallenges/python-example-2025?tab=readme-ov-file#how-do-i-create-data-for-these-scriptsPlease also run the example and scoring code to understand how the data are formatted and will be used by your code.
https://github.com/physionetchallenges/matlab-example-2025https://github.com/physionetchallenges/python-example-2025https://github.com/physionetchallenges/evaluation-2025Best,
Matt
(On behalf of the Challenge team.)
Please post questions and comments in the forum. However, if your question reveals information about your entry, then please email info at
physionetchallenge.org. We may post parts of our reply publicly if we feel that all Challengers should benefit from it. We will not answer emails about the Challenge to any other address. This email is maintained by a group. Please do not email us individually.