Hi everyone,
I'm new to the challenge and have some difficulty understanding how to use the D2 data for a simple submission. I want to make predictions for ADAS13.
The spreadsheet provided for the simple submission asks for several RIDs and specific forecasting dates (starting 2018-01, up to 2022-12). However, RID==2, for instance, only has data up to 2015-09 in the D1_D2 data csv. Does that mean I can utilize any data that is provided in the csv to forecast this specific time-range, regardless of whether the D2 in the csv column is labelled 1?
That would mean I can build a time-dependent model that takes in longitudinal data for each individual for making predictions for D2. Correct?
However, although not necessary for the simple submission, if I understand correctly, D3 is a purely cross-sectional data set, meaning that I could only inform my model using 'baseline' data (as if it's the baseline of a clinical trial). That would ask for a different kind of model that does not use longitudinal data but only cross-sectional data of an individual.
Are these two different kinds of challenges, or am I looking at it wrong?
Thanks!