Hi Jonathan,
Thank you for your reply. My case is a bit different. I also saw many people separate the training set and test set before they did multiple imputations. But in my cases it is different. I first use different variables and a bigger sample size to do multiple imputations. And take the variable and patient I need for this model. For example, I used a, b, c, d, e to do FCS multiple imputations with 500 patients to get 5 big imputed datasets. I did this because this dataset is to prepared for several projects in our department, and with many important variables (I have consulted with clinicians for variable selection), the result will be more reliable.
And take variable d, e to add in my prediction model. So my prediction model will have d, e, g, h,z with 250 patients and 5 imputed datasets (250 patients from the 500 patient above, because of specific patient criteria selection). So start from now on, I start to do the prediction model study.
Now I should separate them in the training set and test set. I am not sure shall I do random sampling in one dataset and keep the other 4 with the same patients. Or do 5 times random sampling.
Best,
Dandi