Dear challenge organizers,
I am using multiprocess in the train_challenge_model function of team_code.py to parallelize each subect's feature extraction on one a separate CPU.
The feature extraction of our code is quite computationally heavy. In the cross-validation you calculate all features individually over 12, 24, 48 and 72 hours, which again mutliplies the needed time for feature extraction.
I was planning to parallelize the 'run_challenge_models' in the same way as the 'train_challenge_model' function. However, the run_model.py (which we are not supposed to edit) already contains the loop over subjects. run_challenge_models (the one which I can edit) has only one subject as an input. The parallelization would therefore need to happen in the run_model.py.
The only possibility to include parallelization over subjects in this step would be to change the run_model() function to assign one cpu to the feature calculation and prediction of one subject.
Is there a way to do this ? Could you maybe just add an n_jobs to this function?
Thank you for your answer.
Best,
Charlotte