Hi,
I guess most of people today is struggling to find out the best parameter combination to train or most probably finetune a given network on a given task in Caffe.
Parameters which are involved indeed can be, for instance:
- which layer to learn from scratch
- start learning rate per layer
- learning rate decay policy
- dropout percentage
...
And the best combination is usually searched by grid/random/bayesian search.
I would like to have suggestions by the community about frameworks to handle this search, that is:
- set up multiple training trials (by defining multiple solver.prototxt and train_val.prototxt)
- train all of them for a certain amount of epochs and store the best validation accuracy/loss for each trial
- retrieve the model of the best trial at the best epoch
What do you think of this code:
https://github.com/kuz/caffe-with-spearmint?
Are there any other simple solutions (I would like to use Caffe's Matlab interface......)?
Since I have a single Tesla K40 and usually I can train only one model at a time, a simple Matlab code to run many training trials in series would be fine.
Thanks
Giulia