I am still newbie in caffe and python. But, I also interest in this after reading
deep compression paper. I want to try the pruning first. According to the paper,
Pruning is implemented by adding
a mask to the blobs to mask out the update of the pruned connections
But, I am not sure how to do this. Have you finally done it?
Is it using eltwise layer by product the pruned weight by 0? But I am still not sure how to do this. Any clue or reference will help.. i.e. how to product binary array {0,1,1,0,0} as mask to the weight.
Or modify the cpp inside caffe itself?
I also find about this
backward_gpu function in eltwise layer that has
const vector< bool > &propagate_down parameter. But I still don't know how to use it in python with net surgery method.