As a reference I'm using Shelhamer Github
https://github.com/shelhamer/fcn.berkeleyvision.org1. Why at the first convolutional layer there is padding of 100? If needed we just could have upscaled input image to obtain better initialization.
2. Why there is no convolutional and deconvolutional layer weights initialization specified - all weights are zeros?
3. I'm trying to train fcn8 on my data - I've created two HDF5 files, one with image data and other with masks(just two classes, zero and one), and it is just only modification I've made. And net learns nothing, all weights stays zeros. Then I've added xavier weight initialization, and again, net learns nothing, though output not all zeros - just mess. I've set low learning rate, e-11, and datasets are correct. Any clues about what might be wrong?