Referencing both Stanford's CS231n class notes and Caffe's layer documentation, the size of a feature map resulting from a convolution is:
(W - F + 2P)/S + 1
where W = incoming feature map spatial dimension, F = convolution filter dimension, P = padding, and S = stride. However, when training a standard AlexNet (where conv1 has F = 11, S = 4, and no padding) on 64x64 image patches, this formula gives (64 - 11)/4 + 1 = 14.25, which is not an integer. Yet Caffe does not throw an error, and training succeeds. The same calculation applies to feature maps produced by pooling layers.
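For concreteness, here is a minimal sketch of that arithmetic (`conv_output_size` is just an illustrative name, and P = 0 is assumed since AlexNet's conv1 uses no padding). Presumably a framework that rounds the fractional result down would report a 14x14 feature map rather than erroring:

```python
import math

def conv_output_size(W, F, S, P=0):
    """Exact (unfloored) result of the formula (W - F + 2P)/S + 1."""
    return (W - F + 2 * P) / S + 1

exact = conv_output_size(64, 11, 4)  # AlexNet conv1 applied to a 64x64 patch
print(exact)                         # 14.25 -- not an integer
print(math.floor(exact))             # 14 -- what flooring the result would give
```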
Please explain this discrepancy...