Chris,
Thanks for your explanation! If you don't mind, I'm going to walk through the structure of my model so that I can make sure I understand exactly what's happening here.
Assume we have a ResNet model, kernel size of 4x4, and 3 channel images. The model has three stages:
- Stage 1: Filter shape is (4,4,3,16) due to 4x4 kernel, 3 channels, 16 filters
- Stage 2: Filter shape is (4,4,3,32) due to number of filters doubling from stage 1 to 2
- Stage 3: Filter shape is (4,4,3,64) due to number of filters doubling from stage 2 to 3
When I have the code print the shape of the prior function at stage 1, I get (4,4,3,16,16). When it prints at stage 2, I get (4,4,3,16,32), which is where it errors out. Looking at the code you referenced before (
line 193), is that because the kernel prior adopts the same shape as the kernel from line 179, which would be (4,4,3,32), and can't reconcile 16 vs 32?
Is that what's going on here or am I missing something?
Thanks again!
Haley Jennings