Hey everyone, thanks for taking a glance at my issue!
I'm currently trying to replicate a CNN model I had built with Keras/Tensorflow with Caffe so that I could better visualize the layers using the
Deep Visualization Toolbox. Using a particular set of parameters, I've been able to get the Tensorflow CNN to reach 90-95% validation accuracy, and in general 85-95% validation accuracy with other settings.
Regarding Caffe, I currently have the toolbox working and have run the Caffe MNIST example both terminal-wise and with Pycaffe, all of which ran flawlessly with good results, so it seems everything is working properly.
I am using the
CPU version.
Two issues:
My first problem is that when using the exact same CNN configuration as my Tensorflow model, I can't get the Caffe model to break 30%; in fact, it bottoms out to 0%, and the loss keeps diverging which to me indicated a poor learning rate, but it happens no matter what rate I use. I have a feeling that it's certainly user error somewhere, but I'm not sure where I'm going wrong.
Second, and probably related, the Caffe kernel (and terminal-wise) constantly crashes when running my model, especially if I do step-by-step iterations. Even if it doesn't crash, exploring any variable it creates crashes the kernel as well, even when I can print it with no issue. Did the same thing with the MNIST example and no issue. The error message is : *** Error in `/usr/bin/python': double free or corruption (out): 0x0000562ae845a4e0 ***
My model is structured like so:
3 classes
train set: 885 images
test set: 349 images
input size: 1 x 216 x 216 (grayscaled images)
4x:
Conv (5x5, pad 2, stride 1)
Relu
Max Pool (3x3, stride 3)
2x:
Inner Product (output 7)
Relu
Inner Product (output 3)
Softmax
I've attached a few files:
The python script that makes my prototxts.
error log
Any insight would be EXTREMELY helpful!! Thank you! =)