Dear Caffe experts,
I am trying to fine-tune the MNIST model by training on 500 new images (same number of classes, 10) and testing on 200 images.
I am getting an error saying "Cannot copy param 0 weights from layer 'conv1'; shape mismatch. Source param shape is 20 1 5 5 (500); target param shape is 20 3 5 5 (1500). To learn this layer's parameters from scratch rather than copying from a saved net, rename the layer."
The only thing I changed in the model file was the data path, and in the solver I only changed the names of the model and the LMDB. Apart from those changes, everything is the same.
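To make the error easier to read, here is a small sketch in Python of the arithmetic behind the numbers it reports. The two shapes are copied from the error message; `param_count` and `mismatched_axes` are helper names made up for illustration, not Caffe functions. The only dimension that differs is the second one, which in Caffe's convolution weight layout (num_output, input channels, kernel height, kernel width) is the input-channel count. So it looks like my new LMDB may contain 3-channel images while the saved net was trained on 1-channel ones, though I have not confirmed that.

```python
from math import prod  # Python 3.8+

# Shapes copied from the error message.
# Layout: (num_output, input_channels, kernel_h, kernel_w)
source_shape = (20, 1, 5, 5)  # conv1 weights stored in the saved net
target_shape = (20, 3, 5, 5)  # conv1 weights the new net expects


def param_count(shape):
    """Number of weights in a blob of the given shape (hypothetical helper)."""
    return prod(shape)


def mismatched_axes(a, b):
    """Indices of the dimensions where two shapes differ (hypothetical helper)."""
    return [i for i, (x, y) in enumerate(zip(a, b)) if x != y]


print(param_count(source_shape))                    # 500, as in the error
print(param_count(target_shape))                    # 1500, as in the error
print(mismatched_axes(source_shape, target_shape))  # [1], the input-channel axis
```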
One more question: assuming I know what overfitting means, how can I tell whether my model is overfitted? (I took the MNIST model and trained it on a subset of the data; the output is "Test net output #0: accuracy = 0.695" and "Test net output #1: loss = 1.41684 (* 1 = 1.41684 loss)".)
Is that overfitted?
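As far as I understand, the test numbers alone cannot answer this, since overfitting is usually judged by comparing the same metrics on the training set and the test set, and I have not included the training-set values here. Below is a minimal sketch in Python of how the test metrics could be pulled out of the log for such a comparison. The log line is copied from the output quoted above; `parse_test_outputs` is a hypothetical helper, not part of Caffe, and it assumes the log keeps the "Test net output #k: name = value" form shown.

```python
import re

# Log text copied from the output quoted above.
log_line = (
    "Test net output #0: accuracy = 0.695 "
    "Test net output #1: loss = 1.41684 (* 1 = 1.41684 loss)"
)


def parse_test_outputs(text):
    """Return {name: value} for each 'Test net output #k: name = value' entry."""
    pattern = r"Test net output #\d+: (\w+) = ([-+]?\d*\.?\d+(?:[eE][-+]?\d+)?)"
    return {name: float(value) for name, value in re.findall(pattern, text)}


metrics = parse_test_outputs(log_line)
print(metrics)
```

The same parsing applied to the training-phase lines of the log would give the values to compare against.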
Your prompt help is much appreciated,
Saman