You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to caffe...@googlegroups.com
Hi guys,
I'm trying to train a standard LeNet on MNIST with images loaded from IMAGE_DATA, not DATA (that's because I actually need to train it on smaller digits, about 15x15px, so I want to make sure that it works correctly before plugging in new_width and new_height). Unfortunately, it doesn't work as expected. Training loss rapidly becomes about 10^-9 while testing loss is in the range of 50-60. These would be the perfect signs of overfitting if it wasn't for the fact that this is the exact same LeNet as in the tutorials. Here's the code:
I would *guess* that caffe trains the network on only one batch, which is 100 images but I have no idea how to check that. Could somebody help?
Pastafarianist
unread,
Apr 13, 2015, 5:34:34 PM4/13/15
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to caffe...@googlegroups.com
I confirm that if I replace IMAGE_DATA layers with DATA and an lmdb database generated from the same directory with images, everything works perfectly. The problem is definitely somewhere within IMAGE_DATA.