is this an overfitting problem

26 views
Skip to first unread message

himora...@gmail.com

unread,
May 10, 2016, 10:16:25 PM5/10/16
to Caffe Users
Hi everyone,

I have a situation, don't know whether it is the overfitting or not?

I got 3 sets of data (used data augmentation: rotation and flipping)
  + Training data: 2 millions images
  + Validation data: 52k images (take random from the training data)
  + Testing data
The procedure include 3 steps
  + Step1: train CNN on training data and observe the accuracy on the validation data
  + Step2: use the trained CNN as feature extractor, extract feature on validation data and run SVM on the extracted feature to obtain the final model
  + Step3: test the final model on the testing data
I have run 3 cases
  + Case1: NW1(conv1=48,conv2=128,conv3=128,conv4=128,conv5=128,fc6=4096,fc7=4096,fc8=9699) => accuracy of step 1 is 57%, accuracy of step 3 is x
  + Case2: NW1(conv1=48,conv2=128,conv3=256,conv4=256,conv5=256,fc6=2048,fc7=2048,fc8=9699) => accuracy of step 1 is 61%, accuracy of step 3 is x1 (x1<x)
  + Case3: NW1(conv1=48,conv2=128,conv3=384,conv4=384,conv5=384,fc6=2048,fc7=2048,fc8=9699) => accuracy of step 1 is 64%, accuracy of step 3 is x2 (x2<x)
I have calculate the number of parameter in the case1: 100mil, case2: 70mil, case3: 90mil
because the case2 and case3 have less paramter, higher accuracy in step1, i expect it to have higher accuracy in step 3 but it didn't. Is this the overfitting problem

Hope to hear your suggestion and opinion in my situation

Best regards

 





Reply all
Reply to author
Forward
0 new messages