is this an overfitting problem

26 views

CNNOverfitting

Skip to first unread message

himora...@gmail.com

unread,

May 10, 2016, 10:16:25 PM5/10/16

to Caffe Users

Hi everyone,

I have a situation, don't know whether it is the overfitting or not?

I got 3 sets of data (used data augmentation: rotation and flipping)

+ Training data: 2 millions images

+ Validation data: 52k images (take random from the training data)

+ Testing data

The procedure include 3 steps

+ Step1: train CNN on training data and observe the accuracy on the validation data

+ Step2: use the trained CNN as feature extractor, extract feature on validation data and run SVM on the extracted feature to obtain the final model

+ Step3: test the final model on the testing data

I have run 3 cases

+ Case1: NW1(conv1=48,conv2=128,conv3=128,conv4=128,conv5=128,fc6=4096,fc7=4096,fc8=9699) => accuracy of step 1 is 57%, accuracy of step 3 is x

+ Case2: NW1(conv1=48,conv2=128,conv3=256,conv4=256,conv5=256,fc6=2048,fc7=2048,fc8=9699) => accuracy of step 1 is 61%, accuracy of step 3 is x1 (x1<x)

+ Case3: NW1(conv1=48,conv2=128,conv3=384,conv4=384,conv5=384,fc6=2048,fc7=2048,fc8=9699) => accuracy of step 1 is 64%, accuracy of step 3 is x2 (x2<x)

I have calculate the number of parameter in the case1: 100mil, case2: 70mil, case3: 90mil

because the case2 and case3 have less paramter, higher accuracy in step1, i expect it to have higher accuracy in step 3 but it didn't. Is this the overfitting problem

Hope to hear your suggestion and opinion in my situation

Best regards

Reply all

Reply to author

Forward

0 new messages