Is this the correct way of providing a validation set in Caffe?


Hossein Hasanpour

Apr 19, 2016, 2:36:34 AM
to Caffe Users
Hello everyone.
I want to provide a validation set for my dataset (CIFAR10) in Caffe.
To my knowledge, the validation set is a fraction of the test set, right? Like 25% of the actual test set.
Now, something crossed my mind today: is it right to split my solver file into two and configure them as follows?
Suppose that in CIFAR10 we have 10,000 test images, and that the whole test set is converted into LMDB.
Also suppose my batch size for testing is 100, as below (and 50 for training):

name: "CIFAR10_full"
layer
{
  name
: "cifar"
  type
: "Data"
  top
: "data"
  top
: "label"
  include
{
    phase
: TRAIN
 
}
  transform_param
{
    mean_file
: "examples/cifar10/mean.binaryproto"
    mirror
: true
 
}


  data_param
{
    source
: "examples/cifar10/cifar10_train_lmdb"
    batch_size
: 50#100
    backend
: LMDB
 
}
}
layer
{
  name
: "cifar"
  type
: "Data"
  top
: "data"
  top
: "label"
  include
{
    phase
: TEST
 
}
  transform_param
{
    mean_file
: "examples/cifar10/mean.binaryproto"
 
}
  data_param
{
    source
: "examples/cifar10/cifar10_test_lmdb"
   
batch_size: 100
    backend
: LMDB
 
}
}

So far so good. Now, in one solver file, which would act as my train/val solver, I would write:
# test_iter specifies how many forward passes the test should carry out.
# Here the test batch size is 100 and we run 25 test iterations,
# covering 2,500 of the 10,000 testing images.
test_iter: 25 # 25*100 = 2,500, which is one quarter (25%) of the 10,000

# Carry out testing every 1000 training iterations.
test_interval: 1000

# The base learning rate, momentum and the weight decay of the network.
base_lr: 0.1
momentum: 0.9
...
And in the second solver I would write:

# The train/test net protocol buffer definition
net: "examples/cifar10/cifar10_full_relu_train_test_bn.prototxt"
# test_iter specifies how many forward passes the test should carry out.
# In the case of CIFAR10, we have test batch size 100 and 100 test iterations,
# covering the full 10,000 testing images.
test_iter: 100 # 100*100 = 10,000, the whole test set
# Carry out testing every 1000 training iterations.
test_interval: 1000
# The base learning rate, momentum and the weight decay of the network.
base_lr: 0.1
momentum: 0.9

So basically what I'm doing here is limiting the number of test iterations in the validation solver (25 instead of 100, which multiplied by the batch size yields 25*100 = 2,500, i.e. 25% of the actual test set size), and then in the test solver using the normal number of test iterations, i.e. 100 (100*100 = 10,000), so that the whole test set is used.
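
For what it's worth, this is how the first solver could be driven from pycaffe (a minimal sketch; "solver_val.prototxt" is a hypothetical name for the first solver file above):

import caffe

caffe.set_mode_gpu()

# Train with the validation solver: every test_interval (1000) iterations,
# Caffe runs 25 forward passes of the TEST net, i.e. 25 * 100 = 2,500 images.
solver = caffe.get_solver('examples/cifar10/solver_val.prototxt')
solver.solve()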

Is this right or not? I'd be grateful to know.
Thanks in advance.

Jan

Apr 19, 2016, 5:58:06 AM
to Caffe Users
A good way to solve this is using stages. You can view stages as a further subdivision of the phases, in particular the TEST phase. All stages are run in sequence. You can specify/"create" stages by adding this directive to your solver:

test_state {
  stage: "name_for_the_stage"
}

That name is used to identify the stage. You need to give as many "test_iter" values as you have stages. In the net definition you can use the stage name in the "include" directive to restrict a layer to a specific stage. So if you want separate validation and testing stages, you could do the following:

net.prototxt:

name: "CIFAR10_full"
layer {
  name: "cifar"
  type: "Data"
  top: "data"
  top: "label"
  include {
    phase: TRAIN
  }
  transform_param {
    mean_file: "examples/cifar10/mean.binaryproto"
    mirror: true
  }
  data_param {
    source: "examples/cifar10/cifar10_train_lmdb"
    batch_size: 50
    backend: LMDB
  }
}
layer {
  name: "cifar"
  type: "Data"
  top: "data"
  top: "label"
  include {
    phase: TEST
    # the stage name must match a test_state in the solver
    stage: "validation"
  }
  transform_param {
    mean_file: "examples/cifar10/mean.binaryproto"
  }
  data_param {
    source: "examples/cifar10/cifar10_val_lmdb"
    batch_size: 100
    backend: LMDB
  }
}
layer {
  name: "cifar"
  type: "Data"
  top: "data"
  top: "label"
  include {
    phase: TEST
    stage: "test"
  }
  transform_param {
    mean_file: "examples/cifar10/mean.binaryproto"
  }
  data_param {
    source: "examples/cifar10/cifar10_test_lmdb"
    batch_size: 100
    backend: LMDB
  }
}


And for the solver:

# The train/test net protocol buffer definition
net: "examples/cifar10/cifar10_full_relu_train_test_bn.prototxt"

# test_iter specifies how many forward passes each test stage should carry out.
test_iter: 25 # 25*100 = 2,500, the validation set size
test_iter: 75 # 75*100 = 7,500, the test set size
# Of course you need to divide the original test set into separate sets of the
# above specified sizes.

# define the stages, one per test_iter value, in the same order
test_state { stage: "validation" }
test_state { stage: "test" }

# Carry out testing every 1000 training iterations.
test_interval: 1000

# The base learning rate, momentum and the weight decay of the network.
base_lr: 0.1
momentum: 0.9

...
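
To divide it, one option is to copy entries out of the original test LMDB with the lmdb Python package. A minimal sketch (the sequential first-2,500/last-7,500 split is just one choice, a shuffled split works too; note the reduced test set is written to a new path here, so the "test" stage's source above would have to point at it):

import lmdb

def copy_range(src_path, dst_path, start, count):
    """Copy `count` consecutive entries, starting at index `start`,
    from an existing LMDB into a freshly created one."""
    src = lmdb.open(src_path, readonly=True)
    dst = lmdb.open(dst_path, map_size=1 << 30)  # 1 GB, plenty for CIFAR10
    with src.begin() as stxn, dst.begin(write=True) as dtxn:
        for i, (key, value) in enumerate(stxn.cursor()):
            if i >= start + count:
                break
            if i >= start:
                dtxn.put(key, value)
    src.close()
    dst.close()

# First 2,500 entries -> validation set, remaining 7,500 -> test set.
copy_range('examples/cifar10/cifar10_test_lmdb',
           'examples/cifar10/cifar10_val_lmdb', 0, 2500)
copy_range('examples/cifar10/cifar10_test_lmdb',
           'examples/cifar10/cifar10_test_split_lmdb', 2500, 7500)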


Jan

Erik Christiansen

Jul 27, 2016, 1:51:02 PM
to Caffe Users
What is the difference between Validation and Test here? My understanding is that the 'Validation' set is used to tune your hyperparameters, while the 'Test' set does not get touched until after all training is done. So when would 'Test' (rather than 'Validation') get run, and what would it output? I see the Accuracy output during training when it does a validation run, but does this also occur with a test run?

Hossein Hasanpour

Jul 27, 2016, 11:34:33 PM
to Caffe Users
Yes, that's correct.
You could evaluate on the test set during training as well, but by using a validation set you see how well your network generalizes to unseen data, and whether it over-fits or not.
This depends on how you create your train/val and test sets. I myself create separate databases: one for train/val and the other for test.
Then I simply change the dataset from train/val to test and evaluate the model on it (using the test switch in caffe).
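
(For reference, the "test switch" is the "caffe test" command-line tool, e.g. caffe test -model net.prototxt -weights snapshot.caffemodel -iterations 100. A rough pycaffe equivalent, assuming the net's TEST-phase data layer points at the test LMDB, the snapshot name is hypothetical, and the net has an Accuracy layer with top "accuracy":)

import caffe

caffe.set_mode_cpu()

# Load the trained weights into the TEST-phase net; its Data layer
# feeds batches of 100 images from the test LMDB.
net = caffe.Net('examples/cifar10/cifar10_full_relu_train_test_bn.prototxt',
                'examples/cifar10/cifar10_full_iter_60000.caffemodel',  # hypothetical snapshot
                caffe.TEST)

# 100 batches of 100 images = the full 10,000-image test set.
n_batches = 100
acc = 0.0
for _ in range(n_batches):
    acc += float(net.forward()['accuracy'])
print('test accuracy: %.4f' % (acc / n_batches))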