ADAM not working sometimes


Shawn Liu

Jun 14, 2016, 8:31:53 AM
to Caffe Users
Hi all,

I've been using Caffe to train some ConvNets recently, and I've always used ADAM as the default optimization strategy. However, I've noticed that sometimes it just doesn't work: the test accuracy barely changes, and neither does the training loss. If I switch to ADAGRAD, though, I can see the test accuracy increase.

Any ideas about what might be going wrong? I used the bvlc_alexnet train_val prototxt. My solver configurations are below:

test_iter: 377
test_interval: 879
base_lr: 0.005
display: 109
max_iter: 26370
lr_policy: "exp"
gamma: 0.999805505096
weight_decay: 0.0005
snapshot: 879
snapshot_prefix: "snapshot/snapshot"
solver_mode: GPU
net: "train_val_conv_5.prototxt"
solver_type: ADAGRAD
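
For reference, here's a minimal NumPy sketch of the per-parameter update I understand ADAGRAD to perform (eps stands in for Caffe's delta term, which I believe defaults to 1e-8; the names are mine, not Caffe's):

import numpy as np

def adagrad_step(w, g, h, lr=0.005, eps=1e-8):
    # Accumulate the running sum of squared gradients, then shrink
    # the step for parameters that have seen large gradients so far.
    h = h + g * g
    w = w - lr * g / (np.sqrt(h) + eps)
    return w, h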

test_iter: 377
test_interval: 879
base_lr: 0.001
momentum: 0.9
momentum2: 0.999
display: 109
max_iter: 26370
lr_policy: "fixed"
snapshot: 879
snapshot_prefix: "snapshot/snapshot"
solver_mode: GPU
net: "train_val_conv_5.prototxt"
solver_type: ADAM
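
And the corresponding sketch for ADAM, where momentum and momentum2 map onto beta1 and beta2 in the usual formulation (again eps stands in for Caffe's delta, assumed 1e-8; this is just my understanding, not Caffe's exact code):

import numpy as np

def adam_step(w, g, m, v, t, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    # Exponential moving averages of the gradient and its square.
    m = beta1 * m + (1 - beta1) * g
    v = beta2 * v + (1 - beta2) * g * g
    # Bias correction for the zero-initialized moments (t starts at 1).
    m_hat = m / (1 - beta1 ** t)
    v_hat = v / (1 - beta2 ** t)
    w = w - lr * m_hat / (np.sqrt(v_hat) + eps)
    return w, m, v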

Thanks,
Siyuan

Shawn Liu

Jun 14, 2016, 12:46:22 PM
to Caffe Users
Any help?