Thank you Przemek,
I've attached my solver and network prototexts.
The network prototext came from the
paper's author and the solver came from a caffe tutorial I found.
I'm using Caffe version 1.0.0 compiled from source. One thing I forgot to mention before is that the author provided modified accuracy and euclidean distance source files. They did not initially compile with this version of caffe but I updated the #includes to adjust to the layer location changes mentioned
here: and they compiled fine.
Can you think of any reason why training would work on the CPU but not the GPU?