I am trying to run caffe on a cluster with 4 GPUs. If I run it I consistently see caffe hanging with this
error :
I1221 05:50:07.595844 17763 blocking_queue.cpp:50] Data layer prefetch queue empty
It runs fine when I run using upto 3 GPUs. On using 4 or more, I see this error randomly occuring
sometimes after 300 iterations with CIFAR10 data or sometimes after 9800 iterations but it
hangs in about 9/10 cases. In some rare case, it passes...
Happens the same for mnist dataset too. I am trying to run on a system with upto 8 GPUs.
Any particular fix for this issue ?
Thanks