First, if by ImageNet network you refer to Alxnet, note that it is a big network and it requires many images to be trained.
Regarding your question, perhaps the base_lr in solver is large. Try to reduce it. For example if it is currently set to 0.001, try to decrease it to 0.0001. If you get nan again, try to reduce more.
you may also consider adding scale:0.00039215 to your imagedata layer. But try decreasing the base_lr first.
I hope that helps.