I wonder why my previous post and plot got deleted. Here is another latest snapshot zoomed at the last part of the iterations.
By running inference you mean computing the accuracy on validation set? Wouldn't so much high loss result in lot false positives?
There is no parameters to set in layers file however a comment reads like this:
def reshape(self, bottom, top):
# load data for tops and reshape tops to fit (1 is the batch dim)
...