Right now this is the output:
I0211 14:03:39.892946 18839 solver.cpp:340] Iteration 0, Testing net (#0)
10:06:04.719017 1395 solver.cpp:340] Iteration 0, Testing net (#0)
I0211 12:42:16.511376 1395 solver.cpp:408] Test net output #0: accuracy = 0
I0211 12:42:16.522457 1395 solver.cpp:408] Test net output #1: loss = 3.23429 (* 1 = 3.23429 loss)
I0211 12:44:29.792454 1395 solver.cpp:236] Iteration 0, loss = 7.64491
I0211 12:44:29.792937 1395 solver.cpp:252] Train net output #0: loss = 7.64491 (* 1 = 7.64491 loss)
I0211 12:44:29.793012 1395 sgd_solver.cpp:106] Iteration 0, lr = 0.01
caffe: malloc.c:3700: _int_malloc: Assertion `victim->fd_nextsize->bk_nextsize == victim' failed.
*** Aborted at 1455212803 (unix time) try "date -d @1455212803" if you are using GNU date ***
PC: @ 0x7f51a1ccfcc9 (unknown)
The strange thing is that the network hasn't crashed (i.e. the train command is still being executed, but the top command doesn't show Caffe as running). I haven't seen this before. Also, I understand the dataset size isn't directly proportional to the training speed, but are they completely unrelated?
Thanks for your help.