1. total/average loss on one text-example (or over some range of examples) – as a sort of indicator of 'fit to model'
2. or running/cumulative loss over a range of examples, perhaps up to a full training epoch, as a sort of readout on how well training is progressing.