While working `Doc2Vec` loss-reporting could be a useful addition, & would be a welcome contribution from a capable programmer, most applications of `Doc2Vec` have no need for a loss-readout.
It's not necessary to interpret the results, or evaluate the performance of the model's outputs for specific purposes. None of the original work introducing the algorithm reports the loss, or any uses of the value for evaluation.
Some have the mistaken belief that training loss is some indicator of the general quality of a model – perhaps that's the case with your reviewer?
But training-loss is only a narrow measure of how well the model has adapted to the training data. A model with lower loss on its training corpus might actually be worse at intended downstream tasks than one with a higher loss – notably in cases of overfitting. So for data science purposes, other fuller measures of success at the ultimate tasks are usually more informative than the internal loss of an unsupervised modeling step.
Running-loss during training could plausibly be helpful in determining dynamically, rather than via a prechosen number of training-epochs, when further SGD-based optimization can't make a model any better at its internal prediction target. When running loss stops improving you might as well end training – from that point on, each epoch is only slightly jittering the model to be better on some cases but equally worse on others. Still, sometimes ML models don't even want to train to that loss-minimization point – instead using "early-stopping" – because of the risk that focusing on loss alone leads to overfitting/less-generalization.
However, it doesn't seem like dynamic stopping is what you're seeking. If your paper is about some sort of predictor, maybe what your reviewer really wants is the loss of your overall technique, *not* the internal loss of one contributing unsupervised Doc2Vec step?
If I'm overlooking some other tangible benefits of a loss readout, please let me know.
And if your need is acute, adding the kind of loss-reporting you need is likely a small project, for any Python data science coding talent you have on your project or could contract.
- Gordon