Hi Dan,
I notice that in the smbr, the acoustic_scale's default setting is 0.1, which means compared to language model, it has much less impact on the smbr training.
The intuition of this is that smbr should be trained after the normal way , i.e. CE, CTC can not learn more information out of the acoustic signal anymore. right?
Another question is, if the training domain and the test domain are very different, while doing smbr training,
should we include test data's text in building language model for generating den_lattice?
I can't think clearly of what impact would this cause: say, with test text included, in the den_lattice there might be some new paths appeared,
but these path would have low probability in the training data, therefore smbr will very likely to punish these paths.
Furthermore, is it a good idea to first do some data selection based on the test set's domain, than train acoustic model, or, with this selected sub-data-set only for the follow on smbr training?
Thanks in advance.
Best