Hi, all,
I tried LSTM discriminative training recently.
The configuration for LSTM
splice_indexes="-2,-1,0,1,2 0 0"
label_delay=5
num_lstm_layers=3
cell_dim=1024
hidden_dim=1024
recurrent_projection_dim=256
non_recurrent_projection_dim=256
chunk_width=20
chunk_left_context=40
Here are the results of LSTM and my best nnet2 TDNN (for comparison):
MODEL WER
TDNN-Xent 11.45
TDNN-SMBR 10.00
LSTM-Xent 10.89
And below is the WER of LSTM SMBR:
LearningRate Epoch1 Epoch2 Epoch3 Epoch4
0.0000125 11.40 11.73 11.93 11.95
0.00000125 10.53 10.52 10.59 11.71
0.000000125 10.60 10.60 10.59 10.62
So, from results above, LSTM-Xent is better than TDNN-Xent,
but SMBR does not help LSTM as much as TDNN, which is unnormal and I can't find why.
BTW, the LR used in wsj recipre is 0.0000125, I suspect it is too large.
So, is there a LSTM-SMBR result that I can refer to? I can't find one in RESULT file.
And any suggestion to improve my LSTM-SMBR WER?
Thanks.
Xiang