Hello,
I am using the WSJ recipe, but with my own data and I cannot figure out what I do wrong. I do not see anything useful error messages in the log files, but I do not know exactly what to look after either. I suspect it may be ivector extraction, but don't know how to start.
Can you please give me a hint where I could start debugging this?
For offline decoding, I use this decode command and obtain a best_wer:
# nnet3-latgen-faster-parallel --num-threads=4 --online-ivectors=scp:exp/nnet3_online_cmn/ivectors_dev_hires/ivector_online.scp --online-ivector-period=10 --frame-subsampling-factor=3 --frames-per-chunk=140 --extra-left-context=35 --extra-right-context=35 --extra-left-context-initial=0 --extra-right-context-final=0 --minimize=false --max-active=7000 --min-active=200 --beam=15.0 --lattice-beam=8.0 --acoustic-scale=1.0 --allow-partial=true --word-symbol-table=exp/chain2_online_cmn/tree_a_sp/graph_tgpr/words.txt exp/chain2_online_cmn/tdnn1i_sp/final.mdl exp/chain2_online_cmn/tree_a_sp/graph_tgpr/HCLG.fst "ark,s,cs:apply-cmvn --norm-means=false --norm-vars=false --utt2spk=ark:data/dev_hires/split36/17/utt2spk scp:data/dev_hires/split36/17/cmvn.scp scp:data/dev_hires/split36/17/feats.scp ark:- |" "ark:|lattice-scale --acoustic-scale=10.0 ark:- ark:- | gzip -c >exp/chain2_online_cmn/tdnn1i_sp/decode_tgpr_dev/lat.17.gz"
%WER 1.56 [ 283 / 18199, 32 ins, 83 del, 168 sub ] exp/chain2_online_cmn/tdnn1i_sp/decode_tgpr_dev/wer_17_0.0
For online decoding:
# online2-wav-nnet3-latgen-faster --do-endpointing=false --frames-per-chunk=20 --extra-left-context-initial=0 --online=true --config=exp/chain2_online_cmn/tdnn1i_sp_online/conf/online.conf --min-active=200 --max-active=7000 --beam=15.0 --lattice-beam=6.0 --acoustic-scale=1.0 --word-symbol-table=exp/chain2_online_cmn/tree_a_sp/graph_tgpr/words.txt exp/chain2_online_cmn/tdnn1i_sp_online/final.mdl exp/chain2_online_cmn/tree_a_sp/graph_tgpr/HCLG.fst ark:data/dev_hires/split36/7/spk2utt "ark,s,cs:wav-copy scp,p:data/dev_hires/split36/7/wav.scp ark:- |" "ark:|lattice-scale --acoustic-scale=10.0 ark:- ark:- | gzip -c >exp/chain2_online_cmn/tdnn1i_sp_online/decode_tgpr_dev/lat.7.gz"
%WER 98.32 [ 17894 / 18199, 47 ins, 15126 del, 2721 sub ] exp/chain2_online_cmn/tdnn1i_sp_online/decode_tgpr_dev/wer_7_0.0