Hello
While training on my own custom data set, I get the following warning every time, during mono, tri, and DNN model training:
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 1 with no stats; corresponding phone list: 6 7 8 9 10
The full output is:
============================================================================
tri3 : LDA + MLLT + SAT Training
============================================================================
steps/align_si.sh --nj 15 --cmd run.pl --use-graphs true data/train_100h data/lang exp_gv_100h/tri2_6000_120000 exp_gv_100h/tri2_ali
steps/align_si.sh: feature type is lda
steps/align_si.sh: aligning data in data/train_100h using model from exp_gv_100h/tri2_6000_120000, putting alignments in exp_gv_100h/tri2_ali
steps/diagnostic/analyze_alignments.sh --cmd run.pl data/lang exp_gv_100h/tri2_ali
analyze_phone_length_stats.py: WARNING: optional-silence SIL is seen only 46.365802800119155% of the time at utterance begin. This may not be optimal.
analyze_phone_length_stats.py: WARNING: optional-silence SIL is seen only 48.91980360065467% of the time at utterance end. This may not be optimal.
steps/diagnostic/analyze_alignments.sh: see stats in exp_gv_100h/tri2_ali/log/analyze_alignments.log
steps/align_si.sh: done aligning data.
steps/train_sat.sh --cmd run.pl 8000 160000 data/train_100h data/lang exp_gv_100h/tri2_ali exp_gv_100h/tri3_8000_160000
steps/train_sat.sh: feature type is lda
steps/train_sat.sh: obtaining initial fMLLR transforms since not present in exp_gv_100h/tri2_ali
steps/train_sat.sh: Accumulating tree stats
steps/train_sat.sh: Getting questions for tree clustering.
steps/train_sat.sh: Building the tree
steps/train_sat.sh: Initializing the model
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 1 with no stats; corresponding phone list: 6 7 8 9 10
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 4 with no stats; corresponding phone list: 19 20 21 22
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 8 with no stats; corresponding phone list: 35 36 37 38
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 9 with no stats; corresponding phone list: 39 40 41 42
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 10 with no stats; corresponding phone list: 43 44 45 46
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 11 with no stats; corresponding phone list: 47 48 49 50
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 13 with no stats; corresponding phone list: 55 56 57 58
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 17 with no stats; corresponding phone list: 71 72 73 74
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 18 with no stats; corresponding phone list: 75 76 77 78
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 19 with no stats; corresponding phone list: 79 80 81 82
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 24 with no stats; corresponding phone list: 99 100 101 102
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 25 with no stats; corresponding phone list: 103 104 105 106
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 26 with no stats; corresponding phone list: 107 108 109 110
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 27 with no stats; corresponding phone list: 111 112 113 114
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 29 with no stats; corresponding phone list: 119 120 121 122
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 35 with no stats; corresponding phone list: 143 144 145 146
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 36 with no stats; corresponding phone list: 147 148 149 150
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 38 with no stats; corresponding phone list: 155 156 157 158
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 39 with no stats; corresponding phone list: 159 160 161 162
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 40 with no stats; corresponding phone list: 163 164 165 166
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 45 with no stats; corresponding phone list: 183 184 185 186
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 48 with no stats; corresponding phone list: 195 196 197 198
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 49 with no stats; corresponding phone list: 199 200 201 202
WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 55 with no stats; corresponding phone list: 223 224 225 226
This is a bad warning.
steps/train_sat.sh: Converting alignments from exp_gv_100h/tri2_ali to use current tree
steps/train_sat.sh: Compiling graphs of transcripts
Pass 1
Pass 2
Estimating fMLLR transforms
Pass 3
Pass 4
Estimating fMLLR transforms
Pass 5
Pass 6
Estimating fMLLR transforms
Pass 7
Pass 8
Pass 9
Pass 10
Aligning data
Pass 11
How do we fix this?
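In case it helps with diagnosis, the integer phone IDs in the warnings can be mapped back to phone symbols through data/lang/phones.txt. Below is a minimal Python sketch of that lookup, assuming phones.txt uses the usual `<symbol> <integer-id>` layout, one entry per line. The function names are my own, and the phone symbols in the demo are made-up placeholders, not my real phone set.

```python
import re

# Matches the gmm-init-model "no stats" warnings as they appear in the log above.
NO_STATS = re.compile(
    r"Tree has pdf-id (\d+) with no stats; corresponding phone list: ([\d ]+)"
)


def parse_no_stats(log_text):
    """Return {pdf_id: [phone_id, ...]} for every 'no stats' warning in log_text."""
    result = {}
    for match in NO_STATS.finditer(log_text):
        pdf_id = int(match.group(1))
        result[pdf_id] = [int(p) for p in match.group(2).split()]
    return result


def load_phone_table(lines):
    """Parse phones.txt-style lines ('<symbol> <integer-id>') into {id: symbol}."""
    table = {}
    for line in lines:
        parts = line.split()
        if len(parts) == 2:
            table[int(parts[1])] = parts[0]
    return table


def describe_missing(log_text, phone_lines):
    """Map each pdf-id without stats to the phone symbols it covers.

    IDs that are absent from the phone table are reported as '?'.
    """
    table = load_phone_table(phone_lines)
    return {
        pdf_id: [table.get(p, "?") for p in phone_ids]
        for pdf_id, phone_ids in parse_no_stats(log_text).items()
    }


# Demo with one warning copied from the log and a hypothetical phone table.
sample_log = (
    "WARNING (gmm-init-model[5.5]:InitAmGmm():gmm-init-model.cc:55) "
    "Tree has pdf-id 4 with no stats; corresponding phone list: 19 20 21 22\n"
)
sample_phones = ["X_B 19", "X_E 20", "X_I 21", "X_S 22"]  # placeholder symbols
print(describe_missing(sample_log, sample_phones))
```

On the real data, `log_text` would be the contents of the log that contains these warnings and `phone_lines` the lines of data/lang/phones.txt. Knowing which symbols never received statistics should show whether those phones occur in the training transcripts at all.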
Regards