@dan, @david, thanks.
I looked at the log: exp/train_combined_no_sil/log/create_xvector_feats_train_combined.xx.log
WARNING (select-voiced-frames[5.5.506~1-7b4c5]:main():select-voiced-frames.cc:76) Mismatch in number for frames 563 for features and VAD 557, for utterance id00820-zhX-Mjuc_dc-00083-babble
WARNING (select-voiced-frames[5.5.506~1-7b4c5]:main():select-voiced-frames.cc:76) Mismatch in number for frames 563 for features and VAD 557, for utterance id00820-zhX-Mjuc_dc-00083-music
WARNING (select-voiced-frames[5.5.506~1-7b4c5]:main():select-voiced-frames.cc:76) Mismatch in number for frames 563 for features and VAD 557, for utterance id00820-zhX-Mjuc_dc-00083-noise
LOG (apply-cmvn-sliding[5.5.506~1-7b4c5]:main():apply-cmvn-sliding.cc:75) Applied sliding-window cepstral mean normalization to 162845 utterances, 0 had errors.
WARNING (select-voiced-frames[5.5.506~1-7b4c5]:main():select-voiced-frames.cc:76) Mismatch in number for frames 563 for features and VAD 557, for utterance id00820-zhX-Mjuc_dc-00083-reverb
LOG (select-voiced-frames[5.5.506~1-7b4c5]:main():select-voiced-frames.cc:106) Done selecting voiced frames; processed 32569 utterances, 130276 had errors.
LOG (copy-feats[5.5.506~1-7b4c5]:main():copy-feats.cc:143) Copied 32569 feature matrices.
I am thinking those VAD is from stage 3, here is my stage 3:
if [ $stage -le 3 ]; then
# Take a random subset of the augmentations 5m
#utils/subset_data_dir.sh data/train_aug 5000000 data/train_aug_1m
# use all of them
utils/subset_data_dir.sh data/train_aug 5107552 data/train_aug_1m
utils/fix_data_dir.sh data/train_aug_1m
# Make MFCCs for the augmented data. Note that we do not compute a new
# vad.scp file here. Instead, we use the vad.scp from the clean version of
# the list.
steps/make_mfcc.sh --mfcc-config conf/mfcc.conf --nj 40 --cmd "$train_cmd" \
data/train_aug_1m exp/make_mfcc $mfccdir
# Combine the clean and augmented VoxCeleb2 list. This is now roughly
# double the size of the original clean list.
utils/combine_data.sh data/train_combined data/train_aug_1m data/train
fi
min