No features for speed-perturbed utterances

Vishay Raina

Jul 13, 2021, 3:02:01 AM
to kaldi-help
After doing speed perturbation in the WSJ end-to-end recipe, I am getting the following logs while computing MFCCs for the speed-perturbed set:

compute-mfcc-feats --write-utt2dur=ark,t:data/train_si284_spe2e_hires/log/utt2dur.9 --verbose=2 --config=conf/mfcc_hires.conf scp,p:data/train_si284_spe2e_hires/log/wav_train_si284_spe2e_hires.9.scp ark:- 
copy-feats --write-num-frames=ark,t:data/train_si284_spe2e_hires/log/utt2num_frames.9 --compress=true ark:- ark,scp:/opt/kaldi/egs/wsj/s5/data/train_si284_spe2e_hires/data/raw_mfcc_train_si284_spe2e_hires.9.ark,/opt/kaldi/egs/wsj/s5/data/train_si284_spe2e_hires/data/raw_mfcc_train_si284_spe2e_hires.9.scp 
sox WARN wav: Length in output .wav header will be wrong since can't seek to fix it
VLOG[1] (compute-mfcc-feats[5.5]:Read():wave-reader.cc:249) Read in RIFF chunk size: 2147479588, data chunk size: 2147479552. Assume 'stream mode' (reading data to EOF).
sox WARN rate: rate clipped 1 samples; decrease volume?
sox WARN dither: dither clipped 1 samples; decrease volume?
VLOG[2] (compute-mfcc-feats[5.5]:main():compute-mfcc-feats.cc:182) Processed features for key pv1-7a6f79837880403db0be87ffba8b1809-2019-06-2712:44:59.785521
sox WARN wav: Length in output .wav header will be wrong since can't seek to fix it
VLOG[1] (compute-mfcc-feats[5.5]:Read():wave-reader.cc:249) Read in RIFF chunk size: 2147479588, data chunk size: 2147479552. Assume 'stream mode' (reading data to EOF).
VLOG[2] (compute-mfcc-feats[5.5]:main():compute-mfcc-feats.cc:182) Processed features for key pv1-7a6f79837880403db0be87ffba8b1809-2019-06-2712:45:31.651781
sox WARN wav: Length in output .wav header will be wrong since can't seek to fix it
VLOG[1] (compute-mfcc-feats[5.5]:Read():wave-reader.cc:249) Read in RIFF chunk size: 2147479588, data chunk size: 2147479552. Assume 'stream mode' (reading data to EOF).
VLOG[2] (compute-mfcc-feats[5.5]:main():compute-mfcc-feats.cc:182) Processed features for key pv1-7a6f79837880403db0be87ffba8b1809-2019-06-2805:42:41.257614
sox WARN wav: Length in output .wav header will be wrong since can't seek to fix it
VLOG[1] (compute-mfcc-feats[5.5]:Read():wave-reader.cc:249) Read in RIFF chunk size: 2147479588, data chunk size: 2147479552. Assume 'stream mode' (reading data to EOF).
VLOG[2] (compute-mfcc-feats[5.5]:main():compute-mfcc-feats.cc:182) Processed features for key pv1-7a6f79837880403db0be87ffba8b1809-2019-06-2906:23:00.231703
sox WARN wav: Length in output .wav header will be wrong since can't seek to fix it


When I then try to generate egs, I get the error "no features found for utterance".
This happens only for the speed-perturbed utterances.
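
To narrow down which utterances are affected, one option is to compare the utterance IDs listed in the data directory's utt2spk against those that actually ended up in feats.scp. The following is only a minimal sketch, assuming both files use the usual Kaldi layout with the utterance ID as the first whitespace-separated field; the helper names `read_ids` and `missing_features` are made up for illustration and are not part of Kaldi.

```python
from typing import Iterable, List, Set


def read_ids(lines: Iterable[str]) -> Set[str]:
    """Collect the first whitespace-separated field of each non-empty line."""
    ids = set()
    for line in lines:
        fields = line.split()
        if fields:  # skip blank lines
            ids.add(fields[0])
    return ids


def missing_features(utt2spk_lines: Iterable[str],
                     feats_scp_lines: Iterable[str]) -> List[str]:
    """Return utterance IDs present in utt2spk but absent from feats.scp.

    Assumption: both inputs are Kaldi-style text tables keyed by utterance ID.
    """
    expected = read_ids(utt2spk_lines)
    have = read_ids(feats_scp_lines)
    return sorted(expected - have)
```

It could be called with two open file handles, for example `data/train_si284_spe2e_hires/utt2spk` and `data/train_si284_spe2e_hires/feats.scp`. If the returned list contains exactly the speed-perturbed IDs, that would suggest the IDs written during feature extraction do not match the IDs expected downstream.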

Please help
