Failed to open script file data/train/split1/1/feats.scp

Santiago Pineda Montoya

Oct 25, 2019, 11:49:43 AM
to kaldi-help
Hi!

I'm following the Kaldi for Dummies tutorial and I get the following error:

===== PREPARING ACOUSTIC DATA =====


===== FEATURES EXTRACTION =====

steps/make_mfcc.sh --nj 1 --cmd run.pl --max-jobs-run 1 data/train exp/make_mfcc/train mfcc
utils/validate_data_dir.sh: Successfully validated data-directory data/train
steps/make_mfcc.sh: [info]: no segments file exists: assuming wav.scp indexed by utterance.
run.pl: job failed, log is in exp/make_mfcc/train/make_mfcc_train.1.log
steps/make_mfcc.sh --nj 1 --cmd run.pl --max-jobs-run 1 data/test exp/make_mfcc/test mfcc
utils/validate_data_dir.sh: Successfully validated data-directory data/test
steps/make_mfcc.sh: [info]: no segments file exists: assuming wav.scp indexed by utterance.
run.pl: job failed, log is in exp/make_mfcc/test/make_mfcc_test.1.log
steps/compute_cmvn_stats.sh data/train exp/make_mfcc/train mfcc
steps/compute_cmvn_stats.sh: no such file data/train/feats.scp
steps/compute_cmvn_stats.sh data/test exp/make_mfcc/test mfcc
steps/compute_cmvn_stats.sh: no such file data/test/feats.scp

===== PREPARING LANGUAGE DATA =====

utils/prepare_lang.sh data/local/dict <UNK> data/local/lang data/lang
Checking data/local/dict/silence_phones.txt ...
--> reading data/local/dict/silence_phones.txt
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> data/local/dict/silence_phones.txt is OK

Checking data/local/dict/optional_silence.txt ...
--> reading data/local/dict/optional_silence.txt
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> data/local/dict/optional_silence.txt is OK

Checking data/local/dict/nonsilence_phones.txt ...
--> reading data/local/dict/nonsilence_phones.txt
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> data/local/dict/nonsilence_phones.txt is OK

Checking disjoint: silence_phones.txt, nonsilence_phones.txt
--> disjoint property is OK.

Checking data/local/dict/lexicon.txt
--> reading data/local/dict/lexicon.txt
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> data/local/dict/lexicon.txt is OK

Checking data/local/dict/extra_questions.txt ...
--> data/local/dict/extra_questions.txt is empty (this is OK)
--> SUCCESS [validating dictionary directory data/local/dict]

**Creating data/local/dict/lexiconp.txt from data/local/dict/lexicon.txt
fstaddselfloops data/lang/phones/wdisambig_phones.int data/lang/phones/wdisambig_words.int
prepare_lang.sh: validating output directory
utils/validate_lang.pl data/lang
Checking existence of separator file
separator file data/lang/subword_separator.txt is empty or does not exist, deal in word case.
Checking data/lang/phones.txt ...
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> data/lang/phones.txt is OK

Checking words.txt: #0 ...
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> data/lang/words.txt is OK

Checking disjoint: silence.txt, nonsilence.txt, disambig.txt ...
--> silence.txt and nonsilence.txt are disjoint
--> silence.txt and disambig.txt are disjoint
--> disambig.txt and nonsilence.txt are disjoint
--> disjoint property is OK

Checking sumation: silence.txt, nonsilence.txt, disambig.txt ...
--> found no unexplainable phones in phones.txt

Checking data/lang/phones/context_indep.{txt, int, csl} ...
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> 10 entry/entries in data/lang/phones/context_indep.txt
--> data/lang/phones/context_indep.int corresponds to data/lang/phones/context_indep.txt
--> data/lang/phones/context_indep.csl corresponds to data/lang/phones/context_indep.txt
--> data/lang/phones/context_indep.{txt, int, csl} are OK

Checking data/lang/phones/nonsilence.{txt, int, csl} ...
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> 56 entry/entries in data/lang/phones/nonsilence.txt
--> data/lang/phones/nonsilence.int corresponds to data/lang/phones/nonsilence.txt
--> data/lang/phones/nonsilence.csl corresponds to data/lang/phones/nonsilence.txt
--> data/lang/phones/nonsilence.{txt, int, csl} are OK

Checking data/lang/phones/silence.{txt, int, csl} ...
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> 10 entry/entries in data/lang/phones/silence.txt
--> data/lang/phones/silence.int corresponds to data/lang/phones/silence.txt
--> data/lang/phones/silence.csl corresponds to data/lang/phones/silence.txt
--> data/lang/phones/silence.{txt, int, csl} are OK

Checking data/lang/phones/optional_silence.{txt, int, csl} ...
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> 1 entry/entries in data/lang/phones/optional_silence.txt
--> data/lang/phones/optional_silence.int corresponds to data/lang/phones/optional_silence.txt
--> data/lang/phones/optional_silence.csl corresponds to data/lang/phones/optional_silence.txt
--> data/lang/phones/optional_silence.{txt, int, csl} are OK

Checking data/lang/phones/disambig.{txt, int, csl} ...
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> 2 entry/entries in data/lang/phones/disambig.txt
--> data/lang/phones/disambig.int corresponds to data/lang/phones/disambig.txt
--> data/lang/phones/disambig.csl corresponds to data/lang/phones/disambig.txt
--> data/lang/phones/disambig.{txt, int, csl} are OK

Checking data/lang/phones/roots.{txt, int} ...
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> 16 entry/entries in data/lang/phones/roots.txt
--> data/lang/phones/roots.int corresponds to data/lang/phones/roots.txt
--> data/lang/phones/roots.{txt, int} are OK

Checking data/lang/phones/sets.{txt, int} ...
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> 16 entry/entries in data/lang/phones/sets.txt
--> data/lang/phones/sets.int corresponds to data/lang/phones/sets.txt
--> data/lang/phones/sets.{txt, int} are OK

Checking data/lang/phones/extra_questions.{txt, int} ...
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> 9 entry/entries in data/lang/phones/extra_questions.txt
--> data/lang/phones/extra_questions.int corresponds to data/lang/phones/extra_questions.txt
--> data/lang/phones/extra_questions.{txt, int} are OK

Checking data/lang/phones/word_boundary.{txt, int} ...
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> 66 entry/entries in data/lang/phones/word_boundary.txt
--> data/lang/phones/word_boundary.int corresponds to data/lang/phones/word_boundary.txt
--> data/lang/phones/word_boundary.{txt, int} are OK

Checking optional_silence.txt ...
--> reading data/lang/phones/optional_silence.txt
--> data/lang/phones/optional_silence.txt is OK

Checking disambiguation symbols: #0 and #1
--> data/lang/phones/disambig.txt has "#0" and "#1"
--> data/lang/phones/disambig.txt is OK

Checking topo ...

Checking word_boundary.txt: silence.txt, nonsilence.txt, disambig.txt ...
--> data/lang/phones/word_boundary.txt doesn't include disambiguation symbols
--> data/lang/phones/word_boundary.txt is the union of nonsilence.txt and silence.txt
--> data/lang/phones/word_boundary.txt is OK

Checking word-level disambiguation symbols...
--> data/lang/phones/wdisambig.txt exists (newer prepare_lang.sh)
--> generating a 98 word/subword sequence
--> resulting phone sequence from L.fst corresponds to the word sequence
--> L.fst is OK
--> generating a 32 word/subword sequence
--> resulting phone sequence from L_disambig.fst corresponds to the word sequence
--> L_disambig.fst is OK

Checking data/lang/oov.{txt, int} ...
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> 1 entry/entries in data/lang/oov.txt
--> data/lang/oov.int corresponds to data/lang/oov.txt
--> data/lang/oov.{txt, int} are OK

--> data/lang/L.fst is olabel sorted
--> data/lang/L_disambig.fst is olabel sorted
--> SUCCESS [validating lang directory data/lang]

===== LANGUAGE MODEL CREATION =====
===== MAKING lm.arpa =====


===== MAKING G.fst =====

arpa2fst --disambig-symbol=#0 --read-symbol-table=data/lang/words.txt data/local/tmp/lm.arpa data/lang/G.fst
LOG (arpa2fst[5.5.490~1-b2f64]:Read():arpa-file-parser.cc:94) Reading \data\ section.
LOG (arpa2fst[5.5.490~1-b2f64]:Read():arpa-file-parser.cc:149) Reading \1-grams: section.
LOG (arpa2fst[5.5.490~1-b2f64]:RemoveRedundantStates():arpa-lm-compiler.cc:359) Reduced num-states from 1 to 1

===== MONO TRAINING =====

steps/train_mono.sh --nj 1 --cmd run.pl --max-jobs-run 1 data/train data/lang exp/mono
steps/train_mono.sh: Initializing monophone system.
feat-to-dim 'ark,s,cs:apply-cmvn --utt2spk=ark:data/train/split1/1/utt2spk scp:data/train/split1/1/cmvn.scp scp:data/train/split1/1/feats.scp ark:- | add-deltas ark:- ark:- |' -
add-deltas ark:- ark:-
apply-cmvn --utt2spk=ark:data/train/split1/1/utt2spk scp:data/train/split1/1/cmvn.scp scp:data/train/split1/1/feats.scp ark:-
WARNING (apply-cmvn[5.5.490~1-b2f64]:Open():util/kaldi-table-inl.h:106) Failed to open script file data/train/split1/1/feats.scp
ERROR (apply-cmvn[5.5.490~1-b2f64]:SequentialTableReader():util/kaldi-table-inl.h:860) Error constructing TableReader: rspecifier is scp:data/train/split1/1/feats.scp

[ Stack-Trace: ]
/home/csanta/pytorch-kaldi/kaldi/src/lib/libkaldi-base.so(kaldi::MessageLogger::LogMessage() const+0xa71) [0x7f16eb41c53f]
apply-cmvn(kaldi::MessageLogger::LogAndThrow::operator=(kaldi::MessageLogger const&)+0x11) [0x5620227f2d03]
apply-cmvn(kaldi::SequentialTableReader<kaldi::KaldiObjectHolder<kaldi::Matrix<float> > >::SequentialTableReader(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)+0x95) [0x5620227f9659]
apply-cmvn(main+0x755) [0x5620227f0885]
/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf1) [0x7f16e3a082e1]
apply-cmvn(_start+0x2a) [0x5620227f002a]

kaldi::KaldiFatalErrorERROR (feat-to-dim[5.5.490~1-b2f64]:main():feat-to-dim.cc:58) Could not read any features (empty archive?)

[ Stack-Trace: ]
/home/csanta/pytorch-kaldi/kaldi/src/lib/libkaldi-base.so(kaldi::MessageLogger::LogMessage() const+0xa71) [0x7fed3d11a53f]
feat-to-dim(kaldi::MessageLogger::LogAndThrow::operator=(kaldi::MessageLogger const&)+0x11) [0x5645a59db357]
feat-to-dim(main+0x2dc) [0x5645a59da87c]
/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf1) [0x7fed357062e1]
feat-to-dim(_start+0x2a) [0x5645a59da49a]

kaldi::KaldiFatalErrorerror getting feature dimension


I want to thank you all for your help and support

Armando

Oct 25, 2019, 11:55:53 AM
to kaldi-help
Feature extraction failed.

See the message:
run.pl: job failed, log is in exp/make_mfcc/test/make_mfcc_test.1.log

Open that log and see what it says.
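
One quick way to follow that advice is to pull only the WARNING and ERROR lines out of the log that run.pl names. This is a minimal sketch: the helper name log_problems is made up for illustration, and the pattern assumes the "WARNING (" and "ERROR (" line prefixes that Kaldi's output shows elsewhere in this thread.

```shell
#!/bin/sh
# Hypothetical helper (not part of Kaldi): print only the WARNING/ERROR lines of a log.
# Assumes Kaldi's log-line prefixes "WARNING (" and "ERROR (".
log_problems() {
    grep -E '^(WARNING|ERROR) \(' "$1"
}

# Example, using the log path reported by run.pl:
#   log_problems exp/make_mfcc/test/make_mfcc_test.1.log | head -n 20
```

The first few lines are usually enough, since the same failure tends to repeat once per utterance.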

Santiago Pineda Montoya

Oct 25, 2019, 12:02:25 PM
to kaldi-help
Any suggestions for solving the problem?

Santiago Pineda Montoya

Oct 25, 2019, 12:04:09 PM
to kaldi-help
The log is:

# compute-mfcc-feats --write-utt2dur=ark,t:exp/make_mfcc/test/utt2dur.1 --verbose=2 --config=conf/mfcc.conf scp,p:exp/make_mfcc/test/wav_test.1.scp ark:- | copy-feats --write-num-frames=ark,t:exp/make_mfcc/test/utt2num_frames.1 --compre$
# Started at Fri Oct 25 15:47:27 UTC 2019
#
copy-feats --write-num-frames=ark,t:exp/make_mfcc/test/utt2num_frames.1 --compress=true ark:- ark,scp:/home/csanta/pytorch-kaldi/kaldi/egs/digits/mfcc/raw_mfcc_test.1.ark,/home/csanta/pytorch-kaldi/kaldi/egs/digits/mfcc/raw_mfcc_test.1.$
compute-mfcc-feats --write-utt2dur=ark,t:exp/make_mfcc/test/utt2dur.1 --verbose=2 --config=conf/mfcc.conf scp,p:exp/make_mfcc/test/wav_test.1.scp ark:-
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\0_6_4.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\1_3_5.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\2_4_7.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\3_1_9.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\4_9_2.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\5_0_8.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\6_5_0.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\7_8_6.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\8_2_3.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\9_7_1.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\0_4_3.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\1_7_5.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\2_8_9.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\3_5_8.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\4_3_0.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\5_9_7.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\6_0_1.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\7_2_6.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\8_6_2.wav'
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\9_1_4.wav'
LOG (compute-mfcc-feats[5.5.490~1-b2f64]:main():compute-mfcc-feats.cc:185)  Done 0 out of 0 utterances.
LOG (copy-feats[5.5.490~1-b2f64]:main():copy-feats.cc:143) Copied 0 feature matrices.
# Accounting: time=0 threads=1
# Ended (code 1) at Fri Oct 25 15:47:27 UTC 2019, elapsed time 0 seconds


Armando

Oct 25, 2019, 12:08:48 PM
to kaldi-help
So? Do those wav files even exist?
It cannot open them.
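
A way to check this directly is to walk wav.scp and test each path. The sketch below is only illustrative: the helper names are made up, and it assumes the simple wav.scp layout "<utterance-id> <path-to-wav>" with no spaces in paths and no piped commands. On Linux a "\" is an ordinary file-name character rather than a directory separator, so paths like the ones in the log above would be reported as missing. The second helper prints a copy of the file with "\" turned into "/".

```shell
#!/bin/sh
# Hypothetical helpers (not part of Kaldi). Both assume wav.scp lines of the
# form "<utterance-id> <path-to-wav>", without spaces in the path.

# Print every wav.scp entry whose audio file does not exist.
missing_wavs() {
    while read -r utt path; do
        [ -n "$utt" ] || continue          # skip empty lines
        [ -f "$path" ] || printf '%s %s\n' "$utt" "$path"
    done < "$1"
}

# Print the scp file with Windows-style "\" separators replaced by "/".
to_forward_slashes() {
    tr '\\' '/' < "$1"
}

# Example:
#   missing_wavs data/train/wav.scp
#   to_forward_slashes data/train/wav.scp > wav.scp.fixed
```

Writing the fixed copy to a new file first makes it easy to inspect the result before replacing the original wav.scp.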

Santiago Pineda Montoya

Oct 25, 2019, 12:41:39 PM
to kaldi-help
The problem is the path "/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\8_2_3.wav": it has \ instead of /. But I still get the same error, now with the following log:

compute-mfcc-feats(main+0x824) [0x564a283cc604]
/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf1) [0x7f1fd4a272e1]
compute-mfcc-feats(_start+0x2a) [0x564a283cbcda]

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:Read():feat/wave-reader.h:197) Exception caught in WaveHolder::Read(). kaldi::KaldiFatalError
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:317) Failed to load object from /home/csanta/pytorch-kaldi/kaldi/egs/digits/di$
ERROR (compute-mfcc-feats[5.5.490~1-b2f64]:Read():wave-reader.cc:190) WaveData: can read only PCM data, format id in file is: 3

[ Stack-Trace: ]
/home/csanta/pytorch-kaldi/kaldi/src/lib/libkaldi-base.so(kaldi::MessageLogger::LogMessage() const+0xa71) [0x7f1fdc43b53f]
compute-mfcc-feats(kaldi::MessageLogger::LogAndThrow::operator=(kaldi::MessageLogger const&)+0x11) [0x564a283cf2a3]
/home/csanta/pytorch-kaldi/kaldi/src/lib/libkaldi-feat.so(kaldi::WaveInfo::Read(std::istream&)+0x12d3) [0x7f1fdd1fa403]
/home/csanta/pytorch-kaldi/kaldi/src/lib/libkaldi-feat.so(kaldi::WaveData::Read(std::istream&)+0x4a) [0x7f1fdd1fb21c]
compute-mfcc-feats(kaldi::WaveHolder::Read(std::istream&)+0xd) [0x564a283d5049]
compute-mfcc-feats(kaldi::SequentialTableReaderScriptImpl<kaldi::WaveHolder>::EnsureObjectLoaded()+0x161) [0x564a283dc1df]
compute-mfcc-feats(kaldi::SequentialTableReaderScriptImpl<kaldi::WaveHolder>::Next()+0x27) [0x564a283dcac7]
compute-mfcc-feats(kaldi::SequentialTableReaderScriptImpl<kaldi::WaveHolder>::Open(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > cons$
compute-mfcc-feats(kaldi::SequentialTableReader<kaldi::WaveHolder>::Open(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)+0x1ed)$
compute-mfcc-feats(kaldi::SequentialTableReader<kaldi::WaveHolder>::SequentialTableReader(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char>$
compute-mfcc-feats(main+0x824) [0x564a283cc604]
/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf1) [0x7f1fd4a272e1]
compute-mfcc-feats(_start+0x2a) [0x564a283cbcda]

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:Read():feat/wave-reader.h:197) Exception caught in WaveHolder::Read(). kaldi::KaldiFatalError
WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:317) Failed to load object from /home/csanta/pytorch-kaldi/kaldi/egs/digits/di$
LOG (compute-mfcc-feats[5.5.490~1-b2f64]:main():compute-mfcc-feats.cc:185)  Done 0 out of 0 utterances.
LOG (copy-feats[5.5.490~1-b2f64]:main():copy-feats.cc:143) Copied 0 feature matrices.
# Accounting: time=0 threads=1
# Ended (code 1) at Fri Oct 25 16:34:22 UTC 2019, elapsed time 0 seconds

Armando

Oct 25, 2019, 12:51:19 PM
to kaldi-help
It's not the same error; it's a different one, and it's self-explanatory:
WaveData: can read only PCM data

Your wav files are probably not encoded the way Kaldi requires.
Do
file $yourwavefile
to see what kind of data you have.
Kaldi expects the audio to be 16-bit PCM.

Depending on the encoding, you can convert them easily with sox.
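
For reference, format id 3 in a WAVE header is the IEEE floating-point tag, which is consistent with the "can read only PCM data" error above. One way to check the encoding per file is soxi, which ships with sox. This is a minimal sketch, assuming soxi is on the PATH and that it reports signed-integer data as "Signed Integer PCM"; the function name is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: succeed only if the file is 16-bit signed-integer PCM.
# Assumes `soxi -e` prints the encoding name and `soxi -b` the bits per sample.
is_pcm16() {
    enc=$(soxi -e "$1") || return 1
    bits=$(soxi -b "$1") || return 1
    case "$enc" in
        "Signed Integer"*) [ "$bits" = "16" ] ;;
        *) return 1 ;;
    esac
}

# Example: list the files that still need converting.
#   for f in *.wav; do is_pcm16 "$f" || echo "needs conversion: $f"; done
```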

Santiago Pineda Montoya

Oct 25, 2019, 1:10:51 PM
to kaldi-help
I am trying with 
sox *.wav -r 16000 -b 16 *.wav 
and I get 
sox WARN wav: User options overriding size read in .wav header
sox FAIL formats: bad input format for file `0_6_4.wav': data encoding or sample size was not specified

Thanks for your help!

Armando

Oct 25, 2019, 1:13:03 PM
to kaldi-help
What format is the original data?
Use the command
file *wav

Santiago Pineda Montoya

Oct 25, 2019, 1:17:28 PM
to kaldi-help
1_3_5.wav:    RIFF (little-endian) data, WAVE audio, mono 16000 Hz
2_4_7.wav:    RIFF (little-endian) data, WAVE audio, mono 16000 Hz
3_1_9.wav:    RIFF (little-endian) data, WAVE audio, mono 16000 Hz
4_9_2.wav:    RIFF (little-endian) data, WAVE audio, mono 16000 Hz
5_0_8.wav:    RIFF (little-endian) data, WAVE audio, mono 16000 Hz
6_5_0.wav:    RIFF (little-endian) data, WAVE audio, mono 16000 Hz
7_8_6.wav:    RIFF (little-endian) data, WAVE audio, mono 16000 Hz
8_2_3.wav:    RIFF (little-endian) data, WAVE audio, mono 16000 Hz
9_7_1.wav:    RIFF (little-endian) data, WAVE audio, mono 16000 Hz

Armando

Oct 25, 2019, 1:19:30 PM
to kaldi-help
try doing
sox 1_3_5.wav -e signed-integer 1_3_5.new.wav

Santiago Pineda Montoya

Oct 25, 2019, 3:20:41 PM
to kaldi-help
I did that for each file:
for i in *.wav; do echo "$i"; sox "$i" -e signed-integer -b 16 "${i%.wav}.new.wav"; done
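
A variation on that loop writes the converted copies into a separate directory, so the file names stay the same and wav.scp only needs to point at the new directory. This is a sketch only: the function name and the directory arguments are hypothetical, and the sox options are the ones already used in this thread.

```shell
#!/bin/sh
# Hypothetical helper: convert every .wav in SRC to 16-bit signed-integer PCM
# in DST, keeping the file names. Same sox options as the loop in this thread.
convert_to_pcm16() {
    src=$1
    dst=$2
    mkdir -p "$dst" || return 1
    for f in "$src"/*.wav; do
        [ -e "$f" ] || continue            # the glob matched no .wav files
        sox "$f" -e signed-integer -b 16 "$dst/$(basename "$f")" || return 1
    done
}

# Example (directory names are placeholders):
#   convert_to_pcm16 digits_audio/train/camilo3 digits_audio_pcm16/train/camilo3
```

Keeping the originals untouched also means the loop can be rerun safely, since the output files never match the input glob.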

Armando

Oct 27, 2019, 8:14:43 AM
to kaldi-help
Yeah... well, does it work?

Santiago Pineda Montoya

Oct 28, 2019, 10:02:57 AM
to kaldi-help
Yeah! Thanks for the help! :D