Failed to open script file data/train/split1/1/feats.scp

Santiago Pineda Montoya

unread,

Oct 25, 2019, 11:49:43 AM10/25/19

to kaldi-help

Hi!

I'm following the Kaldi for Dummies tutorial and I get the error:

===== PREPARING ACOUSTIC DATA =====

===== FEATURES EXTRACTION =====

steps/make_mfcc.sh --nj 1 --cmd run.pl --max-jobs-run 1 data/train exp/make_mfcc/train mfcc

utils/validate_data_dir.sh: Successfully validated data-directory data/train

steps/make_mfcc.sh: [info]: no segments file exists: assuming wav.scp indexed by utterance.

run.pl: job failed, log is in exp/make_mfcc/train/make_mfcc_train.1.log

steps/make_mfcc.sh --nj 1 --cmd run.pl --max-jobs-run 1 data/test exp/make_mfcc/test mfcc

utils/validate_data_dir.sh: Successfully validated data-directory data/test

steps/make_mfcc.sh: [info]: no segments file exists: assuming wav.scp indexed by utterance.

run.pl: job failed, log is in exp/make_mfcc/test/make_mfcc_test.1.log

steps/compute_cmvn_stats.sh data/train exp/make_mfcc/train mfcc

steps/compute_cmvn_stats.sh: no such file data/train/feats.scp

steps/compute_cmvn_stats.sh data/test exp/make_mfcc/test mfcc

steps/compute_cmvn_stats.sh: no such file data/test/feats.scp

===== PREPARING LANGUAGE DATA =====

utils/prepare_lang.sh data/local/dict <UNK> data/local/lang data/lang

Checking data/local/dict/silence_phones.txt ...

--> reading data/local/dict/silence_phones.txt

--> text seems to be UTF-8 or ASCII, checking whitespaces

--> text contains only allowed whitespaces

--> data/local/dict/silence_phones.txt is OK

Checking data/local/dict/optional_silence.txt ...

--> reading data/local/dict/optional_silence.txt

--> text seems to be UTF-8 or ASCII, checking whitespaces

--> text contains only allowed whitespaces

--> data/local/dict/optional_silence.txt is OK

Checking data/local/dict/nonsilence_phones.txt ...

--> reading data/local/dict/nonsilence_phones.txt

--> text seems to be UTF-8 or ASCII, checking whitespaces

--> text contains only allowed whitespaces

--> data/local/dict/nonsilence_phones.txt is OK

Checking disjoint: silence_phones.txt, nonsilence_phones.txt

--> disjoint property is OK.

Checking data/local/dict/lexicon.txt

--> reading data/local/dict/lexicon.txt

--> text seems to be UTF-8 or ASCII, checking whitespaces

--> text contains only allowed whitespaces

--> data/local/dict/lexicon.txt is OK

Checking data/local/dict/extra_questions.txt ...

--> data/local/dict/extra_questions.txt is empty (this is OK)

--> SUCCESS [validating dictionary directory data/local/dict]

**Creating data/local/dict/lexiconp.txt from data/local/dict/lexicon.txt

fstaddselfloops data/lang/phones/wdisambig_phones.int data/lang/phones/wdisambig_words.int

prepare_lang.sh: validating output directory

utils/validate_lang.pl data/lang

Checking existence of separator file

separator file data/lang/subword_separator.txt is empty or does not exist, deal in word case.

Checking data/lang/phones.txt ...

--> text seems to be UTF-8 or ASCII, checking whitespaces

--> text contains only allowed whitespaces

--> data/lang/phones.txt is OK

Checking words.txt: #0 ...

--> text seems to be UTF-8 or ASCII, checking whitespaces

--> text contains only allowed whitespaces

--> data/lang/words.txt is OK

Checking disjoint: silence.txt, nonsilence.txt, disambig.txt ...

--> silence.txt and nonsilence.txt are disjoint

--> silence.txt and disambig.txt are disjoint

--> disambig.txt and nonsilence.txt are disjoint

--> disjoint property is OK

Checking sumation: silence.txt, nonsilence.txt, disambig.txt ...

--> found no unexplainable phones in phones.txt

Checking data/lang/phones/context_indep.{txt, int, csl} ...

--> text seems to be UTF-8 or ASCII, checking whitespaces

--> text contains only allowed whitespaces

--> 10 entry/entries in data/lang/phones/context_indep.txt

--> data/lang/phones/context_indep.int corresponds to data/lang/phones/context_indep.txt

--> data/lang/phones/context_indep.csl corresponds to data/lang/phones/context_indep.txt

--> data/lang/phones/context_indep.{txt, int, csl} are OK

Checking data/lang/phones/nonsilence.{txt, int, csl} ...

--> text seems to be UTF-8 or ASCII, checking whitespaces

--> text contains only allowed whitespaces

--> 56 entry/entries in data/lang/phones/nonsilence.txt

--> data/lang/phones/nonsilence.int corresponds to data/lang/phones/nonsilence.txt

--> data/lang/phones/nonsilence.csl corresponds to data/lang/phones/nonsilence.txt

--> data/lang/phones/nonsilence.{txt, int, csl} are OK

Checking data/lang/phones/silence.{txt, int, csl} ...

--> text seems to be UTF-8 or ASCII, checking whitespaces

--> text contains only allowed whitespaces

--> 10 entry/entries in data/lang/phones/silence.txt

--> data/lang/phones/silence.int corresponds to data/lang/phones/silence.txt

--> data/lang/phones/silence.csl corresponds to data/lang/phones/silence.txt

--> data/lang/phones/silence.{txt, int, csl} are OK

Checking data/lang/phones/optional_silence.{txt, int, csl} ...

--> text seems to be UTF-8 or ASCII, checking whitespaces

--> text contains only allowed whitespaces

--> 1 entry/entries in data/lang/phones/optional_silence.txt

--> data/lang/phones/optional_silence.int corresponds to data/lang/phones/optional_silence.txt

--> data/lang/phones/optional_silence.csl corresponds to data/lang/phones/optional_silence.txt

--> data/lang/phones/optional_silence.{txt, int, csl} are OK

Checking data/lang/phones/disambig.{txt, int, csl} ...

--> text seems to be UTF-8 or ASCII, checking whitespaces

--> text contains only allowed whitespaces

--> 2 entry/entries in data/lang/phones/disambig.txt

--> data/lang/phones/disambig.int corresponds to data/lang/phones/disambig.txt

--> data/lang/phones/disambig.csl corresponds to data/lang/phones/disambig.txt

--> data/lang/phones/disambig.{txt, int, csl} are OK

Checking data/lang/phones/roots.{txt, int} ...

--> text seems to be UTF-8 or ASCII, checking whitespaces

--> text contains only allowed whitespaces

--> 16 entry/entries in data/lang/phones/roots.txt

--> data/lang/phones/roots.int corresponds to data/lang/phones/roots.txt

--> data/lang/phones/roots.{txt, int} are OK

Checking data/lang/phones/sets.{txt, int} ...

--> text seems to be UTF-8 or ASCII, checking whitespaces

--> text contains only allowed whitespaces

--> 16 entry/entries in data/lang/phones/sets.txt

--> data/lang/phones/sets.int corresponds to data/lang/phones/sets.txt

--> data/lang/phones/sets.{txt, int} are OK

Checking data/lang/phones/extra_questions.{txt, int} ...

--> text seems to be UTF-8 or ASCII, checking whitespaces

--> text contains only allowed whitespaces

--> 9 entry/entries in data/lang/phones/extra_questions.txt

--> data/lang/phones/extra_questions.int corresponds to data/lang/phones/extra_questions.txt

--> data/lang/phones/extra_questions.{txt, int} are OK

Checking data/lang/phones/word_boundary.{txt, int} ...

--> text seems to be UTF-8 or ASCII, checking whitespaces

--> text contains only allowed whitespaces

--> 66 entry/entries in data/lang/phones/word_boundary.txt

--> data/lang/phones/word_boundary.int corresponds to data/lang/phones/word_boundary.txt

--> data/lang/phones/word_boundary.{txt, int} are OK

Checking optional_silence.txt ...

--> reading data/lang/phones/optional_silence.txt

--> data/lang/phones/optional_silence.txt is OK

Checking disambiguation symbols: #0 and #1

--> data/lang/phones/disambig.txt has "#0" and "#1"

--> data/lang/phones/disambig.txt is OK

Checking topo ...

Checking word_boundary.txt: silence.txt, nonsilence.txt, disambig.txt ...

--> data/lang/phones/word_boundary.txt doesn't include disambiguation symbols

--> data/lang/phones/word_boundary.txt is the union of nonsilence.txt and silence.txt

--> data/lang/phones/word_boundary.txt is OK

Checking word-level disambiguation symbols...

--> data/lang/phones/wdisambig.txt exists (newer prepare_lang.sh)

Checking word_boundary.int and disambig.int

--> generating a 98 word/subword sequence

--> resulting phone sequence from L.fst corresponds to the word sequence

--> L.fst is OK

--> generating a 32 word/subword sequence

--> resulting phone sequence from L_disambig.fst corresponds to the word sequence

--> L_disambig.fst is OK

Checking data/lang/oov.{txt, int} ...

--> text seems to be UTF-8 or ASCII, checking whitespaces

--> text contains only allowed whitespaces

--> 1 entry/entries in data/lang/oov.txt

--> data/lang/oov.int corresponds to data/lang/oov.txt

--> data/lang/oov.{txt, int} are OK

--> data/lang/L.fst is olabel sorted

--> data/lang/L_disambig.fst is olabel sorted

--> SUCCESS [validating lang directory data/lang]

===== LANGUAGE MODEL CREATION =====

===== MAKING lm.arpa =====

===== MAKING G.fst =====

arpa2fst --disambig-symbol=#0 --read-symbol-table=data/lang/words.txt data/local/tmp/lm.arpa data/lang/G.fst

LOG (arpa2fst[5.5.490~1-b2f64]:Read():arpa-file-parser.cc:94) Reading \data\ section.

LOG (arpa2fst[5.5.490~1-b2f64]:Read():arpa-file-parser.cc:149) Reading \1-grams: section.

LOG (arpa2fst[5.5.490~1-b2f64]:RemoveRedundantStates():arpa-lm-compiler.cc:359) Reduced num-states from 1 to 1

===== MONO TRAINING =====

steps/train_mono.sh --nj 1 --cmd run.pl --max-jobs-run 1 data/train data/lang exp/mono

steps/train_mono.sh: Initializing monophone system.

feat-to-dim 'ark,s,cs:apply-cmvn --utt2spk=ark:data/train/split1/1/utt2spk scp:data/train/split1/1/cmvn.scp scp:data/train/split1/1/feats.scp ark:- | add-deltas ark:- ark:- |' -

add-deltas ark:- ark:-

apply-cmvn --utt2spk=ark:data/train/split1/1/utt2spk scp:data/train/split1/1/cmvn.scp scp:data/train/split1/1/feats.scp ark:-

WARNING (apply-cmvn[5.5.490~1-b2f64]:Open():util/kaldi-table-inl.h:106) Failed to open script file data/train/split1/1/feats.scp

ERROR (apply-cmvn[5.5.490~1-b2f64]:SequentialTableReader():util/kaldi-table-inl.h:860) Error constructing TableReader: rspecifier is scp:data/train/split1/1/feats.scp

[ Stack-Trace: ]

/home/csanta/pytorch-kaldi/kaldi/src/lib/libkaldi-base.so(kaldi::MessageLogger::LogMessage() const+0xa71) [0x7f16eb41c53f]

apply-cmvn(kaldi::MessageLogger::LogAndThrow::operator=(kaldi::MessageLogger const&)+0x11) [0x5620227f2d03]

apply-cmvn(kaldi::SequentialTableReader<kaldi::KaldiObjectHolder<kaldi::Matrix<float> > >::SequentialTableReader(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)+0x95) [0x5620227f9659]

apply-cmvn(main+0x755) [0x5620227f0885]

/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf1) [0x7f16e3a082e1]

apply-cmvn(_start+0x2a) [0x5620227f002a]

kaldi::KaldiFatalErrorERROR (feat-to-dim[5.5.490~1-b2f64]:main():feat-to-dim.cc:58) Could not read any features (empty archive?)

[ Stack-Trace: ]

/home/csanta/pytorch-kaldi/kaldi/src/lib/libkaldi-base.so(kaldi::MessageLogger::LogMessage() const+0xa71) [0x7fed3d11a53f]

feat-to-dim(kaldi::MessageLogger::LogAndThrow::operator=(kaldi::MessageLogger const&)+0x11) [0x5645a59db357]

feat-to-dim(main+0x2dc) [0x5645a59da87c]

/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf1) [0x7fed357062e1]

feat-to-dim(_start+0x2a) [0x5645a59da49a]

kaldi::KaldiFatalErrorerror getting feature dimension

I want to thank you all for your help and support

https://drive.google.com/open?id=1rqSbc6rdRtKKH0eV21EQpstYX2awm6RQ

Armando

unread,

Oct 25, 2019, 11:55:53 AM10/25/19

to kaldi-help

feature extraction failed

see the message

run.pl: job failed, log is in exp/make_mfcc/test/make_mfcc_test.1.log

open that log, see what it says

Santiago Pineda Montoya

unread,

Oct 25, 2019, 12:02:25 PM10/25/19

to kaldi-help

any suggestion to solve the problem?

Santiago Pineda Montoya

unread,

Oct 25, 2019, 12:04:09 PM10/25/19

to kaldi-help

The log is:

# compute-mfcc-feats --write-utt2dur=ark,t:exp/make_mfcc/test/utt2dur.1 --verbose=2 --config=conf/mfcc.conf scp,p:exp/make_mfcc/test/wav_test.1.scp ark:- | copy-feats --write-num-frames=ark,t:exp/make_mfcc/test/utt2num_frames.1 --compre$

# Started at Fri Oct 25 15:47:27 UTC 2019

#

copy-feats --write-num-frames=ark,t:exp/make_mfcc/test/utt2num_frames.1 --compress=true ark:- ark,scp:/home/csanta/pytorch-kaldi/kaldi/egs/digits/mfcc/raw_mfcc_test.1.ark,/home/csanta/pytorch-kaldi/kaldi/egs/digits/mfcc/raw_mfcc_test.1.$

compute-mfcc-feats --write-utt2dur=ark,t:exp/make_mfcc/test/utt2dur.1 --verbose=2 --config=conf/mfcc.conf scp,p:exp/make_mfcc/test/wav_test.1.scp ark:-

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\0_6_4.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\1_3_5.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\2_4_7.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\3_1_9.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\4_9_2.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\5_0_8.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\6_5_0.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\7_8_6.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\8_2_3.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\9_7_1.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\0_4_3.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\1_7_5.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\2_8_9.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\3_5_8.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\4_3_0.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\5_9_7.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\6_0_1.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\7_2_6.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\8_6_2.wav'

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:310) Failed to open file '/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\test\cristian\9_1_4.wav'

LOG (compute-mfcc-feats[5.5.490~1-b2f64]:main():compute-mfcc-feats.cc:185) Done 0 out of 0 utterances.

LOG (copy-feats[5.5.490~1-b2f64]:main():copy-feats.cc:143) Copied 0 feature matrices.

# Accounting: time=0 threads=1

# Ended (code 1) at Fri Oct 25 15:47:27 UTC 2019, elapsed time 0 seconds

El viernes, 25 de octubre de 2019, 10:55:53 (UTC-5), Armando escribió:

Armando

unread,

Oct 25, 2019, 12:08:48 PM10/25/19

to kaldi-help

so? do those file wav even exist?

it can not open them

Santiago Pineda Montoya

unread,

Oct 25, 2019, 12:41:39 PM10/25/19

to kaldi-help

The problem is the path: "'/home/csanta/pytorch-kaldi/kaldi/egs/digits/digits_audio\train\camilo3\8_2_3.wav". I have \ instead of /. But I get the same error with the following log:

compute-mfcc-feats(main+0x824) [0x564a283cc604]

/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf1) [0x7f1fd4a272e1]

compute-mfcc-feats(_start+0x2a) [0x564a283cbcda]

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:Read():feat/wave-reader.h:197) Exception caught in WaveHolder::Read(). kaldi::KaldiFatalError

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:317) Failed to load object from /home/csanta/pytorch-kaldi/kaldi/egs/digits/di$

ERROR (compute-mfcc-feats[5.5.490~1-b2f64]:Read():wave-reader.cc:190) WaveData: can read only PCM data, format id in file is: 3

[ Stack-Trace: ]

/home/csanta/pytorch-kaldi/kaldi/src/lib/libkaldi-base.so(kaldi::MessageLogger::LogMessage() const+0xa71) [0x7f1fdc43b53f]

compute-mfcc-feats(kaldi::MessageLogger::LogAndThrow::operator=(kaldi::MessageLogger const&)+0x11) [0x564a283cf2a3]

/home/csanta/pytorch-kaldi/kaldi/src/lib/libkaldi-feat.so(kaldi::WaveInfo::Read(std::istream&)+0x12d3) [0x7f1fdd1fa403]

/home/csanta/pytorch-kaldi/kaldi/src/lib/libkaldi-feat.so(kaldi::WaveData::Read(std::istream&)+0x4a) [0x7f1fdd1fb21c]

compute-mfcc-feats(kaldi::WaveHolder::Read(std::istream&)+0xd) [0x564a283d5049]

compute-mfcc-feats(kaldi::SequentialTableReaderScriptImpl<kaldi::WaveHolder>::EnsureObjectLoaded()+0x161) [0x564a283dc1df]

compute-mfcc-feats(kaldi::SequentialTableReaderScriptImpl<kaldi::WaveHolder>::Next()+0x27) [0x564a283dcac7]

compute-mfcc-feats(kaldi::SequentialTableReaderScriptImpl<kaldi::WaveHolder>::Open(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > cons$

compute-mfcc-feats(kaldi::SequentialTableReader<kaldi::WaveHolder>::Open(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)+0x1ed)$

compute-mfcc-feats(kaldi::SequentialTableReader<kaldi::WaveHolder>::SequentialTableReader(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char>$

compute-mfcc-feats(main+0x824) [0x564a283cc604]

/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf1) [0x7f1fd4a272e1]

compute-mfcc-feats(_start+0x2a) [0x564a283cbcda]

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:Read():feat/wave-reader.h:197) Exception caught in WaveHolder::Read(). kaldi::KaldiFatalError

WARNING (compute-mfcc-feats[5.5.490~1-b2f64]:EnsureObjectLoaded():util/kaldi-table-inl.h:317) Failed to load object from /home/csanta/pytorch-kaldi/kaldi/egs/digits/di$

LOG (compute-mfcc-feats[5.5.490~1-b2f64]:main():compute-mfcc-feats.cc:185) Done 0 out of 0 utterances.

LOG (copy-feats[5.5.490~1-b2f64]:main():copy-feats.cc:143) Copied 0 feature matrices.

# Accounting: time=0 threads=1

# Ended (code 1) at Fri Oct 25 16:34:22 UTC 2019, elapsed time 0 seconds

Armando

unread,

Oct 25, 2019, 12:51:19 PM10/25/19

to kaldi-help

It's not the same error, it's a different one, it's self-explaining

WaveData: can read only PCM data

your wave file probably are not encoded in the correct way according to kaldi requirement

d

file $yourwavefile

to see what kind of data you have

kaldi expects the audio to be pcm 16bit

depending on the encoding you can decode them easily with sox

Santiago Pineda Montoya

unread,

Oct 25, 2019, 1:10:51 PM10/25/19

to kaldi-help

I am trying with

sox *.wav -r 16000 -b 16 *.wav

and I get

sox WARN wav: User options overriding size read in .wav header

sox FAIL formats: bad input format for file `0_6_4.wav': data encoding or sample size was not specified

Thanks for your help!

Armando

unread,

Oct 25, 2019, 1:13:03 PM10/25/19

to kaldi-help

what format is the original data?

use the command

file *wav

Santiago Pineda Montoya

unread,

Oct 25, 2019, 1:17:28 PM10/25/19

to kaldi-help

1_3_5.wav: RIFF (little-endian) data, WAVE audio, mono 16000 Hz

2_4_7.wav: RIFF (little-endian) data, WAVE audio, mono 16000 Hz

3_1_9.wav: RIFF (little-endian) data, WAVE audio, mono 16000 Hz

4_9_2.wav: RIFF (little-endian) data, WAVE audio, mono 16000 Hz

5_0_8.wav: RIFF (little-endian) data, WAVE audio, mono 16000 Hz

6_5_0.wav: RIFF (little-endian) data, WAVE audio, mono 16000 Hz

7_8_6.wav: RIFF (little-endian) data, WAVE audio, mono 16000 Hz

8_2_3.wav: RIFF (little-endian) data, WAVE audio, mono 16000 Hz

9_7_1.wav: RIFF (little-endian) data, WAVE audio, mono 16000 Hz

Armando

unread,

Oct 25, 2019, 1:19:30 PM10/25/19

to kaldi-help

try doing

sox 1_3_5.wav -e signed-integer 1_3_5.new.wav

Santiago Pineda Montoya

unread,

Oct 25, 2019, 3:20:41 PM10/25/19

to kaldi-help

I do that for each file:

for i in *.wav; do echo $i; sox "$i" -e signed-integer -b 16 ${i%%.wav}.new.wav; done

Armando

unread,

Oct 27, 2019, 8:14:43 AM10/27/19

to kaldi-help

yeah..well, does it work?

Santiago Pineda Montoya

unread,

Oct 28, 2019, 10:02:57 AM10/28/19

to kaldi-help

Yeah! Thanks for the help! :D

Reply all

Reply to author

Forward