Cannot find utterance id KWS

256 views
Skip to first unread message

aliiire...@gmail.com

unread,
Jan 30, 2018, 2:15:21 AM1/30/18
to kaldi-help
Hi I'm afraid to ask this
I want to run KWS code in FARDAT dataset but in log output said: "Cannot find utterance id".
what's wrong?


# lattice-add-penalty --word-ins-penalty=0 "ark:gzip -cdf exp/mono/decode_test/lat.1.gz|" ark:- | lattice-align-words --max-expand=180 data/lang_test_word//phones/word_boundary.int exp/mono/final.mdl ark:- ark:- | lattice-scale --acoustic-scale=0.1 --lm-scale=1.0 ark:- ark:- | lattice-to-kws-index --max-states-scale=4 --allow-partial=true --frame-subsampling-factor=1 --max-silence-frames=50 --strict=true ark:data/kws//utter_id ark:- ark:- | kws-index-union --skip-optimization=false --strict=true --max-states=1000000 ark:- "ark:|gzip -c > exp/mono/decode_test/kws/index.1.gz" 
# Started at Tue Jan 30 10:31:02 +0330 2018
#
lattice-scale --acoustic-scale=0.1 --lm-scale=1.0 ark:- ark:- 
lattice-add-penalty --word-ins-penalty=0 'ark:gzip -cdf exp/mono/decode_test/lat.1.gz|' ark:- 
lattice-to-kws-index --max-states-scale=4 --allow-partial=true --frame-subsampling-factor=1 --max-silence-frames=50 --strict=true ark:data/kws//utter_id ark:- ark:- 
kws-index-union --skip-optimization=false --strict=true --max-states=1000000 ark:- 'ark:|gzip -c > exp/mono/decode_test/kws/index.1.gz' 
lattice-align-words --max-expand=180 data/lang_test_word//phones/word_boundary.int exp/mono/final.mdl ark:- ark:- 
LOG (lattice-to-kws-index[5.2]:main():lattice-to-kws-index.cc:105) Processing lattice 007_1_43
WARNING (lattice-to-kws-index[5.2]:main():lattice-to-kws-index.cc:114) Cannot find utterance id for 007_1_43
LOG (lattice-to-kws-index[5.2]:main():lattice-to-kws-index.cc:105) Processing lattice 007_1_44
WARNING (lattice-to-kws-index[5.2]:main():lattice-to-kws-index.cc:114) Cannot find utterance id for 007_1_44
LOG (lattice-to-kws-index[5.2]:main():lattice-to-kws-index.cc:105) Processing lattice 007_1_45
WARNING (lattice-to-kws-index[5.2]:main():lattice-to-kws-index.cc:114) Cannot find utterance id for 007_1_45
LOG (lattice-to-kws-index[5.2]:main():lattice-to-kws-index.cc:105) Processing lattice 007_1_46
WARNING (lattice-to-kws-index[5.2]:main():lattice-to-kws-index.cc:114) Cannot find utterance id for 007_1_46
LOG (lattice-to-kws-index[5.2]:main():lattice-to-kws-index.cc:105) Processing lattice 007_1_47
WARNING (lattice-to-kws-index[5.2]:main():lattice-to-kws-index.cc:114) Cannot find utterance id for 007_1_47
.
.
.[51 line similar]
.

LOG (lattice-align-words[5.2]:main():lattice-align-words.cc:125) Successfully aligned 81 lattices; 0 had errors.
LOG (lattice-scale[5.2]:main():lattice-scale.cc:90) Done 81 lattices.
LOG (lattice-to-kws-index[5.2]:main():lattice-to-kws-index.cc:105) Processing lattice 097_2_41
WARNING (lattice-to-kws-index[5.2]:main():lattice-to-kws-index.cc:114) Cannot find utterance id for 097_2_41
LOG (lattice-to-kws-index[5.2]:main():lattice-to-kws-index.cc:220) Done 0 lattices, failed for 81
LOG (kws-index-union[5.2]:main():kws-index-union.cc:104) Done 0 indices
# Accounting: time=1 threads=1
# Ended (code 1) at Tue Jan 30 10:31:03 +0330 2018, elapsed time 1 seconds

Jan Trmal

unread,
Jan 30, 2018, 4:35:12 PM1/30/18
to kaldi-help
that would mean something is wrong with the file
 data/kws//utter_id

it should contain lines such as 
007_1_43 1
007_1_44 2
and so on (the numbers will most probably be different)
y.

--
Go to http://kaldi-asr.org/forums.html find out how to join
---
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+unsubscribe@googlegroups.com.
To post to this group, send email to kaldi...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/9ef68939-a828-44a9-b1e4-4f510ec02728%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

aliiire...@gmail.com

unread,
Feb 1, 2018, 5:57:27 AM2/1/18
to kaldi-help
Hi, Thanks

I change this code
local/kws_data_prep.sh
line 39
cat $datadir/wav.scp
to 
cat $datadir/stm
that's run, correctly(maybe!). 


where is "lattice archive" say in "kws_data_prep.sh"?

for compute-atwv how can create "ref-rspecifier/hyp-rspecifier"? handly or have a function?
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+...@googlegroups.com.

Jan Trmal

unread,
Feb 1, 2018, 2:37:50 PM2/1/18
to kaldi-help
I think you have some very old version of kaldi or damaged copy of it 
see
the line 39 does not contain that statement at all.
y.

To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+unsubscribe@googlegroups.com.

To post to this group, send email to kaldi...@googlegroups.com.

aliiire...@gmail.com

unread,
Feb 1, 2018, 3:13:32 PM2/1/18
to kaldi-help
i used WSJ 



and I want to used babel in future.

please tell about, How can create "ref-rspecifier/hyp-rspecifier"? handly or have a function? 

Jan Trmal

unread,
Feb 1, 2018, 3:25:03 PM2/1/18
to kaldi-help
OK, I'm not familiar with that setup -- in general, the names of the lattices will be primarily driven by the utterance names (i.e. you should use the segments file).
The wav.scp use will be valid only in cases when you have single utterance per file (which might be the case for WSJ, but I'm simply not sure enough to argue).

ad compute-atwv -- if you run the command without any parameters, it will print a fairly extensive info.
the file being <ref-rspecifier> is your reference. You will have to generate it by hand or write your own script for that. There is a "cheat-script" for cases where the reference is in form of the rttm file, but that script needs F4DE installed and on overall it's easier to generate the reference by "hand".
the file being <hyp-rspecifier> contains your hypotheses about the location of the individual keywords. That file gets generated during the keyword search.
y.

y.  

To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+unsubscribe@googlegroups.com.

To post to this group, send email to kaldi...@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages