Error in utils/prepare_lang.sh for G.fst

235 views
Skip to first unread message

achintyaha

unread,
Jul 12, 2019, 3:28:18 AM7/12/19
to kaldi-help

I created an arpa format lm using CMU-LM_Toolkit and converted it to G.fst using 'arpa2fst --disambig-symbol=#0 cmu.arpa G.fst'.
I am getting this outpput where running 'utils/prepare_lang.sh data/local/dict '<OOV>' data/local/lang data/lang' and not able to understand what exactly is the issue?

.
.
--> data/lang/G.fst is determinizable
FATAL: FstCompiler: Bad arc ilabel integer = "#0", source = standard input, line = 1
ERROR: FstHeader::Read: Bad FST header: standard input
--> ERROR: failure running check_g_properties.pl
--> Testing determinizability of L_disambig . G
fstdeterminizestar
fsttablecompose data/lang/L_disambig.fst data/lang/G.fst
--> L_disambig . G is determinizable
--> ERROR (see error messages above)
prepare_lang.sh: error validating output
.
.

achintyaha

unread,
Jul 12, 2019, 7:03:35 AM7/12/19
to kaldi-help
This is the G.fst

3    0    #0    <eps>    8.62525368
3    7637    a    a    4.37306929
3    7638    about    about    7.58172226
3    7639    accept    accept    8.56262302
3    7640    accompanied    accompanied    8.56262302
3    7641    according    according    7.71527147
3    7642    accordingly    accordingly    8.56262302
3    7643    achieve    achieve    8.96810818
3    7644    acting    acting    8.96810818
3    7645    action    action    8.05167961
.
.
.

Daniel Povey

unread,
Jul 12, 2019, 6:19:17 PM7/12/19
to kaldi-help
It looks to me like for some reason the LM disambiguation symbol #0 is not present in your words.txt.
It should be added by prepare_lang.sh, I think.

--
Go to http://kaldi-asr.org/forums.html find out how to join
---
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+...@googlegroups.com.
To post to this group, send email to kaldi...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/140b5803-c590-45e0-9742-0edd1ae7b8b1%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

achintyaha

unread,
Jul 13, 2019, 3:42:14 AM7/13/19
to kaldi-help
Thanks Dan.
Wasable to resolve it.

But now I am stuck at a new error.
While mfcc feature extration, I am getting error (printed in exp/make_mfcc/data/train/make_mfcc_train.*.log)

ERROR (compute-mfcc-feats[5.5.235~1-8cbd5]:Read():parse-options.cc:365) Invalid option --write-utt2dur=ark,t:exp/make_mfcc/data/train/utt2dur.2

Recently I did git pull and after that it is failing. What can be the reason for this error?

Sumit
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi...@googlegroups.com.

Jan Trmal

unread,
Jul 13, 2019, 8:27:47 AM7/13/19
to kaldi-help
you didn't recompile kaldi after git pull
y.

To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+...@googlegroups.com.

To post to this group, send email to kaldi...@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages