It's trying to train the LM on some data, and I suspect the input data
file (probably data/local/lm/text.no_oov) is empty or does not exist.
Dan
On Tue, Dec 20, 2016 at 10:31 PM, Yeonjong Choi <
cyj...@gmail.com> wrote:
> Dear all
>
> Hello, I'm new for using Kaldi toolkit.
> I have a question about perplexity problem.
>
> I'm now trying to make ASR system for "RSR2015" dataset, using
> fisher_english example scripts.
> I prepared data files (wav.scp, spk2gender, utt2spk, spk2utt, text) in each
> of s5/data/test and s5/data/train_all,
> and I commented out line 11, 12 of run.sh (so I don't use
> local/fisher_data_prep.sh).
>
> When I try to run line 27 of run.sh (local/fisher_train_lms.sh),
> the message shows like below :
>
> ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
> Not installing the kaldi_lm toolkit since it is already there.
> Not creating raw N-gram counts ngrams.gz and heldout_ngrams.gz since they
> already exist in data/local/lm/3gram-mincount
> (remove them if you want them regenerated)
> Iteration 1/6 of optimizing discounting parameters
> discount_ngrams: for n-gram order 1, D=0.600000, tau=0.675000 phi=2.000000
> discount_ngrams: for n-gram order 2, D=0.800000, tau=0.675000 phi=2.000000
> discount_ngrams: for n-gram order 3, D=0.000000, tau=0.825000 phi=2.000000
> interpolate_ngrams: 148 words in wordslist
> Perplexity over 0.000000 words is -nan
> Perplexity over 0.000000 words (excluding 0.000000 OOVs) is -nan
>
> real 0m0.012s
> user 0m0.008s
> sys 0m0.060s
> interpolate_ngrams: 148 words in wordslist
> discount_ngrams: for n-gram order 1, D=0.600000, tau=0.900000 phi=2.000000
> discount_ngrams: for n-gram order 2, D=0.800000, tau=0.900000 phi=2.000000
> discount_ngrams: for n-gram order 3, D=0.000000, tau=1.100000 phi=2.000000
> Perplexity over 0.000000 words is -nan
> Perplexity over 0.000000 words (excluding 0.000000 OOVs) is -nan
>
> real 0m0.013s
> user 0m0.000s
> sys 0m0.080s
> discount_ngrams: for n-gram order 1, D=0.600000, tau=1.215000 phi=2.000000
> discount_ngrams: for n-gram order 2, D=0.800000, tau=1.215000 phi=2.000000
> discount_ngrams: for n-gram order 3, D=0.000000, tau=1.485000 phi=2.000000
> interpolate_ngrams: 148 words in wordslist
> Perplexity over 0.000000 words is -nan
> Perplexity over 0.000000 words (excluding 0.000000 OOVs) is -nan
>
> real 0m0.012s
> user 0m0.000s
> sys 0m0.060s
> Bad perplexities . at
> /work1/t2g-shinoda2011/16M31343/kaldi-trunk/egs/fisher_english/rsr/../../../tools/kaldi_lm/
optimize_alpha.pl
> line 30.
> ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
> Can anyone help me solving this problem?
> Thank you for your help!
>
> Yeonjong Choi
>
> --
> You received this message because you are subscribed to the Google Groups
> "kaldi-help" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to
kaldi-help+...@googlegroups.com.
> For more options, visit
https://groups.google.com/d/optout.