How to do onilne decoding after triphone training?

158 views
Skip to first unread message

achintyaha

unread,
Jul 22, 2019, 6:06:54 AM7/22/19
to kaldi-help
Hi,

I have successfully trained monophones and triphones. I am also able to get decoding result.
Now I am looking to do online decoding (provide only audio and get the transcription results)

Right now I can only find onlline decoding methods using nnet, so is that the only way and if yes, can that be done on a local machine (Macbook air laptop)?

Data size:
Approximately 60 hrs.

Link I followed:

Steps I used so far:

x=data/train
mfccdir=mfcc
steps/make_mfcc.sh --cmd run.pl $x exp/make_mfcc/$x $mfccdir
steps/compute_cmvn_stats.sh $x exp/make_mfcc/$x $mfccdir


steps/train_mono.sh --boost-silence 1.25 --cmd "run.pl" data/train data/lang exp/mono

steps/align_si.sh --boost-silence 1.25 --cmd "run.pl" data/train data/lang exp/mono exp/mono_ali


steps/train_deltas.sh --boost-silence 1.25 --cmd "run.pl" 2000 10000 data/train data/lang exp/mono_ali exp/tri1

steps/align_si.sh --cmd "run.pl" data/train data/lang exp/tri1 exp/tri1_ali


steps/train_lda_mllt.sh --cmd "run.pl" --splice-opts "--left-context=3 --right-context=3" 2500 15000 data/train data/lang exp/tri1_ali exp/tri2a

steps/align_si.sh --cmd "run.pl" --use-graphs true data/train data/lang exp/tri2a exp/tri2a_ali


steps/train_sat.sh --cmd "run.pl" 2500 15000 data/train data/lang exp/tri2a_ali exp/tri3a

steps/align_fmllr.sh --cmd "run.pl" data/train data/lang exp/tri3a exp/tri3a_ali


steps/train_sat.sh --cmd "run.pl" 4200 40000 data/train data/lang exp/tri3a_ali exp/tri4a

steps/align_fmllr.sh --cmd "run.pl" data/train data/lang exp/tri4a exp/tri4a_ali


utils/mkgraph.sh data/lang exp/tri4a exp/tri4a/graph || exit 1


steps/online/prepare_online_decoding.sh data/train data/lang exp/tri4a exp/tri4a_online


Next suggested steps:

steps/online/nnet2/train_diag_ubm.sh data/train 1024 exp/tri4b exp/diag_ubm

steps/online/nnet2/train_ivector_extractor.sh data/train exp/diag_ubm exp/extractor

steps/online/nnet2/extract_ivectors_online.sh data/train exp/extractor exp/ivectors_train


Daniel Povey

unread,
Jul 22, 2019, 3:49:23 PM7/22/19
to kaldi-help
No you can't do that kind of thing on a mac laptop, you need a server with GPU(s).

It is possible to do online decoding with GMMs though.
See, for example,
egs/rm/s5/local/online/run_gmm.sh


--
Go to http://kaldi-asr.org/forums.html find out how to join
---
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/e769fec5-90c2-4e83-bac7-ea9d6775f409%40googlegroups.com.
Reply all
Reply to author
Forward
0 new messages