What does gmm-est do?


willy8...@gmail.com

Apr 6, 2016, 12:08:16 AM
to kaldi-help
Hi, I've found that Kaldi uses the gmm-est command in several GMM-HMM training scripts, such as train_mono.sh, train_deltas.sh, etc.
The documentation for gmm-est says it will "Do Maximum Likelihood re-estimation of GMM-based acoustic model", which, as far as I know, means the traditional Baum-Welch re-estimation method. However, some people have told me that it implements the segmental k-means algorithm rather than Baum-Welch.
Can somebody tell me which it is? Also, if it is Baum-Welch, is there a command that performs segmental k-means training?
Thanks.

Daniel Povey

Apr 6, 2016, 3:01:23 PM
to kaldi-help
It's Baum-Welch.  I'm not even sure what segmental k-means is, but it sounds like something quite obscure and not in the mainstream of speech recognition.
Dan




Xiang Li

Apr 8, 2016, 7:34:35 AM
to kaldi-help, willy8...@gmail.com
Segmental k-means is an HTK term. IIRC, in Kaldi terms it corresponds to a GMM Baum-Welch update plus realignment.
The corresponding commands are in every GMM-HMM training script, for example train_mono.sh.
What the HTK book calls HMM Baum-Welch would, in Kaldi terms, be the GMM Baum-Welch update plus forward-backward.
Kaldi uses Viterbi alignment instead of forward-backward.
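
To make the Viterbi versus forward-backward distinction above concrete, here is a minimal toy sketch in Python. It is not Kaldi or HTK code: the function names, the two-state HMM, the single 1-D Gaussian per state, and all the numbers are made up for illustration. Both variants apply the same maximum-likelihood mean update; they differ only in how frames are assigned to states (hard 0/1 occupancies from the best path versus soft posteriors).

```python
import math

def log_gauss(x, mean, var):
    """Log density of a 1-D Gaussian."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)

def logsumexp(values):
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))

def viterbi_occupancies(obs, means, var, log_init, log_trans):
    """Hard 0/1 state occupancies taken from the single best state path."""
    T, S = len(obs), len(means)
    delta = [[log_init[s] + log_gauss(obs[0], means[s], var) for s in range(S)]]
    back = [[0] * S]
    for t in range(1, T):
        row, ptr = [], []
        for s in range(S):
            prev = max(range(S), key=lambda p: delta[t - 1][p] + log_trans[p][s])
            ptr.append(prev)
            row.append(delta[t - 1][prev] + log_trans[prev][s]
                       + log_gauss(obs[t], means[s], var))
        delta.append(row)
        back.append(ptr)
    # Trace the best path backwards from the best final state.
    path = [max(range(S), key=lambda s: delta[T - 1][s])]
    for t in range(T - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return [[1.0 if s == path[t] else 0.0 for s in range(S)] for t in range(T)]

def forward_backward_occupancies(obs, means, var, log_init, log_trans):
    """Soft state posteriors (gamma) from the forward-backward recursions."""
    T, S = len(obs), len(means)
    emit = [[log_gauss(x, means[s], var) for s in range(S)] for x in obs]
    alpha = [[log_init[s] + emit[0][s] for s in range(S)]]
    for t in range(1, T):
        alpha.append([logsumexp([alpha[t - 1][p] + log_trans[p][s] for p in range(S)])
                      + emit[t][s] for s in range(S)])
    beta = [[0.0] * S for _ in range(T)]
    for t in range(T - 2, -1, -1):
        for s in range(S):
            beta[t][s] = logsumexp([log_trans[s][n] + emit[t + 1][n] + beta[t + 1][n]
                                    for n in range(S)])
    total = logsumexp(alpha[T - 1])
    return [[math.exp(alpha[t][s] + beta[t][s] - total) for s in range(S)]
            for t in range(T)]

def reestimate_means(obs, gamma):
    """ML mean update: occupancy-weighted average of the frames per state."""
    S = len(gamma[0])
    return [sum(g[s] * x for g, x in zip(gamma, obs)) / sum(g[s] for g in gamma)
            for s in range(S)]

# Toy data (hypothetical): frames near 0, then frames near 4, one ambiguous frame.
obs = [0.1, -0.2, 0.3, 1.9, 3.9, 4.2, 4.0]
means, var = [0.0, 4.0], 1.0
log_init = [math.log(0.5), math.log(0.5)]
log_trans = [[math.log(0.9), math.log(0.1)], [math.log(0.1), math.log(0.9)]]

hard = viterbi_occupancies(obs, means, var, log_init, log_trans)
soft = forward_backward_occupancies(obs, means, var, log_init, log_trans)
hard_means = reestimate_means(obs, hard)
soft_means = reestimate_means(obs, soft)
print("means from Viterbi occupancies:         ", hard_means)
print("means from forward-backward occupancies:", soft_means)
```

In this toy setup the ambiguous frame (1.9) is given entirely to one state by the Viterbi path, while forward-backward splits it between the two states, so the two sets of updated means differ slightly. Alternating "realign, then update" with the hard occupancies is the loop the reply above describes.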


On Wednesday, April 6, 2016 at 12:08:16 PM UTC+8, willy8...@gmail.com wrote: