Re: kaldi-DNN

45 views
Skip to first unread message

Daniel Povey

unread,
Jan 20, 2026, 12:15:06 AMJan 20
to Hema A. Murthy, Vighnesh Bantwal Kamath, Arunan J, kaldi...@googlegroups.com
I recommend to start clean with a pytorch-based approach e.g. based on ctc; that performs better these days with modern neural net architectures.  It should be quite easy to find something.  

On Mon, Jan 19, 2026 at 6:27 PM Hema A. Murthy <he...@cse.iitm.ac.in> wrote:
Dear Dan Povey
       We are trying to replace the emission probabilities obtained from DNN with our own custom probabilities.
Could somebody help us with the files that are relevant?  There are way too many NN codes in kaldi-tree.
All of us are pretty new to kaldi itself.
Thanks, Hema A Murthy

Hema A. Murthy

unread,
Jan 22, 2026, 4:34:59 AM (13 days ago) Jan 22
to kaldi-help

Thanks.  Alignments using CTC are not as good as HMM+Signal Processing for Indian language data.
-hema

Daniel Povey

unread,
Jan 22, 2026, 7:36:00 AM (13 days ago) Jan 22
to kaldi...@googlegroups.com
oh i see.  for alignment you could probably use one of the standard GMM recipes.

--
Go to http://kaldi-asr.org/forums.html to find out how to join the kaldi-help group
---
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/kaldi-help/3f550166-1527-4701-9a21-2f1fe1819ff8n%40googlegroups.com.

ASR_OCEAN

unread,
Jan 23, 2026, 4:32:32 AM (12 days ago) Jan 23
to kaldi-help
Hi Dan!
I have also a related issue.
I am trying to find phoneme alignment using pytorch CTC. 
For TIMIT dataset it works very well (training loss goes down below 1.0) 
but for non-English language such as Evenki language (https://doreco.huma-num.fr/preview/languages/even1259), 
the CTC loss is between 2 and 3 and the predictions is not correct.

Thank you.

Daniel Povey

unread,
Jan 23, 2026, 10:22:03 AM (12 days ago) Jan 23
to kaldi...@googlegroups.com
maybe you have not enough data to train that language.  you could maybe try fine-tuning a system already trained for a previous language, that might work better.
but GMM-based recipes are definitely more stable.


Reply all
Reply to author
Forward
0 new messages