ctm for words

miamoto9

unread,

May 26, 2021, 8:13:26 PM5/26/21

to kaldi-help

Hello! I have an HMM-DNN trained model and I have a script that creates the alignment and ctm file for a given audio file and corresponding transcription.

I am using get_train_ctm.sh to do it, but I am getting times at the utterance level. Because of this I created the segments file and a reco2file_and_channel file ("utt_id utt_id A"). The original wav file that I am working with has around 38 seconds. After creating these 2 mentioned files, I am getting the same result I was obtaining before, at the utterance level. The TRUe for segments is on. Anyone knows what can be the error?

Can I create the reco2file_and_channel like I mentioned above?

Thanks

Daniel Povey

unread,

May 27, 2021, 1:29:53 AM5/27/21

to kaldi-help

get_train_ctm.sh should use the segments and reco2file_and_channel files, if $data/segments is present and you don't specify "--segments false".

That shouldn't be hard to debug.

--
Go to http://kaldi-asr.org/forums.html to find out how to join the kaldi-help group
---
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/309aa629-ddbe-478e-b161-8639fd7bcd22n%40googlegroups.com.

miamoto9

unread,

May 27, 2021, 8:00:39 AM5/27/21

to kaldi-help

Maybe the problem is in the alignment step? I have a chain model and I am doing alignment with steps/nnet3/align.sh using online ivectors extracted for my single utterance. Is this the correct approach?

Thanks

Daniel Povey

unread,

May 27, 2021, 9:41:11 AM5/27/21

to kaldi-help

You may need the option

--frame-shift 0.03

To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/3ba42b38-d14a-4b1d-8e25-af7bc23cf634n%40googlegroups.com.

miamoto9

unread,

May 27, 2021, 9:54:47 AM5/27/21

to kaldi-help

Yes, i found that in some other chat from kaldi help. Thanks!

Do you recommend using the chain model or the last hmm/gmm model to get ctm file and alignment file?

Daniel Povey

unread,

May 27, 2021, 9:59:14 AM5/27/21

to kaldi-help

Either should be OK

To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/fe69f977-daea-405a-ab6c-4c5a4bb86d23n%40googlegroups.com.

Reply all

Reply to author

Forward