--
Go to http://kaldi-asr.org/forums.html find out how to join
---
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+unsubscribe@googlegroups.com.
To post to this group, send email to kaldi...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/31be4565-9890-408f-afff-193ec36dda40%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Use the correct --frame-shift parameter (chain models usually run with a frame-subsampling factor of 3 relative to the original feature frame rate).
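To illustrate why a wrong frame shift produces a growing offset, here is a small sketch of the arithmetic. It assumes a 10 ms feature frame shift and a subsampling factor of 3, which are common defaults but are not stated in this thread. The function names are made up for the example and are not part of Kaldi.

```python
def effective_frame_shift(base_frame_shift=0.01, subsampling_factor=3):
    """Seconds covered by one model output frame.

    Both defaults are assumptions (10 ms features, subsampling by 3);
    check the values used when the model was trained.
    """
    return base_frame_shift * subsampling_factor


def frame_to_seconds(frame_index, frame_shift):
    """Convert an output frame index to a time in seconds."""
    return frame_index * frame_shift


if __name__ == "__main__":
    assumed_shift = 0.01                      # shift that ignores subsampling
    actual_shift = effective_frame_shift()    # about 0.03 under the assumptions above

    # The error is proportional to the frame index, so it is small near the
    # start of a file and keeps growing towards the end.
    for frame in (100, 1000, 10000):
        wrong = frame_to_seconds(frame, assumed_shift)
        right = frame_to_seconds(frame, actual_shift)
        print(f"frame {frame}: wrong={wrong:.2f}s right={right:.2f}s "
              f"error={right - wrong:.2f}s")
```

Under these assumptions the value to pass as --frame-shift would be about 0.03. If your features or model use a different frame shift or subsampling factor, substitute those values.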
On Tue, Mar 27, 2018 at 7:12 AM, crpatel <chirag...@gmail.com> wrote:
I have trained a chain model on my own data. The model performs well in terms of WER, but the time stamps it produces are not accurate. They seem accurate at the beginning of the audio but start to drift towards the end. Our audio files are between 2 and 8 minutes long, and the drift is particularly visible in the long files. The offset in the word time stamps appears to grow steadily as the audio progresses.
My decoding pipeline is as follows:
lattice-push | lattice-align-words | lattice-to-ctm-conf
Could you please explain why this is happening?