show-alignments data/lang/phones.txt exp/mono/0.mdl \ "ark,t:gunzip -c exp/mono/ali.1.gz |"
to get readable alignment files. Am I right in saying the numbers in [ ] in the alignment files are transition ids corresponding to each frame in sequential order (frame 1 - id1, frame 2-id3..etc)? I am only interested in getting the frames for the aligned phoneme but I wanted to be sure I am understanding the file correctly.
Moreover, how good the alignments are (say for tri3 model) if the utterance is one sentence long or just one word long? Alignments for the single word long utterances are more accurate than sentence long utterance. My experiment works on specific frames extracted, so the alignments need to be as accurate as possible. Any guidance would be appreciated.
With regards,
Subash
--
Go to http://kaldi-asr.org/forums.html find out how to join
---
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+...@googlegroups.com.
To post to this group, send email to kaldi...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/CA%2B30hV%3DbmUMFVJiw_1-qTw_1%2BUX7mQ%3DvGqAhE%2BXeHTW9ys%2BQQw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.
To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/CAEWAuyRBbCmrNS3q6oK1SD-GX%3DSvq6cTFpB6Xy_a1HVUyEsUow%40mail.gmail.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/CA%2B30hVn7EbKFBskA8_MAvm1Ys3t-2KqXXbv21r%2BK3Ktj4UQiZg%40mail.gmail.com.