Confidence in alignment, robustness to incorrect transcript

83 views

Skip to first unread message

gschalk

unread,

Nov 20, 2021, 9:00:53 PM11/20/21

to aeneas-forced-alignment

Hi,

Aeneas absolutely rocks!!

I have two questions to the group today:

1) Is a way for me to learn the confidence in the alignment of a particular section of text. In other words, if I want to eliminate those sections that are probably incorrectly aligned, is there a way for me to kick them out based on some metric of performance/fit/etc. produced by aeneas? I understand that, at the most basic level, one could study the fit or offset from diagonal path in the DTW, but I am not sure if any of these variables or other performance metrics may be available on the command line.

2) I have long audio files whose transcripts occasionally miss maybe 15 seconds of spoken text. Sometimes aeneas is robust to the those transcription problems, sometimes it's not. I understand I can make the DTW more "loose," and there are other parameters I can play with, but is there some principled way of thinking about what kind of parameters I should be changing in what ways to get optimal results (kind of like the automated head/tail detection where I provide a max head/tail time)?

Thanks for any help you may provide.

Gerwin

Reply all

Reply to author

Forward

0 new messages