Question on WER calculation using Levnenshtein distance

37 views
Skip to first unread message

Anantha Krishnan

unread,
Mar 3, 2024, 10:31:49 AMMar 3
to kaldi-help
I see that WER = (S + D + I)/N, where N is the total number of words. I also see that Levenshtein edit distance is used in the compute-wer.cc program. 

Do these three edits correspond to phonemes or words. Meaning, does substitution imply the number of words substituted in the edit distance calculation between the utterance and the reference sentence? Or does it imply the number of phonemes substituted in finding the Levnenshtein distance between the word of utterance and the reference word? 

I hope it corresponds to phonemes. Otherwise, it doesn't make sense. 

nshm...@gmail.com

unread,
Mar 4, 2024, 2:27:30 AMMar 4
to kaldi-help
> Do these three edits correspond to phonemes or words.

They correspond to words, hence the name "word error rate"
> I hope it corresponds to phonemes. Otherwise, it doesn't make sense. 

There are separate PER and CER (phoneme error rate and character error rate)
Reply all
Reply to author
Forward
0 new messages