Issue with language recognition


Girish Gopinath

Mar 18, 2024, 10:05:59 PM
to UniMRCP
Hello everyone:

We have a production system that uses Asterisk, UniMRCP and GSR for recognising speech. One requirement is that the caller says the name of the language in which they need our service. The system recognises the language name, converts it to text and takes the appropriate action.

This works fine, except for the language Tagalog. When callers say 'Tagalog', the system doesn't find a match. We did some testing and debugging and found that the problem is the pronunciation of 'Tagalog'. Callers pronounce it "Tuh-gaa-log", and for that pronunciation the confidence level is very low, so UniMRCP reports a no-match. Below are a couple of results from our tests:

Successful: We pronounced "Taa-gaa-log". Confidence is 0.63
{"transcript":"Tagalog","confidence":0.630923927}],"isFinal":true,"resultEndTime":"7.860s","languageCode":"en-us"}

Failure: Actual pronunciation "Tuh-gaa-log". Confidence is 0.27
{"transcript":"Tagalog","confidence":0.276484549}],"isFinal":true,"resultEndTime":"8.040s","languageCode":"en-us"}
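
To make the behaviour concrete, here is a minimal Python sketch of how we understand the match / no-match decision. It is only an illustration: the threshold of 0.5 and the function name classify are assumptions, not values taken from our UniMRCP configuration. The two confidence scores are the ones from the results above.

```python
# Hypothetical threshold for illustration only; the real value depends on
# the recognizer configuration and has not been confirmed.
CONFIDENCE_THRESHOLD = 0.5


def classify(confidence: float, threshold: float = CONFIDENCE_THRESHOLD) -> str:
    """Return 'match' when the confidence reaches the threshold, else 'no-match'."""
    return "match" if confidence >= threshold else "no-match"


# (spoken form, transcript, confidence) taken from the two test results.
results = [
    ("Taa-gaa-log", "Tagalog", 0.630923927),
    ("Tuh-gaa-log", "Tagalog", 0.276484549),
]

for spoken, transcript, confidence in results:
    outcome = classify(confidence)
    print(f"{spoken}: transcript={transcript} confidence={confidence:.2f} -> {outcome}")
```

In both cases the transcript is the same ("Tagalog"); only the confidence differs, which is why the second utterance ends up as a no-match under any threshold between the two scores.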

We tried modifying the grammar by adding an entry "Tuhgaalog", but that did not work. Obviously, we cannot ask callers to say "Taa-gaa-log" when they call.

Has anyone experienced a similar problem? If so, please share how you resolved it.

Any help in resolving this is greatly appreciated.

Thank you,
/Girish