Google recognizer dictionary & MS Azure recognizer


Clayton Hann

Aug 14, 2024, 6:34:31 AM
to utterlyvoiceusers
Thanks for the latest version. The spacing options really helped. The "newest" recognizer updates were a bit of a letdown (not Utterly Voice's fault). Deepgram medical is far behind Google. Running Whisper locally will be a challenge unless you have high-end hardware with a GPU.

Any chance of including Whisper via the OpenAI API, rather than locally? Also, any chance of getting MS Azure medical as a recognizer?

Finally, if there's any way to keep a list of commonly used words in a local file that you could pass to the Google API, that would be great. The Google API has that feature.

Thanks again for the software.   

Utterly Voice

Aug 14, 2024, 9:37:49 AM
to Clayton Hann, utterlyvoiceusers
It's great to hear that the spacing options are working for you!

Yes, we were disappointed by Whisper as well. Using high-end hardware can help, but the real issue is that Whisper does not truly support streaming. With Whisper, we have to wait until the utterance audio buffer is complete before we start processing the data. With streaming recognizers, we start processing the data at the beginning of an utterance. This makes a big difference for real-time speech recognition. Whisper is better suited to processing audio files.
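
To make the latency difference concrete, here is a toy timing model in Python. It is only a sketch under simplifying assumptions, not how Utterly Voice or any recognizer is actually implemented: audio is assumed to arrive in fixed-size chunks, processing cost is assumed to be a fixed number of milliseconds per chunk, and the function names and numbers are made up for illustration.

```python
def buffered_latency_ms(num_chunks: int, process_ms_per_chunk: int) -> int:
    """Delay after the end of speech when processing cannot start
    until the whole utterance has been buffered."""
    # All of the work happens after the speaker has finished.
    return num_chunks * process_ms_per_chunk


def streaming_latency_ms(num_chunks: int, process_ms_per_chunk: int,
                         chunk_ms: int = 100) -> int:
    """Delay after the end of speech when each chunk is processed
    as soon as it arrives (chunks are handled one at a time, in order)."""
    finish = 0  # time at which the previous chunk finished processing
    for i in range(num_chunks):
        arrival = (i + 1) * chunk_ms       # chunk is complete at this time
        start = max(arrival, finish)       # wait for the chunk and for the worker
        finish = start + process_ms_per_chunk
    end_of_speech = num_chunks * chunk_ms
    return finish - end_of_speech


if __name__ == "__main__":
    # Hypothetical 3-second utterance: 30 chunks of 100 ms, 50 ms of work each.
    print("buffered: ", buffered_latency_ms(30, 50), "ms")
    print("streaming:", streaming_latency_ms(30, 50), "ms")
```

In this model the buffered approach makes the user wait for the full processing time after they stop speaking, while the streaming approach only leaves the last chunk to finish, as long as processing keeps up with the incoming audio.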

We have added the OpenAI API and MS Azure to our task list for upcoming versions.

We have attempted to use Google's speech context in the past, but we found that our internal biasing actually performed better. Have you tried that? See the "phrase bias" section at https://utterlyvoice.com/help/bias. You can look at the "basic" mode for an example, where certain phrases have a negative bias (0.5). Your mode would look similar to that, but with positive biases (1.5 is usually sufficient).
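
As a rough illustration of the general idea behind biasing, the Python sketch below rescales the scores of candidate transcriptions. It does not show Utterly Voice's actual mode file format or internal algorithm, which are documented at the link above. It assumes, purely for illustration, that a bias is a multiplier on a hypothesis score (below 1.0 penalizes, above 1.0 favors); the function name, the scores, and the example phrases are all hypothetical.

```python
from typing import Dict, List, Tuple


def apply_phrase_bias(hypotheses: List[Tuple[str, float]],
                      biases: Dict[str, float]) -> List[Tuple[str, float]]:
    """Multiply each hypothesis score by the bias of every phrase it contains,
    then return the hypotheses sorted from best to worst.

    Assumption: scores are positive and higher is better.
    Matching is a plain case-insensitive substring test, which is crude
    (it can also match inside longer words)."""
    rescored = []
    for text, score in hypotheses:
        lowered = text.lower()
        for phrase, factor in biases.items():
            if phrase.lower() in lowered:
                score *= factor
        rescored.append((text, score))
    return sorted(rescored, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical recognizer output: the wrong phrase scores slightly higher.
    candidates = [
        ("the patient has a trophy", 0.50),
        ("the patient has atrophy", 0.45),
    ]
    # Hypothetical positive bias for a commonly used medical term.
    ranked = apply_phrase_bias(candidates, {"atrophy": 1.5})
    print(ranked[0][0])
```

With a positive bias of 1.5 on the medical term, the second candidate overtakes the first even though the recognizer originally scored it lower.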
