Vosk Speech to text

13 views
Skip to first unread message

Akash Sharma

unread,
May 22, 2024, 1:21:00 AMMay 22
to kaldi-help
Hi, I am working on a speech to text project for converting Hindi Audio calls to transcripts. I am using  vosk-model-hi-0.22’. The model gives good output but due to different accents and background noise, the model apparently hallucinates and gives out random (sometimes rhyming) words. I have used audio pre-processing as well, but it doesn't change the output enough. Need suggestions on training and fine tuning.
Reply all
Reply to author
Forward
0 new messages