I am using VOSK and need to create a new small model.


Sean McGhee

Oct 18, 2022, 11:03:46 AM
to kaldi-help
Hi, I am using VOSK to provide voice command support for some applications. To reduce ambiguity, I need to limit the model to a specific set of about 1500 words and nothing more.

I see that the models VOSK uses are based on Kaldi models, and I have Kaldi installed on an Ubuntu VM (I do not have access to a larger server). What is the simplest way to create a model from this word list that will be compatible with VOSK?

Thanks!
Sean

hart...@gmail.com

Oct 19, 2022, 4:10:32 AM
to kaldi-help
VOSK provides a so-called "update package" for some languages. With the update package, you can change the vocabulary easily. See https://alphacephei.com/vosk/lm

Sage Khan (Sage Khan)

Oct 19, 2022, 6:22:27 AM
to kaldi-help
Train the model with Kaldi. https://alphacephei.com/vosk/lm lists the directory structure required for a VOSK-based ASR model. The model files VOSK looks for are already present in a Kaldi-trained model; you just need to copy them into the right places.
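
As a rough illustration of the copy step, here is a minimal Python sketch that reports which files are still missing from a model directory after copying. The file list is an assumption based on a typical Kaldi nnet3/chain layout and is not taken from this thread; check it against the page linked above before relying on it. The function name `missing_model_files` is hypothetical.

```python
import sys
from pathlib import Path

# ASSUMPTION: illustrative file list only. Some VOSK models use other graph
# files (for example Gr.fst and HCLr.fst instead of HCLG.fst), so adjust
# this list to match the layout documented for your model type.
EXPECTED_FILES = [
    "am/final.mdl",
    "conf/mfcc.conf",
    "graph/HCLG.fst",
    "graph/words.txt",
    "ivector/final.ie",
]


def missing_model_files(model_dir, expected=EXPECTED_FILES):
    """Return the expected relative paths that are not files under model_dir."""
    root = Path(model_dir)
    return [rel for rel in expected if not (root / rel).is_file()]


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python check_model.py /path/to/model
    for rel in missing_model_files(sys.argv[1]):
        print("missing:", rel)
```

An empty result only means the listed files exist; it does not confirm that the model will load or decode correctly in VOSK.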

hart...@gmail.com

Oct 19, 2022, 8:16:09 AM
to kaldi-help
Yes, sorry. My link was too general: the sections "Update process" and "Outputs" are relevant for the update packages. https://alphacephei.com/vosk/lm#update-process

Sean McGhee

Oct 19, 2022, 9:51:34 AM
to kaldi-help
Thanks, all, for the info. I will see what I can do :-)