I am using VOSK and need to create a new small model.

Sean McGhee

unread,

Oct 18, 2022, 11:03:46 AM10/18/22

to kaldi-help

Hi, I am using VOSK to provide verbal commanding support for some applications. I need to limit the model to a specific set of words and no more to reduce ambiguity - about 1500 words.

I see that the models VOSK uses are based on Kaldi models and I have Kaldi installed on an Ubuntu VM (I do not have access to a larger server). What is the simplest way I can go about creating a model from this corpus which will be compatible with VOSK?

Thanks!

Sean

hart...@gmail.com

unread,

Oct 19, 2022, 4:10:32 AM10/19/22

to kaldi-help

VOSK provides a so-called "update package" for some languages. With the update package, you can change the vocabulary easily. see https://alphacephei.com/vosk/lm

Sage Khan (Sage Khan)

unread,

Oct 19, 2022, 6:22:27 AM10/19/22

to kaldi-help

Train the model with Kaldi. https://alphacephei.com/vosk/lm gives the list directory structure required for implementing VOSK based ASR. The model file vosk looks for are already in KALDI trained model. Just need to copy paste them to work.

hart...@gmail.com

unread,

Oct 19, 2022, 8:16:09 AM10/19/22

to kaldi-help

Yes, sorry. My link was too general: the sections "Update process" and "Outputs" are relevant for the update packages. https://alphacephei.com/vosk/lm#update-process

Sean McGhee

unread,

Oct 19, 2022, 9:51:34 AM10/19/22

to kaldi-help

Thanks, all, for the info. I will see what I can do :-)

Reply all

Reply to author

Forward