> Im building a subject-dependent system for online decoding
> (gmm-latgen-faster) that allows for swapping out different language models.
> My understanding is, that the trained models for the users (final.mdl) and
> the language models (HCLG.fst) are mostly independent of each other, meaning
> that I can train several user-models and language-models:
>
> final_user1.mdl
> final_user2.mdl
> ...
>
> HCLG_1.fst + words_1.txt
> HCLG_2.fst + words_2.txt
> ...
>
> and use any triplet of them (final_user*.mdl, HCLG_*.fst, words_*.txt) for
> online decoding. Is this correct?
That seems reasonable.
> My approach to building the language-models and user-models is the following
> (please correct me if I got something wrong):
>
> - use prepare_lang.sh
> - run the training script several times with training data of different
> users to get the user-models (final_user1.mdl, final_user2.mdl, etc.) and
> (tree)
> (the tree should be the same for each pass, right?)
You won't be able to use the same graph if the tree is not the same.
It's unusual to train models per user; this likely won't work the best
unless you have quite a lot of data per user (e.g. more than 10
hours). I would just train the normal way using adaptation, via
train_sat.sh.
If you have a reasonable amount of data per user (e.g. at least an
hour or two) it might be worthwhile to try MAP adaptation of the
speaker-independent model to the individual speaker characteristics
using train_map.sh though.
> - then for each new language model I change words_*.txt and run
> prepare_lang.sh again. I know I need to make sure to use the same phoneset
> (pass "--phone-symbol-table phones.txt"). Is there anything else to watch
> out for?
What you describe should work.
> - then I run makegraph.sh using the new L_disambig.fst, G.fst,
disambig.int
> and ANY of the (tree) and (final_user*.mdl) files to build HCLG_*.fst
> - Is it a problem, if the disambiguation symbols created by prepare_lang.sh
> (
disambig.int, L_disambig.fst) differ from the ones used for training the
> user-models?
That doesn't matter.
Dan
>
> --
> Go to
http://kaldi-asr.org/forums.html find out how to join
> ---
> You received this message because you are subscribed to the Google Groups
> "kaldi-help" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to
kaldi-help+...@googlegroups.com.
> To post to this group, send email to
kaldi...@googlegroups.com.
> To view this discussion on the web visit
>
https://groups.google.com/d/msgid/kaldi-help/4ebfd8ba-2a0c-4f4d-9a1b-a2c7a60021f1%40googlegroups.com.
> For more options, visit
https://groups.google.com/d/optout.