Even if "<SPOKEN_NOISE>" doesn't appear in the transcripts, words in
the transcripts that are out of the vocabulary may get mapped to it
automatically if lang/oov.txt is set to "<SPOKEN_NOISE>".
Regarding "<s>" and "</s>", they are the beginning-of-sentence and
end-of-sentence symbols. They should not appear in the lexicon; they
appear in the language model but they get removed by the time you
compose with the lexicon.
Dan
> --
> You received this message because you are subscribed to the Google Groups
> "kaldi-help" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to
kaldi-help+...@googlegroups.com.
> For more options, visit
https://groups.google.com/d/optout.