--
Go to http://kaldi-asr.org/forums.html find out how to join
---
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/63eb618f-87f6-4798-93d7-d06b2506dbbe%40googlegroups.com.
I just created a PR here https://github.com/kaldi-asr/kaldi/pull/3658 where I added some options to `tdnnf-layer` (not tested). Can you try adding `context=left-only` or `context=shift-left` to some of the TDNN-F layers (presumably the ones with stride 3) and let us know how the results change?
On Tue, Oct 15, 2019 at 7:16 AM David van Leeuwen <david.v...@gmail.com> wrote:
Hello,

In moving from the very old nnet2 multisplice training scripts to the more recent nnet3 tdnn_1{c,d} scripts, performance on, e.g., Librispeech has improved a lot, and that is great! However, I've noticed that the right context has also gone up from 40 ms to 400 ms over the same progression of network configurations.

I've tried to study the configurations of the networks, and I think the network specification syntax has also progressed over the years. It seems that in librispeech `local/chain/tuning/run_tdnn_1d.sh` the context is indicated in `tdnnf-layer` statements with a `time-stride=$n` option, which is then converted to `.linear` layers with time-offsets `-$n,0` and `.affine` layers with time-offsets `0,$n`. I gather these have the combined effect of a symmetric context {-$n,0,$n}. All in all, this results in an overall (left, right) context of (40, 40) frames.

If I wanted to reduce the latency for the chain models (probably giving up some ASR accuracy) and specify asymmetric contexts, as in `local/nnet3/tun_tdnn.sh`, what would be my best approach with the nnet configuration tools?

Cheers,
david
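The context arithmetic described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper names, the example stride list, and the 10 ms frame shift are hypothetical and are not taken from the Kaldi scripts, and the `left_only_tdnnf` offsets are a guess at what an asymmetric layer might look like, not the behaviour of any actual `tdnnf-layer` option.

```python
from typing import List, Tuple

Offsets = List[int]  # time offsets of one sub-layer, e.g. [-3, 0]


def sublayer_context(offsets: Offsets) -> Tuple[int, int]:
    """Return (left, right) context in frames for one set of time offsets."""
    return max(0, -min(offsets)), max(0, max(offsets))


def total_context(sublayers: List[Offsets]) -> Tuple[int, int]:
    """Contexts of stacked sub-layers add up, left and right separately."""
    left = right = 0
    for offsets in sublayers:
        l, r = sublayer_context(offsets)
        left += l
        right += r
    return left, right


def symmetric_tdnnf(stride: int) -> List[Offsets]:
    """Linear offsets -n,0 followed by affine offsets 0,n, as in the message."""
    return [[-stride, 0], [0, stride]]


def left_only_tdnnf(stride: int) -> List[Offsets]:
    """ASSUMPTION: a hypothetical asymmetric layer with no look-ahead."""
    return [[-stride, 0], [0]]


def frames_to_ms(frames: int, frame_shift_ms: int = 10) -> int:
    """ASSUMPTION: 10 ms frame shift, so 40 frames correspond to 400 ms."""
    return frames * frame_shift_ms


# Hypothetical stack: two stride-1 layers followed by two stride-3 layers.
strides = [1, 1, 3, 3]

symmetric = [s for n in strides for s in symmetric_tdnnf(n)]
asymmetric = [
    s
    for n in strides
    for s in (left_only_tdnnf(n) if n == 3 else symmetric_tdnnf(n))
]

print(total_context(symmetric))   # (left, right) for the symmetric stack
print(total_context(asymmetric))  # right context shrinks, left is unchanged
```

Only the layers that look ahead contribute to latency, so making the stride-3 layers one-sided removes most of the right context while leaving the left context as it was.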
Hi,

Results with `context=shift-left` for all `time-stride=3` layers are somewhat worse than the symmetric configuration, but the latency is down to a snappy 40 ms.

Here is a comparison with its symmetric counterpart, a scaled-down version of librispeech tdnn_1d (narrower layers: bottleneck 128, wide 1024):