Read this PDF for an example (see section LSTM Initialization): https://arxiv.org/pdf/1707.00722.pdf
On Saturday, February 17, 2018 at 8:43:43 AM UTC+1, mirfan.ms...@seecs.edu.pk wrote:I want to know which weights initialization method is used is it randomization or something else?
--
Go to http://kaldi-asr.org/forums.html find out how to join
---
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+unsubscribe@googlegroups.com.
To post to this group, send email to kaldi...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/65791622-bf56-4f85-81a3-a7d5b30cfad9%40googlegroups.com.
# ng-affine-options='' [Additional options used for the full matrices in the LSTM, can be used to do things like set biases to initialize to 1]
# ng-per-element-scale-options='' [Additional options used for the diagonal matrices in the LSTM ]
def set_default_configs(self):
self.config = {'input':'[-1]',
'cell-dim' : -1, # this is a compulsory argument
'clipping-threshold' : 30.0,
'delay' : -1,
'ng-per-element-scale-options' : ' max-change=0.75',
'ng-affine-options' : ' max-change=0.75 ',
'self-repair-scale-nonlinearity' : 0.00001,
'zeroing-interval' : 20,
'zeroing-threshold' : 15.0,
'decay-time': -1.0
}
I'm confused here. I think normal distribution is being used for weight and 1 for bias vectors. Can you tell which method is used?
We pretty just use the standard glorot initialization (stddev of each weight matrix's parameters = 1/sqrt(input dimension)). The output layer of the network has zero initialization though.The rules for the bias initialization and peephole parameter initialization, I don't recall; you'd have to check the xconfig scripts (xconfig/lstm.py) or the generated configs files.Dan
On Sat, Feb 17, 2018 at 3:20 PM, Zoltán Somogyi <zsomo...@gmail.com> wrote:
Read this PDF for an example (see section LSTM Initialization): https://arxiv.org/pdf/1707.00722.pdf
On Saturday, February 17, 2018 at 8:43:43 AM UTC+1, mirfan.ms...@seecs.edu.pk wrote:I want to know which weights initialization method is used is it randomization or something else?
--
Go to http://kaldi-asr.org/forums.html find out how to join
---
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+...@googlegroups.com.
To post to this group, send email to kaldi...@googlegroups.com.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+unsubscribe@googlegroups.com.
To post to this group, send email to kaldi...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/07a9661b-8917-429f-8c29-4cae43f27155%40googlegroups.com.