Setting bias to forget gates in LSTM networks and data contained in blobs

97 views
Skip to first unread message

Marshall Worth

unread,
May 3, 2017, 4:39:07 PM5/3/17
to Caffe Users
All, I have been looking over the following example

http://christopher5106.github.io/deep/learning/2016/06/07/recurrent-neural-net-with-Caffe.html

and I am trying to figure out this line

Set the bias to the forget gate to 5.0 as explained in the clockwork RNN paper
solver.net.params['lstm1'][2].data[15:30]=5

Is there any documentation that shows what is contained in that location, why did he choose [15:30]? How did he know to that the bias to the forget gates was there? Is the data stacked with all of the weight, bias, etc information? What order is it in?

What are in these locations
solver.net.params['lstm1'][0]
solver.net.params['lstm1'][1]
solver.net.params['lstm1'][2]

I ran across an include file that stated it could be the input and output blobs of 'lstm1'?

Thanks for your help, if you know of any relevant locations that explain some more detailed documentation of the LSTM network I would greatly appreciate it.

Rsaeed

unread,
Apr 4, 2018, 4:28:22 AM4/4/18
to Caffe Users
Hi,

I was following the same tutorial but unable to make it work. Please let me know if it was working for you

Thanks
Reply all
Reply to author
Forward
0 new messages