Generating sine wave with LSTM: results not satisfactory

Derk

Oct 19, 2016, 9:17:06 AM
to torch7
I have a project where I want to do sequence generation with RNNs. To start, I want to generate a simple sequence with an LSTM network, namely sin(x). The goal is that sampling from the network gives the next sample on the sine wave. This looks to me like a simple task.

I generate the sine wave data with 
data = torch.Tensor(1001):range(1,1001):apply(math.rad):apply(math.sin)
input  = data[{{1,1000}}]
target = data[{{2,1001}}]

Data batches are filled with sequences of length 50, taken at random offsets:
for k = 1, batch_size do
   inputs[{{}, {k}}]  = input[{{shuffle[k], shuffle[k]+seq_length-1}}]
   targets[{{}, {k}}] = target[{{shuffle[k], shuffle[k]+seq_length-1}}]
end
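
For context, the setup around that loop is roughly like this (the batch size and variable names here are assumptions, not my exact code; the point is that nn.Sequencer expects tensors shaped seq_length x batch_size x no_param):

-- Assumed allocation for the batch-filling loop above
local seq_length, batch_size, no_param = 50, 32, 1        -- batch_size is a guess
local shuffle = torch.randperm(input:size(1) - seq_length + 1)  -- random offsets
local inputs  = torch.Tensor(seq_length, batch_size, no_param)
local targets = torch.Tensor(seq_length, batch_size, no_param)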


The network looks like this:
lstm = nn.Sequencer(
   nn.Sequential()
      :add(nn.FastLSTM(no_param, 100))
      :add(nn.FastLSTM(100, 100))
      :add(nn.Linear(100, no_param))
      :add(nn.Tanh())
)

criterion = nn.SequencerCriterion(nn.MSECriterion())
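
As a quick sanity check on shapes (just an illustration, not part of the training code): nn.Sequencer splits a tensor along its first dimension, so a seq_length x batch_size x no_param input gives an output of the same size, one prediction per time step:

require 'rnn'

local dummy = torch.randn(50, 32, 1)   -- seq_length x batch_size x no_param
local out = lstm:forward(dummy)
print(out:size())                      -- 50 x 32 x 1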

Training is done for a number of epochs with the Adam optimizer and it looks like the loss is converging to a low value:

Epoch: 489, error: 6.2851066044103e-05
Epoch: 490, error: 5.619043210411e-05
Epoch: 491, error: 4.9503041066306e-05
Epoch: 492, error: 9.3122510279393e-05
Epoch: 493, error: 5.871072009701e-05
Epoch: 494, error: 5.4363651393891e-05
Epoch: 495, error: 3.6511985437243e-05
Epoch: 496, error: 4.3361181429568e-05
Epoch: 497, error: 4.2539467036372e-05
Epoch: 498, error: 6.5783935830644e-05
Epoch: 499, error: 3.9669659029783e-05
Epoch: 500, error: 2.1134164055749e-05
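
The training loop behind that log looks roughly like the sketch below (the learning rate and the way batches are refilled are simplified here; only the forward/backward pattern with optim.adam is the point):

require 'optim'

local params, gradParams = lstm:getParameters()
local optimState = {learningRate = 1e-3}   -- learning rate is an assumption

for epoch = 1, 500 do
   -- (re)fill inputs/targets with random sequence offsets, as in the loop above
   local feval = function(p)
      if p ~= params then params:copy(p) end
      gradParams:zero()
      local outputs = lstm:forward(inputs)
      local err = criterion:forward(outputs, targets)
      lstm:backward(inputs, criterion:backward(outputs, targets))
      return err, gradParams
   end
   local _, loss = optim.adam(feval, params, optimState)
   print(string.format('Epoch: %d, error: %g', epoch, loss[1]))
end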

Now I sample from the network. First the hidden state is warmed up with 10 points, and then I sample the next 50 points in the sequence.

lstm:forget() -- set hidden state to zero
local x = torch.Tensor(1,1,1):zero()

-- warm up the lstm
for i = 1, warmup do
   x[1][1][1] = data[start+i-1]
   print(x[1][1][1])
   lstm:forward(x)
end

print('output')
output = torch.Tensor(N)
for i = 1, N do
   local x_next = lstm:forward(x)
   output[i] = x_next:squeeze()
   x = x_next
end


The sampled results look like the plot below: in red the warm-up points, and in green the points sampled from the network.


The result does not look satisfactory, although I think this should be a fairly easy task for an LSTM network. What am I doing wrong? All suggestions are welcome.

Derk

Oct 21, 2016, 8:07:17 AM
to torch7
Okay, the problem was that the state of the LSTM was forgotten after every forward call. nn.Sequencer() does this implicitly, so a call to lstm:remember() solves it.
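
Concretely, the sampling part then looks roughly like this ('both' is the default mode for remember(); the clone() is just a precaution against reusing the module's output buffer as its input):

lstm:remember('both')   -- keep hidden state across forward calls
lstm:forget()           -- reset the state once, before the warm-up

for i = 1, warmup do
   x[1][1][1] = data[start+i-1]
   lstm:forward(x)
end

output = torch.Tensor(N)
for i = 1, N do
   local x_next = lstm:forward(x)
   output[i] = x_next:squeeze()
   x = x_next:clone()   -- clone so the input is not the module's own output buffer
end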

On Wednesday, October 19, 2016 at 15:17:06 UTC+2, Derk wrote: