There are several working Keras DQN implementations on the
internet, search for "keras dqn". For example this one seems
pretty straightforward: https://keon.io/deep-q-learning/. Although
it supports only simple CartPole environment, it should be easy to
modify it for Atari.
Tambet
I have been trying to replicate the NIPS (2013) results using a Keras implementation. I use the same hyperparameters as in the deep_q_rl NIPS implementation. But so far, the agent doesn't seem to learn at all. The Q values and rewards do not increase in Pong and Breakout, which are the two games I have tried so far.--
After extensive attempts at debugging for over a month, I am now starting to question whether Keras is at the root of the problem, as other parts of the code seems to be in line with deep_q_rl and the NIPS paper. In particular, I suspect that the problem might be with the way Keras implements RMSProp. But I am not sure if this is the actual reason.
Has anyone here tried to implement DQN using Keras? If yes, was the implementation successful at learning?
You received this message because you are subscribed to the Google Groups "Deep Q-Learning" group.
To unsubscribe from this group and stop receiving emails from it, send an email to deep-q-learni...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/deep-q-learning/1a61a88f-1907-4039-9a4b-3b4e15f83532%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
model.compile(optimizer=Adam(lr=1e-6), loss="mse")
MEMORY_SIZE = 750000
experience = collections.deque(maxlen=MEMORY_SIZE)
Hi Neal. Perhaps we can help each other and figure out what's going wrong in our code. I have been at this for more than a month now - still no luck. We can discuss further over Skype if you want.The NIPS architecture has 2 convolutional layers with 16 and 32 filters, followed by a fully connected layer with 256 units, followed by a fully connected layer for the Q values. It uses experience replay but no target networks. Is this your architecture as well?
--
You received this message because you are subscribed to a topic in the Google Groups "Deep Q-Learning" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/deep-q-learning/8-7ECtWIyX0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to deep-q-learni...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/deep-q-learning/eba6eaf3-f7d0-4679-856e-984714d71e9f%40googlegroups.com.