Reinforcement learning & game playing Mnih, V., Kavukcuoglu, K., Silver, D., et al. Playing Atari with deep reinforcement learning. NIPS Deep Learning Workshop, 2013. Mnih, V., Kavukcuoglu, K., Silver, D., et al. Human-level control through deep reinforcement learning. Nature 518, 529–533, 2015. demo http://cs.stanford.edu/people/karpathy/convnetjs/demo/rldemo.html Guo, X., Singh, S., Lee, H., et al. Deep learning for real-time Atari game play using offline Monte-Carlo tree search planning. NIPS 2014. Chris J. Maddison, Aja Huang, Ilya Sutskever, David Silver. Move Evaluation in Go Using Deep Convolutional Neural Networks. arXiv:1412.6564, 2014. Bakker, B. Reinforcement learning with Long Short-Term Memory. NIPS 2002. Koutnik J., Cuccu G., Schmidhuber J., Gomez F. Evolving Large-Scale Neural Networks for Vision-Based Reinforcement Learning. In Proc. Genetic and Evolutionary Computation Conference (GECCO), 2013.
RNN / LSTM & sequence learning Hochreiter, S. and Schmidhuber, J. Long Short-Term Memory. Neural Computation, 9(8):1735–1780. 1997. Gers, A. and Schmidhuber, J. LSTM Recurrent Networks Learn Simple Context Free and Context Sensitive Languages. IEEE Transactions on Neural Networks 12(6):1333-1340, 2001. Sutskever, Ilya, Vinyals, Oriol, and Le, Quoc. Sequence to sequence learning with neural networks. NIPS 2014. Vinyals, Oriol, Toshev, Alexander, Bengio, Samy, and Erhan, Dumitru. Show and tell: A neural image caption generator. arXiv:1411.4555, November 2014.
Attention Mnih, V., Heess, N., Graves, A., et al. Recurrent models of visual attention. NIPS 2014. Ba, Jimmy, Mnih, Volodymyr, and Kavukcuoglu, Koray. Multiple object recognition with visual attention. arXiv preprint arXiv:1412.7755, 2014. Bahdanau, Dzmitry, Cho, Kyunghyun, and Bengio, Yoshua. Neural machine translation by jointly learning to align and translate. arXiv:1409.0473, September 2014. Gregor, K., Danihelka, I., Graves, A., Wierstra, D. DRAW: A Recurrent Neural Network For Image Generation arXiv:1502.04623, 2015.
Program learning, long-term memory and metalearning (learning to learn) Weston, Jason, Chopra, Sumit, and Bordes, Antoine. Memory networks. arXiv:1410.3916, 2014. Graves, Alex, Wayne, Greg, and Danihelka, Ivo. Neural turing machines. arXiv:1410.5401, 2014. Schmidhuber, J. An introspective network that can learn to run its own weight change algorithm. ICANN 1993. Hochreiter, S., Younger, A. S., Conwell, P. R. Learning to Learn Using Gradient Descent. ICANN 2001