暂无分享,去创建一个
Daan Wierstra | Shakir Mohamed | Silvia Chiappa | Sébastien Racanière | Daan Wierstra | S. Mohamed | S. Chiappa | Sébastien Racanière | S. Racanière
[1] Ronald J. Williams,et al. Gradient-based learning algorithms for recurrent networks and their computational complexity , 1995 .
[2] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[3] Richard S. Sutton,et al. Predictive Representations of State , 2001, NIPS.
[4] A. Noë,et al. A sensorimotor account of vision and visual consciousness. , 2001, The Behavioral and brain sciences.
[5] Peter Auer,et al. Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.
[6] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[7] Christos Dimitrakakis,et al. TORCS, The Open Racing Car Simulator , 2005 .
[8] Peter Dayan,et al. Hippocampal Contributions to Control: The Third Way , 2007, NIPS.
[9] Pierre-Yves Oudeyer,et al. Intrinsic Motivation Systems for Autonomous Mental Development , 2007, IEEE Transactions on Evolutionary Computation.
[10] Y. Niv. Reinforcement learning in the brain , 2009 .
[11] Alex Graves,et al. Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.
[12] Erik Talvitie,et al. Model Regularization for Stable Sample Rollouts , 2014, UAI.
[13] Wojciech Zaremba,et al. Recurrent Neural Network Regularization , 2014, ArXiv.
[14] Thomas B. Schön,et al. From Pixels to Torques: Policy Learning with Deep Dynamical Models , 2015, ICML 2015.
[15] Nitish Srivastava,et al. Unsupervised Learning of Video Representations using LSTMs , 2015, ICML.
[16] Tianqi Chen,et al. Empirical Evaluation of Rectified Activations in Convolutional Network , 2015, ArXiv.
[17] Viorica Patraucean,et al. Spatio-temporal video autoencoder with differentiable memory , 2015, ArXiv.
[18] Martin A. Riedmiller,et al. Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images , 2015, NIPS.
[19] Samy Bengio,et al. Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks , 2015, NIPS.
[20] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[21] Honglak Lee,et al. Action-Conditional Video Prediction using Deep Networks in Atari Games , 2015, NIPS.
[22] Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents (Extended Abstract) , 2012, IJCAI.
[23] Byron Boots,et al. Learning to Filter with Predictive State Inference Machines , 2015, ICML.
[24] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.