暂无分享,去创建一个
Joel Z. Leibo | Demis Hassabis | Koray Kavukcuoglu | David Amos | David Silver | Piotr W. Mirowski | David Saxton | Arun Ahuja | Greg Wayne | Chia-Chun Hung | Shakir Mohamed | Matthew Botvinick | Tim Harley | Mehdi Mirza | Danilo Jimenez Rezende | Timothy P. Lillicrap | Adam Santoro | Josh Abramson | Jack W. Rae | Adam Cain | Malcolm Reynolds | Agnieszka Grabska-Barwinska | Mevlana Gemici | Chloe Hillier | T. Lillicrap | K. Kavukcuoglu | D. Hassabis | D. Silver | Greg Wayne | D. Saxton | Arun Ahuja | Adam Santoro | M. Botvinick | M. Mirza | Tim Harley | S. Mohamed | Chia-Chun Hung | Josh Abramson | David Amos | Agnieszka Grabska-Barwinska | P. Mirowski | Mevlana Gemici | Malcolm Reynolds | Adam Cain | Chloe Hillier | Mehdi Mirza | David Silver | Piotr Wojciech Mirowski | A. Grabska-Barwinska
[1] Michael I. Jordan,et al. Advances in Neural Information Processing Systems 30 , 1995 .
[2] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[3] P. Dudchenko. The hippocampus as a cognitive map , 2010 .
[4] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[5] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[6] Jason Weston,et al. Memory Networks , 2014, ICLR.
[7] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[8] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[9] Honglak Lee,et al. Control of Memory, Active Perception, and Action in Minecraft , 2016, ICML.
[10] Sergey Levine,et al. High-Dimensional Continuous Control Using Generalized Advantage Estimation , 2015, ICLR.
[11] Francesco Visin,et al. A guide to convolution arithmetic for deep learning , 2016, ArXiv.
[12] Demis Hassabis,et al. Grounded Language Learning in a Simulated 3D World , 2017, ArXiv.
[13] Zeb Kurth-Nelson,et al. Learning to reinforcement learn , 2016, CogSci.
[14] Marcin Andrychowicz,et al. One-Shot Imitation Learning , 2017, NIPS.
[15] David Amos,et al. Generative Temporal Models with Memory , 2017, ArXiv.
[16] Razvan Pascanu,et al. Learning to Navigate in Complex Environments , 2016, ICLR.
[17] Tom Schaul,et al. Reinforcement Learning with Unsupervised Auxiliary Tasks , 2016, ICLR.