暂无分享,去创建一个
Dileep George | Tom Silver | Miguel Lázaro-Gredilla | Nimrod Dorfman | Mohamed Eldawy | Szymon Sidor | D. Scott Phoenix | David A. Mély | Xinghua Lou | Ken Kansky | Szymon Sidor | D. George | D. Phoenix | M. Lázaro-Gredilla | Ken Kansky | Tom Silver | Mohamed Eldawy | Xinghua Lou | N. Dorfman
[1] A. Huxley. Themes And Variations , 1950 .
[2] John R. Anderson. Cognitive Psychology and Its Implications , 1980 .
[3] Gary L. Drescher,et al. Made-up minds - a constructivist approach to artificial intelligence , 1991 .
[4] W. Weiten,et al. Psychology: Themes and Variations , 1991 .
[5] Carlos Guestrin,et al. Generalizing plans to new environments in relational MDPs , 2003, IJCAI 2003.
[6] Shobha Venkataraman,et al. Efficient Solution Algorithms for Factored MDPs , 2003, J. Artif. Intell. Res..
[7] Hagai Attias,et al. Planning by Probabilistic Inference , 2003, AISTATS.
[8] Andre Cohen,et al. An object-oriented representation for efficient reinforcement learning , 2008, ICML '08.
[9] L. Williams,et al. Contents , 2020, Ophthalmology (Rochester, Minn.).
[10] Peter Stone,et al. Transfer Learning for Reinforcement Learning Domains: A Survey , 2009, J. Mach. Learn. Res..
[11] Nir Friedman,et al. Probabilistic Graphical Models - Principles and Techniques , 2009 .
[12] Tommi S. Jaakkola,et al. Learning Bayesian Network Structure using LP Relaxations , 2010, AISTATS.
[13] Qiang Liu,et al. Variational Planning for Graph-based MDPs , 2013, NIPS.
[14] David Wingate,et al. A Physics-Based Model Prior for Object-Oriented MDPs , 2014, ICML.
[15] Martin A. Riedmiller,et al. Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images , 2015, NIPS.
[16] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[17] Murray Shanahan,et al. Towards Deep Symbolic Reinforcement Learning , 2016, ArXiv.
[18] David Silver,et al. Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.
[19] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[20] Pieter Abbeel,et al. Value Iteration Networks , 2016, NIPS.
[21] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[22] Razvan Pascanu,et al. Interaction Networks for Learning about Objects, Relations and Physics , 2016, NIPS.
[23] Nicolas Usunier,et al. Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks , 2016, ArXiv.
[24] Tom Schaul,et al. The Predictron: End-To-End Learning and Planning , 2016, ICML.
[25] Joshua B. Tenenbaum,et al. A Compositional Object-Based Approach to Learning Physical Dynamics , 2016, ICLR.
[26] Tom Schaul,et al. Reinforcement Learning with Unsupervised Auxiliary Tasks , 2016, ICLR.