W H Warren,et al. The Way the Ball Bounces: Visual and Auditory Perception of Elasticity and Control of the Bounce Pass , 1987, Perception.
 Manfred Morari,et al. Model predictive control: Theory and practice - A survey , 1989, Autom..
 Richard S. Sutton,et al. Dyna, an integrated architecture for learning, planning, and reacting , 1990, SGAR.
 P. Frensch,et al. Complex problem solving : the European perspective , 1995 .
 Jürgen Schmidhuber,et al. Reinforcement Learning with Self-Modifying Policies , 1998, Learning to Learn.
 Ronald J. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning , 2004, Machine Learning.
 Lydia M. Hopper,et al. Observational learning of tool use in children: Investigating cultural spread through diffusion chains and learning mechanisms through ghost displays. , 2010, Journal of experimental child psychology.
 Carl E. Rasmussen,et al. PILCO: A Model-Based and Data-Efficient Approach to Policy Search , 2011, ICML.
 James N. MacGregor,et al. Human Performance on Insight Problem Solving: A Review , 2011, J. Probl. Solving.
 Jessica B. Hamrick,et al. Simulation as an engine of physical scene understanding , 2013, Proceedings of the National Academy of Sciences.
 Vikash K. Mansinghka,et al. Reconciling intuitive physics and Newtonian mechanics for colliding objects. , 2013, Psychological review.
 A. Markman,et al. Retrospective revaluation in sequential decision making: a tale of two systems. , 2014, Journal of experimental psychology. General.
 Joshua B. Tenenbaum,et al. How, whether, why: Causal judgments as counterfactual contrasts , 2015, CogSci.
 Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
 Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents (Extended Abstract) , 2012, IJCAI.
 Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
 Jae Hee Lee,et al. Hole in One: Using Qualitative Reasoning for Solving Hard Physical Puzzle Problems , 2016, ECAI.
 Alejandra Pascual-Garrido,et al. Wild capuchin monkeys adjust stone tools according to changing nut properties , 2016, Scientific reports.
 Sergey Levine,et al. One-shot learning of manipulation skills with online dynamics adaptation and neural network priors , 2015, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
 Sergey Levine,et al. Model-based reinforcement learning with parametrized physical models and optimism-driven exploration , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).
 Razvan Pascanu,et al. Interaction Networks for Learning about Objects, Relations and Physics , 2016, NIPS.
 François Osiurak,et al. Tool use and affordance: Manipulation-based versus reasoning-based approaches. , 2016, Psychological review.
 Anna Coenen,et al. psiTurk: An open-source framework for conducting replicable behavioral experiments online , 2016, Behavior research methods.
 Razvan Pascanu,et al. Imagination-Augmented Agents for Deep Reinforcement Learning , 2017, NIPS.
 Samuel Gershman,et al. Imaginative Reinforcement Learning: Computational Principles and Neural Mechanisms , 2017, Journal of Cognitive Neuroscience.
 Joshua B. Tenenbaum,et al. A Compositional Object-Based Approach to Learning Physical Dynamics , 2016, ICLR.
 Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.
 Marc Toussaint,et al. Differentiable Physics and Stable Modes for Tool-Use and Manipulation Planning , 2018, Robotics: Science and Systems.
 Joshua B. Tenenbaum,et al. Learning to act by integrating mental simulations and physical experiments , 2018, CogSci.
 Jiajun Wu,et al. Neurocomputational Modeling of Human Physical Scene Understanding , 2018 .
 Jessica B. Hamrick,et al. Relational inductive bias for physical construction in humans and machines , 2018, CogSci.
 Joel Z. Leibo,et al. Prefrontal cortex as a meta-reinforcement learning system , 2018, Nature Neuroscience.
 Erik Talvitie,et al. The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces , 2018, ArXiv.
 Silvio Savarese,et al. Learning Task-Oriented Grasping for Tool Manipulation from Simulated Self-Supervision , 2018, Robotics: Science and Systems.
 J. Randall Flanagan,et al. Multiple motor memories are learned to control different points on a tool , 2018, Nature Human Behaviour.
 John Schulman,et al. Gotta Learn Fast: A New Benchmark for Generalization in RL , 2018, ArXiv.
 Jim Fleming,et al. Reasoning and Generalization in RL: A Tool Use Perspective , 2019, ArXiv.
 Tania Lombrozo,et al. “Learning by Thinking” in Science and in Everyday Life , 2019 .
 Patrick van der Smagt,et al. Switching Linear Dynamics for Variational Bayes Filtering , 2019, ICML.
 Sergey Levine,et al. Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables , 2019, ICML.
 Sergey Levine,et al. Reasoning About Physical Interactions with Object-Oriented Prediction and Planning , 2018, ICLR.
 Alexei A. Efros,et al. Time-Agnostic Prediction: Predicting Predictable Video Frames , 2018, ICLR.
 Joshua B. Tenenbaum,et al. The Tools Challenge: Rapid Trial-and-Error Learning in Physical Problem Solving , 2019, CogSci.
 Sergey Levine,et al. Improvisation through Physical Understanding: Using Novel Objects as Tools with Visual Foresight , 2019, Robotics: Science and Systems.
 Taehoon Kim,et al. Quantifying Generalization in Reinforcement Learning , 2019, ICML.
 Keyframing the Future: Keyframe Discovery for Visual Prediction and Planning , 2019, L4DC.
 Oliver Kroemer,et al. A Review of Robot Learning for Manipulation: Challenges, Representations, and Algorithms , 2019, ArXiv.