Building Machines that Learn and Think for Themselves: Commentary on Lake et al., Behavioral and Brain Sciences, 2017

Abstract: We agree with Lake and colleagues on their list of “key ingredients” for building human-like intelligence, including the idea that model-based reasoning is essential. However, we favor an approach that centers on one additional ingredient: autonomy. In particular, we aim toward agents that can both build and exploit their own internal models, with minimal human hand engineering. We believe an approach centered on autonomous learning has the greatest chance of success as we scale toward real-world complexity, tackling domains for which ready-made formal models are not available. Here, we survey several important examples of the progress that has been made toward building autonomous agents with human-like abilities, and highlight some outstanding challenges.

Tom Schaul | Nando de Freitas | Shane Legg | Christopher Summerfield | Joel Z. Leibo | Demis Hassabis | Daan Wierstra | T. Weber | Dharshan Kumaran | Greg Wayne | Matthew Botvinick | Danilo Jimenez Rezende | Joseph Modayil | Adam Santoro | Peter W. Battaglia | David G. T. Barrett | S. Mohamed | Neil C. Rabinowitz | Tim Lillicrap
