论文信息 - Human Learning in Atari - 字舞流文

Human Learning in Atari

Atari games are an excellent testbed for studying intelligent behavior, as they offer a range of tasks that differ widely in their visual representation, game dynamics, and goals presented to an agent. The last two years have seen a spate of research into artificial agents that use a single algorithm to learn to play these games. The best of these artificial agents perform at better-than-human levels on most games, but require hundreds of hours of game-play experience to produce such behavior. Humans, on the other hand, can learn to perform well on these tasks in a matter of minutes. In this paper we present data on human learning trajectories for several Atari games, and test several hypotheses about the mechanisms that lead to such rapid learning.

Joshua B. Tenenbaum | Samuel Gershman | Pedro Tsividis | Thomas Pouncy | Jaqueline L. Xu | J. Tenenbaum | S. Gershman | Pedro Tsividis | Thomas Pouncy | J. L. Xu

[1] L. Rips. Inductive judgments about natural categories. , 1975 .

[2] D. Medin,et al. The role of theories in conceptual coherence. , 1985, Psychological review.

[3] S. Carey. Conceptual Change in Childhood , 1985 .

[4] Linda B. Smith,et al. The importance of shape in early lexical learning , 1988 .

[5] Ellen M. Markman,et al. Categorization and Naming in Children: Problems of Induction , 1989 .

[6] Elizabeth S. Spelke,et al. Principles of Object Perception , 1990, Cogn. Sci..

[7] T. B. Ward. Structured Imagination: the Role of Category Structure in Exemplar Generation , 1994, Cognitive Psychology.

[8] B. Ross,et al. Predictions From Uncertain Categorizations , 1994, Cognitive Psychology.

[9] S. Carey,et al. Whose gaze will infants follow? The elicitation of gaze-following in 12-month-olds , 1998 .

[10] P. Bloom. How children learn the meanings of words , 2000 .

[11] Patrice D. Tremoulet,et al. Perception of Animacy from the Motion of a Single Object , 2000, Perception.

[12] György Gergely,et al. One-year-old infants use teleological representations of actions productively , 2003, Cogn. Sci..

[13] R. Baillargeon. Infants' Physical World , 2004 .

[14] A. Schlottmann,et al. Perceived physical and social causality in animated motions: spontaneous reports and ratings. , 2006, Acta psychologica.

[15] Katherine D. Kinzler,et al. Core knowledge. , 2007, Developmental science.

[16] J. Tenenbaum,et al. Word learning as Bayesian inference. , 2007, Psychological review.

[17] G. Csibra. Goal attribution to inanimate agents by 6.5-month-old infants , 2008, Cognition.

[18] R. Baillargeon,et al. An Account of Infants' Physical Reasoning , 2008 .

[19] T. Lombrozo. Explanation and categorization: How “why?” informs “what?” , 2009, Cognition.

[20] Joseph Jay Williams,et al. The role of explanation in discovery and generalization: evidence from category learning , 2010, ICLS.

[21] Charles Kemp,et al. A probabilistic account of exemplar and category generation , 2013, Cognitive Psychology.

[22] Noah D. Goodman,et al. The mentalistic basis of core social cognition: experiments in preverbal infants and a computational model. , 2013, Developmental science.

[23] Honglak Lee,et al. Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning , 2014, NIPS.

[24] Susan J. Hespos,et al. Divisions of the physical world: Concepts of objects and substances. , 2015, Psychological bulletin.

[25] Sergey Levine,et al. Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models , 2015, ArXiv.

[26] Joshua B. Tenenbaum,et al. Human-level concept learning through probabilistic program induction , 2015, Science.

[27] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.

[28] David Silver,et al. Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[29] Joshua B. Tenenbaum,et al. Building machines that learn and think like people , 2016, Behavioral and Brain Sciences.

[30] Tom Schaul,et al. Prioritized Experience Replay , 2015, ICLR.