Machine Theory of Mind

Theory of mind (ToM; Premack & Woodruff, 1978) broadly refers to humans' ability to represent the mental states of others, including their desires, beliefs, and intentions. We propose to train a machine to build such models too. We design a Theory of Mind neural network -- a ToMnet -- which uses meta-learning to build models of the agents it encounters, from observations of their behaviour alone. Through this process, it acquires a strong prior model for agents' behaviour, as well as the ability to bootstrap to richer predictions about agents' characteristics and mental states using only a small number of behavioural observations. We apply the ToMnet to agents behaving in simple gridworld environments, showing that it learns to model random, algorithmic, and deep reinforcement learning agents from varied populations, and that it passes classic ToM tasks such as the "Sally-Anne" test (Wimmer & Perner, 1983; Baron-Cohen et al., 1985) of recognising that others can hold false beliefs about the world. We argue that this system -- which autonomously learns how to model other agents in its world -- is an important step forward for developing multi-agent AI systems, for building intermediating technology for machine-human interaction, and for advancing progress on interpretable AI.
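The idea of combining a prior learned from a population of agents with a handful of observations of a new agent can be illustrated with a toy sketch. This is not the ToMnet, which is a neural network trained by meta-learning; it is a deliberately simple count-based stand-in (a Dirichlet-multinomial model), and the action set, function names, and `strength` parameter are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical gridworld action set (an assumption, not taken from the paper).
ACTIONS = ["up", "down", "left", "right", "stay"]

def fit_population_prior(population_trajectories, strength=2.0):
    """Pool actions across many agents into Dirichlet pseudo-counts (the prior)."""
    counts = Counter(a for traj in population_trajectories for a in traj)
    total = sum(counts.values())
    # Add-one smoothing, then rescale so the whole prior is worth
    # `strength` pseudo-observations in total.
    return {a: strength * (counts[a] + 1) / (total + len(ACTIONS)) for a in ACTIONS}

def predict_next_action(prior, observed_actions):
    """Posterior predictive over a new agent's next action after a few observations."""
    counts = Counter(observed_actions)
    denom = sum(prior.values()) + len(observed_actions)
    return {a: (prior[a] + counts[a]) / denom for a in ACTIONS}

# Behaviour of previously seen agents (toy data).
population = [
    ["up", "up", "left"],
    ["right", "right", "stay"],
    ["down", "up", "right"],
]
prior = fit_population_prior(population)

# With no observations the prediction is just the normalised prior;
# three observations of a new agent are enough to shift it noticeably.
no_obs = predict_next_action(prior, [])
few_obs = predict_next_action(prior, ["left", "left", "left"])
```

In this sketch the prior plays the role of general knowledge about the population, while the short observation list plays the role of the few behavioural observations of a specific agent; a learned model would replace both the counting and the update rule.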

[1]  Sarit Kraus, et al. Making friends on the fly: Cooperating with new teammates, 2017, Artif. Intell.

[2]  Max Welling, et al. Auto-Encoding Variational Bayes, 2013, ICLR.

[3]  Ricardo Vilalta, et al. A Perspective View and Survey of Meta-Learning, 2002, Artificial Intelligence Review.

[4]  Anca D. Dragan, et al. Should Robots be Obedient?, 2017, IJCAI.

[5]  M. Nowak. Five Rules for the Evolution of Cooperation, 2006, Science.

[6]  Daan Wierstra, et al. Stochastic Backpropagation and Approximate Inference in Deep Generative Models, 2014, ICML.

[7]  Noah D. Goodman, et al. Learning the Preferences of Ignorant, Inconsistent Agents, 2015, AAAI.

[8]  Joshua B. Tenenbaum, et al. Bayesian Theory of Mind: Modeling Joint Belief-Desire Attribution, 2011, CogSci.

[9]  Joanna M. Dally, et al. Social cognition by food-caching corvids. The western scrub-jay as a natural psychologist, 2007, Philosophical Transactions of the Royal Society B: Biological Sciences.

[10]  Z. Nadasdy, et al. Taking the intentional stance at 12 months of age, 1995, Cognition.

[11]  Joshua B. Tenenbaum, et al. Ten-month-old infants infer the value of goals from the costs of actions, 2017, Science.

[12]  Frans A. Oliehoek, et al. A Concise Introduction to Decentralized POMDPs, 2016, SpringerBriefs in Intelligent Systems.

[13]  Anca D. Dragan, et al. Pragmatic-Pedagogic Value Alignment, 2017, ISRR.

[14]  Alexander A. Alemi, et al. Deep Variational Information Bottleneck, 2017, ICLR.

[15]  Rob Fergus, et al. Modeling Others using Oneself in Multi-Agent Reinforcement Learning, 2018, ICML.

[16]  Anind K. Dey, et al. Maximum Entropy Inverse Reinforcement Learning, 2008, AAAI.

[17]  Demis Hassabis, et al. Imagine all the people: how the brain creates and uses personality models to predict behavior, 2014, Cerebral cortex.

[18]  A. Woodward. Infants selectively encode the goal object of an actor's reach, 1998, Cognition.

[19]  S. Carey. The Origin of Concepts, 2000.

[20]  Joshua B. Tenenbaum, et al. Help or Hinder: Bayesian Models of Social Goal Inference, 2009, NIPS.

[21]  Raymond J. Dolan, et al. Game Theory of Mind, 2008, PLoS Comput. Biol.

[22]  Christopher G. Lucas, et al. The Child as Econometrician: A Rational Model of Preference Understanding in Children, 2014, PloS one.

[23]  David Mackay, et al. Probable networks and plausible predictions - a review of practical Bayesian methods for supervised neural networks, 1995.

[24]  Sergey Levine, et al. Meta-Learning and Universality: Deep Representations and Gradient Descent can Approximate any Learning Algorithm, 2017, ICLR.

[25]  Jieyu Zhao, et al. Simple Principles of Metalearning, 1996.

[26]  Amanda L. Woodward, et al. Infants track action goals within and across agents, 2007, Cognition.

[27]  A. Gopnik, et al. Children's understanding of representational change and its relation to the understanding of false belief and the appearance-reality distinction, 1988, Child development.

[28]  M. Tomasello, et al. Great apes anticipate that other individuals will act according to false beliefs, 2016, Science.

[29]  Joshua B. Tenenbaum, et al. The Naïve Utility Calculus: Computational Principles Underlying Commonsense Psychology, 2016, Trends in Cognitive Sciences.

[30]  Nando de Freitas, et al. Robust Imitation of Diverse Behaviors, 2017, NIPS.

[31]  N. Clayton, et al. Evidence suggesting that desire-state attribution may govern food sharing in Eurasian jays, 2013, Proceedings of the National Academy of Sciences.

[32]  Ryo Nakahashi, et al. Modeling Human Understanding of Complex Intentional Action with a Bayesian Nonparametric Subgoal Model, 2015, AAAI.

[33]  M. Tomasello, et al. Does the chimpanzee have a theory of mind? 30 years later, 2008, Trends in Cognitive Sciences.

[34]  Sepp Hochreiter, et al. Learning to Learn Using Gradient Descent, 2001, ICANN.

[35]  A. Gopnik, et al. Why the Child's Theory of Mind Really Is a Theory, 1992.

[36]  Tom Schaul, et al. Reinforcement Learning with Unsupervised Auxiliary Tasks, 2016, ICLR.

[37]  Tom Schaul, et al. Building Machines that Learn and Think for Themselves: Commentary on Lake et al., Behavioral and Brain Sciences, 2017, 2017, arXiv:1711.08378.

[38]  R. Baillargeon, et al. Psychological Reasoning in Infancy, 2016, Annual review of psychology.

[39]  Peter Dayan, et al. Monte Carlo Planning Method Estimates Planning Horizons during Interactive Social Exchange, 2015, PLoS Comput. Biol.

[40]  Patrice D. Tremoulet, et al. Perceptual causality and animacy, 2000, Trends in Cognitive Sciences.

[41]  Chris L. Baker, et al. Rational quantitative attribution of beliefs, desires and percepts in human mentalizing, 2017, Nature Human Behaviour.

[42]  S. Gächter. Behavioral Game Theory, 2008, Encyclopedia of Evolutionary Psychological Science.

[43]  S. Baron-Cohen, et al. Does the autistic child have a “theory of mind”?, 1985, Cognition.

[44]  Siddhartha S. Srinivasa, et al. Legibility and predictability of robot motion, 2013, 2013 8th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[45]  Jan Peters, et al. Relative Entropy Inverse Reinforcement Learning, 2011, AISTATS.

[46]  A. Goldman, et al. Mirror neurons and the simulation theory of mind-reading, 1998, Trends in Cognitive Sciences.

[47]  Stefano Ermon, et al. Generative Adversarial Imitation Learning, 2016, NIPS.

[48]  Colin Camerer, et al. A Cognitive Hierarchy Model of Games, 2004.

[49]  Daniel C. Dennett, et al. Two Contrasts: Folk Craft versus Folk Science and Belief versus Opinion, 1991.

[50]  David Silver, et al. A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning, 2017, NIPS.

[51]  H. Wimmer, et al. Beliefs about beliefs: Representation and constraining function of wrong beliefs in young children's understanding of deception, 1983, Cognition.

[52]  Anca D. Dragan, et al. Cooperative Inverse Reinforcement Learning, 2016, NIPS.

[53]  R. Gordon. Folk Psychology as Simulation, 1986.

[54]  Peter Dayan, et al. Improving Generalization for Temporal Difference Learning: The Successor Representation, 1993, Neural Computation.

[55]  Stephen J. Roberts, et al. Learning Against Non-Stationary Agents with Opponent Modelling and Deep Reinforcement Learning, 2018, AAAI Spring Symposia.

[56]  Sebastian Thrun, et al. Learning to Learn: Introduction and Overview, 1998, Learning to Learn.

[57]  A. Leslie. Pretense and representation: The origins of "theory of mind", 1987.

[58]  B. Bower. A Child's Theory of Mind, 1993.

[59]  S. Brison. The Intentional Stance, 1989.

[60]  Joshua B. Tenenbaum, et al. Building machines that learn and think like people, 2016, Behavioral and Brain Sciences.

[61]  G. Csibra, et al. Action Anticipation Through Attribution of False Belief by 2-Year-Olds, 2007, Psychological science.

[62]  Peter Stone, et al. Autonomous agents modelling other agents: A comprehensive survey and open problems, 2017, Artif. Intell.

[63]  Joshua B. Tenenbaum, et al. Coordinate to cooperate or compete: Abstract goals and joint intentions in social interaction, 2016, CogSci.

[64]  Andrew Y. Ng, et al. Algorithms for Inverse Reinforcement Learning, 2000, ICML.

[65]  C. Frith, et al. The Neural Basis of Mentalizing, 2006, Neuron.

[66]  Pieter Abbeel, et al. Apprenticeship learning via inverse reinforcement learning, 2004, ICML.

[67]  Marcin Andrychowicz, et al. One-Shot Imitation Learning, 2017, NIPS.

[68]  Eyal Amir, et al. Bayesian Inverse Reinforcement Learning, 2007, IJCAI.

[69]  C. Raymond Perrault, et al. Beyond Question-Answering, 1981.

[70]  A. Woodward. Infants' ability to distinguish between purposeful and non-purposeful behaviors, 1999.