A game theoretic framework for incentive-based models of intrinsic motivation in artificial systems

An emerging body of research is focusing on understanding and building artificial systems that can achieve open-ended development influenced by intrinsic motivations. In particular, research in robotics and machine learning is yielding systems and algorithms with increasing capacity for self-directed learning and autonomy. Traditional software architectures and algorithms are being augmented with intrinsic motivations to drive cumulative acquisition of knowledge and skills. Intrinsic motivations have recently been considered in reinforcement learning, active learning and supervised learning settings among others. This paper considers game theory as a novel setting for intrinsic motivation. A game theoretic framework for intrinsic motivation is formulated by introducing the concept of optimally motivating incentive as a lens through which players perceive a game. Transformations of four well-known mixed-motive games are presented to demonstrate the perceived games when players' optimally motivating incentive falls in three cases corresponding to strong power, affiliation and achievement motivation. We use agent-based simulations to demonstrate that players with different optimally motivating incentive act differently as a result of their altered perception of the game. We discuss the implications of these results both for modeling human behavior and for designing artificial agents or robots.

[1]  A. Tversky,et al.  Prospect theory: an analysis of decision under risk — Source link , 2007 .

[2]  Jutta Heckhausen,et al.  Motivation and action , 1991 .

[3]  R. McKelvey,et al.  An experimental study of the centipede game , 1992 .

[4]  Kathryn E. Merrick,et al.  Motivated Reinforcement Learning - Curious Characters for Multiuser Games , 2009 .

[5]  Christopher L. Simpkins,et al.  Deriving Behavior from Personality : A Reinforcement Learning Approach , 2010 .

[6]  J. Nash Equilibrium Points in N-Person Games. , 1950, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Chris Christodoulou,et al.  Multiagent Reinforcement Learning in the Iterated Prisoner's Dilemma: Fast cooperation through evolved payoffs , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).

[8]  Colin F. Camerer,et al.  Behavioral Game Theory:: Predicting Human Behavior in Strategic Situations , 2011 .

[9]  Pavel Stopka,et al.  Individual variation evades the Prisoner's Dilemma , 2002, BMC Evolutionary Biology.

[10]  Sarit Kraus,et al.  Adaptive multi-robot coordination: A game-theoretic perspective , 2010, 2010 IEEE International Conference on Robotics and Automation.

[11]  Tilman Börgers,et al.  Learning Through Reinforcement and Replicator Dynamics , 1997 .

[12]  E. Rowland Theory of Games and Economic Behavior , 1946, Nature.

[13]  Wim B. G. Liebrand,et al.  The effects of social motives on behavior in social dilemmas in two cultures. , 1985 .

[14]  A. Tversky,et al.  Prospect theory: analysis of decision under risk , 1979 .

[15]  D. M. Kuhlman,et al.  Expectations of choice behavior held by cooperators, competitors, and individualists across four classes of experimental games. , 1976 .

[16]  J M Smith,et al.  Evolution and the theory of games , 1976 .

[17]  Colin Camerer Behavioral Game Theory: Experiments in Strategic Interaction , 2003 .

[18]  Aaron Bryan Loyall,et al.  Believable agents: building interactive personalities , 1997 .

[19]  Rob Saunders,et al.  From Passive to Proactive Design Elements , 2007 .

[20]  F. B. Vernadat,et al.  Decisions with Multiple Objectives: Preferences and Value Tradeoffs , 1994 .

[21]  R. L. Keeney,et al.  Decisions with Multiple Objectives: Preferences and Value Trade-Offs , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[22]  Craig Boutilier,et al.  The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.

[23]  Robert H. Crites,et al.  Multiagent reinforcement learning in the Iterated Prisoner's Dilemma. , 1996, Bio Systems.

[24]  P. Glimcher Foundations of Neuroeconomic Analysis , 2010 .

[25]  Peter C. Fishburn,et al.  LEXICOGRAPHIC ORDERS, UTILITIES AND DECISION RULES: A SURVEY , 1974 .

[26]  D. van Knippenberg,et al.  Leader self-sacrifice and leadership effectiveness: the moderating role of leader prototypicality. , 2005, The Journal of applied psychology.

[27]  M. Dufwenberg Game theory. , 2011, Wiley interdisciplinary reviews. Cognitive science.

[28]  J. W. Atkinson,et al.  Achievement motive and test anxiety conceived as motive to approach success and motive to avoid failure. , 1960, Journal of abnormal and social psychology.

[29]  Li-Lin Rao,et al.  Is There a Violation of Savage’s Sure-Thing Principle in the Prisoner’s Dilemma Game? , 2010, Adapt. Behav..

[30]  Yan Meng,et al.  Multi-Robot Searching using Game-Theory Based Approach , 2008 .

[31]  D. M. Kuhlman,et al.  Individual differences in game motivation as moderators of preprogrammed strategy effects in prisoner's dilemma. , 1975, Journal of personality and social psychology.

[32]  Kathryn B. Laskey,et al.  Game Theory and Experimental Games: The Study of Strategic Interaction , 1984 .

[33]  John W. Atkinson,et al.  Motivation and achievement , 1974 .

[34]  Michael Wooldridge,et al.  Game Theory and Decision Theory in Multi-Agent Systems , 2002, Autonomous Agents and Multi-Agent Systems.

[35]  Annapurna Valluri,et al.  Learning and Cooperation in Sequential Games , 2006, Adapt. Behav..

[36]  A. Rapoport,et al.  Prisoner's Dilemma , 1965 .

[37]  D C McClelland,et al.  Power motivation and risk-taking behavior. , 1973, Journal of personality.

[38]  P. Todd,et al.  Simple Heuristics That Make Us Smart , 1999 .

[39]  Kenneth W. Terhune,et al.  Motives, situation, and interpersonal conflict within Prisoner's Dilemma. , 1968 .

[40]  J. W. Atkinson Motivational determinants of risk-taking behavior. , 1957, Psychological review.

[41]  Sarit Kraus,et al.  Robust solutions to Stackelberg games: Addressing bounded rationality and limited observations in human cognition , 2010, Artif. Intell..