Intrinsic Rewards for Maintenance, Approach, Avoidance, and Achievement Goal Types

In reinforcement learning, reward is used to guide the learning process. The reward is often designed to be task-dependent, and it may require significant domain knowledge to design a good reward function. This paper proposes general reward functions for maintenance, approach, avoidance, and achievement goal types. These reward functions exploit the inherent property of each type of goal and are thus task-independent. We also propose metrics to measure an agent's performance for learning each type of goal. We evaluate the intrinsic reward functions in a framework that can autonomously generate goals and learn solutions to those goals using a standard reinforcement learning algorithm. We show empirically how the proposed reward functions lead to learning in a mobile robot application. Finally, using the proposed reward functions as building blocks, we demonstrate how compound reward functions, reward functions to generate sequences of tasks, can be created that allow the mobile robot to learn more complex behaviors.

[1]  Pierre-Yves Oudeyer,et al.  Intrinsically motivated goal exploration for active motor learning in robots: A case study , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2]  Alain Wegmann,et al.  Where do goals come from: the underlying principles of goal-oriented requirements engineering , 2005, 13th IEEE International Conference on Requirements Engineering (RE'05).

[3]  Sebastian Thrun,et al.  Lifelong robot learning , 1993, Robotics Auton. Syst..

[4]  Daniel Dewey,et al.  Reinforcement Learning and the Reward Engineering Principle , 2014, AAAI Spring Symposia.

[5]  Pierre-Yves Oudeyer,et al.  Intrinsic Motivation Systems for Autonomous Mental Development , 2007, IEEE Transactions on Evolutionary Computation.

[6]  Jochen J. Steil,et al.  Bootstrapping inverse kinematics with Goal Babbling , 2010, 2010 IEEE 9th International Conference on Development and Learning.

[7]  Marco Mirolli,et al.  GRAIL: A Goal-Discovering Robotic Architecture for Intrinsically-Motivated Learning , 2016, IEEE Transactions on Cognitive and Developmental Systems.

[8]  James T. Graham,et al.  Opportunistic Motivated Learning Agents , 2012, ICAISC.

[9]  G. Baldassarre,et al.  Functions and Mechanisms of Intrinsic Motivations The Knowledge Versus Competence Distinction , 2012 .

[10]  Kathryn E. Merrick,et al.  Motivated Reinforcement Learning - Curious Characters for Multiuser Games , 2009 .

[11]  Marcin Andrychowicz,et al.  Hindsight Experience Replay , 2017, NIPS.

[12]  James L. McClelland,et al.  Autonomous Mental Development by Robots and Animals , 2001, Science.

[13]  Michael Winikoff,et al.  Rich goal types in agent programming , 2011, AAMAS.

[14]  A. Barto,et al.  Novelty or Surprise? , 2013, Front. Psychol..

[15]  Axel van Lamsweerde,et al.  Goal-Oriented Requirements Engineering: A Guided Tour , 2001, RE.

[16]  Gerald DeJong,et al.  Reinforcement Learning and Shaping: Encouraging Intended Behaviors , 2002, ICML.

[17]  Kathryn E. Merrick,et al.  Experience-Based Generation of Maintenance and Achievement Goals on a Mobile Robot , 2016, Paladyn J. Behav. Robotics.

[18]  Daniel Moldt,et al.  Goal Representation for BDI Agent Systems , 2004, PROMAS.

[19]  Kathryn E. Merrick,et al.  Intrinsic Motivation and Introspection in Reinforcement Learning , 2012, IEEE Transactions on Autonomous Mental Development.

[20]  Andrea Bonarini,et al.  Incremental Skill Acquisition for Self-motivated Learning Animats , 2006, SAB.

[21]  Anand S. Rao,et al.  BDI Agents: From Theory to Practice , 1995, ICMAS.

[22]  A. Elliot Handbook of Approach and Avoidance Motivation , 2008 .

[23]  Pieter Abbeel,et al.  Automatic Goal Generation for Reinforcement Learning Agents , 2017, ICML.

[24]  Marco Mirolli,et al.  Biological Cumulative Learning through Intrinsic Motivations: A Simulated Robotic Study on the Development of Visually-Guided Reaching , 2010, EpiRob.

[25]  Marco Mirolli,et al.  Functions and Mechanisms of Intrinsic Motivations , 2013, Intrinsically Motivated Learning in Natural and Artificial Systems.

[26]  Michael Winikoff,et al.  Goals in agent systems: a unifying framework , 2008, AAMAS.

[27]  Marco Mirolli,et al.  Intrinsically Motivated Learning in Natural and Artificial Systems , 2013 .

[28]  Nuttapong Chentanez,et al.  Intrinsically Motivated Reinforcement Learning , 2004, NIPS.

[29]  Pierre-Yves Oudeyer,et al.  What is Intrinsic Motivation? A Typology of Computational Approaches , 2007, Frontiers Neurorobotics.

[30]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[31]  T. Martin McGinnity,et al.  Novelty Detection as an Intrinsic Motivation for Cumulative Learning Robots , 2013, Intrinsically Motivated Learning in Natural and Artificial Systems.

[32]  Koen V. Hindriks,et al.  Satisfying Maintenance Goals , 2007, DALT.

[33]  James Harland,et al.  On proactivity and maintenance goals , 2006, AAMAS '06.

[34]  Pierre-Yves Oudeyer,et al.  Maturationally-constrained competence-based intrinsically motivated learning , 2010, 2010 IEEE 9th International Conference on Development and Learning.

[35]  Ethem Alpaydin,et al.  Simplified ART: A new class of ART algorithms , 1998 .

[36]  Andrew Y. Ng,et al.  Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping , 1999, ICML.

[37]  Marco Mirolli,et al.  Intrinsic motivation mechanisms for competence acquisition , 2012, 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL).

[38]  John Schulman,et al.  Concrete Problems in AI Safety , 2016, ArXiv.

[39]  Kathryn E. Merrick,et al.  Modelling motivation for experience-based attention focus in reinforcement learning , 2007 .

[40]  Hugo Vieira Neto,et al.  Visual Novelty Detection for Inspection Tasks using Mobile Robots , 2004 .