Tracking Emotions: Intrinsic Motivation Grounded on Multi - Level Prediction Error Dynamics

We present an intrinsic motivation architecture that generates behaviors towards self-generated and dynamic goals and that regulates goal selection and the balance between exploitation and exploration through multi-level monitoring of prediction error dynamics. This architecture modulates exploration noise and leverages computational resources according to the dynamics of the overall performance of the learning system. Results show that this architecture outperforms intrinsic motivation approaches where exploratory noise and goals are fixed. We suggest that the tracking of prediction error dynamics allows an artificial agent to be intrinsically motivated to seek new experiences but constrained to those that generate reducible prediction error. We argue about the potential relationship between emotional valence and rates of progress toward a goal.

[1]  C. Hofsten On the development of perception and action. , 2003 .

[2]  Jochen Steil,et al.  Online Associative Multi-Stage Goal Babbling Toward Versatile Learning of Sensorimotor Skills , 2019, 2019 Joint IEEE 9th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob).

[3]  Pierre-Yves Oudeyer,et al.  Intrinsic Motivation Systems for Autonomous Mental Development , 2007, IEEE Transactions on Evolutionary Computation.

[4]  Teuvo Kohonen,et al.  The self-organizing map , 1990, Neurocomputing.

[5]  R. Adolphs How should neuroscience study emotions? by distinguishing emotion states, concepts, and experiences , 2016, Social cognitive and affective neuroscience.

[6]  Giulio Sandini,et al.  Learning task space control through goal directed exploration , 2011, 2011 IEEE International Conference on Robotics and Biomimetics.

[7]  Erik Rietveld,et al.  The feeling of grip: novelty, error dynamics, and the predictive brain , 2017, Synthese.

[8]  Minoru Asada,et al.  Autonomous development of goals: From generic rewards to goal and self detection , 2014, 4th International Conference on Development and Learning and on Epigenetic Robotics.

[9]  Pierre-Yves Oudeyer,et al.  What is Intrinsic Motivation? A Typology of Computational Approaches , 2007, Frontiers Neurorobotics.

[10]  Bruno Lara,et al.  Exploration Behaviors, Body Representations, and Simulation Processes for the Development of Cognition in Artificial Agents , 2016, Front. Robot. AI.

[11]  S. V. D. Cruys,et al.  Affective Value in the Predictive Mind , 2017 .

[12]  M Kawato,et al.  Internal models for motor control. , 2007, Novartis Foundation symposium.

[13]  Ana Paiva,et al.  Emotion-Based Intrinsic Motivation for Reinforcement Learning Agents , 2011, ACII.

[14]  David Colliaux,et al.  Intrinsic motivation and episodic memories for robot exploration of high-dimensional sensory spaces , 2020, Adapt. Behav..

[15]  P. Hanges,et al.  A control system model of organizational motivation: Theoretical development and applied implications , 1987 .

[16]  Jürgen Schmidhuber,et al.  Stacked Convolutional Auto-Encoders for Hierarchical Feature Extraction , 2011, ICANN.

[17]  C. Carver,et al.  Origins and Functions of Positive and Negative Affect: A Control-Process View. , 1990 .

[18]  Mateus Joffily,et al.  Emotional Valence and the Free-Energy Principle , 2013, PLoS Comput. Biol..

[19]  Pierre-Yves Oudeyer,et al.  Intrinsically Motivated Goal Exploration Processes with Automatic Curriculum Learning , 2017, J. Mach. Learn. Res..

[20]  A. Kluger,et al.  The effects of feedback interventions on performance: A historical review, a meta-analysis, and a preliminary feedback intervention theory. , 1996 .

[21]  M. Walton,et al.  Action sets and decisions in the medial frontal cortex , 2004, Trends in Cognitive Sciences.

[22]  Pierre-Yves Oudeyer,et al.  Socially guided intrinsic motivation for robot learning of motor skills , 2014, Auton. Robots.

[23]  Christopher Kanan,et al.  Rethinking Continual Learning for Autonomous Agents and Robots , 2019, ArXiv.

[24]  Randall C. O’Reilly,et al.  Unraveling the Mysteries of Motivation , 2020, Trends in Cognitive Sciences.

[25]  J. Wagemans,et al.  Putting reward in art: A tentative prediction error account of visual art , 2011, i-Perception.

[26]  Chaur-Chin Chen,et al.  Similarity Measurement Between Images , 2005, COMPSAC.

[27]  Robert P. Abelson,et al.  Velocity relation : satisfaction as a function of the first derivative of outcome over time , 1991 .