Frustration as a way toward autonomy and self-improvement in robotic navigation

Autonomy and self-improvement capabilities are still challenging in the field of robotics. Allowing a robot to autonomously navigate in wide and unknown environments not only requires a set of robust strategies to cope with miscellaneous situations, but also needs mechanisms of self-assessment for guiding learning and for monitoring strategies. Monitoring strategies requires feedbacks on the behavior's quality, from a given fitness system in order to take correct decisions. In this work, we focus on how an emotional controller can be used to modulate robot behaviors. Following an incremental and constructivist approach, we present a generic neural architecture, based on an online novelty detection algorithm that may be able to evaluate any sensory-motor strategies. This architecture learns contingencies between sensations and actions, giving the expected sensation from the past perception. Prediction error, coming from surprising events, provides a direct measure of the quality of the underlying sensory-motor contingencies involved. We show how a simple emotional controller based on the prediction progress allows the system to regulate its behavior to solve complex navigation tasks and to communicate its disability in deadlock situations. We propose that this model could be a key structure toward self-monitoring. We made several experiments that can account for such properties with different behaviors (road following and place cells based navigation).

[1]  E. Oja,et al.  Fast adaptive formation of orthogonalizing filters and associative memory in recurrent networks of neuron-like elements , 1976, Biological Cybernetics.

[2]  Christopher M. Bishop,et al.  Novelty detection and neural network validation , 1994 .

[3]  L. Devroye,et al.  Detection of Abnormal Behavior Via Nonparametric Estimation of the Support , 1980 .

[4]  Av Adolphe Chauvin Robustness of Visual Place Cells in Dynamic Indoor and Outdoor Environment , 2006 .

[5]  Illah R. Nourbakhsh,et al.  A survey of socially interactive robots , 2003, Robotics Auton. Syst..

[6]  Anthony C. Davison,et al.  Statistics of Extremes , 2015, International Encyclopedia of Statistical Science.

[7]  Shelley E. Taylor,et al.  Self-Evaluation Processes , 1995 .

[8]  S. Grossberg A neural theory of punishment and avoidance, I: Qualitative theory , 1972 .

[9]  Stephen Grossberg,et al.  A Neural Theory of Punishment and Avoidance , II : Quantitative Theory , 2003 .

[10]  Philippe Gaussier,et al.  Perception as a Dynamical Sensori-Motor Attraction Basin , 2005, ECAL.

[11]  C. Giovannangeli,et al.  About the constuctivist role of self-evaluation for interactive learnings and self-development , 2008 .

[12]  Pierre-Yves Oudeyer,et al.  Intrinsic Motivation Systems for Autonomous Mental Development , 2007, IEEE Transactions on Evolutionary Computation.

[13]  Philippe Gaussier,et al.  From view cells and place cells to cognitive map learning: processing stages of the hippocampal system , 2002, Biological Cybernetics.

[14]  James L. McClelland,et al.  Neural models of memory , 1999, Current Opinion in Neurobiology.

[15]  T. Kohonen,et al.  Bibliography of Self-Organizing Map SOM) Papers: 1998-2001 Addendum , 2003 .

[16]  Philippe Gaussier,et al.  A Hierarchy of Associations in Hippocampo-Cortical Systems: Cognitive Maps and Navigation Strategies , 2005, Neural Computation.

[17]  R. Passingham The hippocampus as a cognitive map J. O'Keefe & L. Nadel, Oxford University Press, Oxford (1978). 570 pp., £25.00 , 1979, Neuroscience.

[18]  Markus Vincze,et al.  Knowing your limits - self-evaluation and prediction in object recognition , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[19]  Antoine Manzanera,et al.  A New Hybrid differential filter for Motion Detection , 2004, ICCVG.

[20]  Stephen Grossberg,et al.  The ART of Adaptive Pattern Recognition Self-organizing by a Neu Network , 1988 .

[21]  P. Sen,et al.  Theory of rank tests , 1969 .

[22]  S. Marsland Novelty Detection in Learning Systems , 2008 .

[23]  Stephen J. Roberts,et al.  A Probabilistic Resource Allocating Network for Novelty Detection , 1994, Neural Computation.

[24]  V. Braitenberg Vehicles, Experiments in Synthetic Psychology , 1984 .

[25]  Philippe Gaussier,et al.  PerAc: A neural architecture to control artificial animals , 1995, Robotics Auton. Syst..

[26]  Philippe Gaussier,et al.  Emotions as a dynamical system: the interplay between the meta-control and communication function of emotions , 2011, Paladyn J. Behav. Robotics.

[27]  Philippe Gaussier,et al.  How can a robot evaluate its own behavior? A neural model for self-assessment , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).

[28]  Samuel Kaski,et al.  Bibliography of Self-Organizing Map (SOM) Papers: 1981-1997 , 1998 .

[29]  S. Amari Dynamics of pattern formation in lateral-inhibition type neural fields , 1977, Biological Cybernetics.

[30]  Bernard Widrow,et al.  Adaptive switching circuits , 1988 .