Active learning of local predictable representations with artificial curiosity

In this article, we present some preliminary work on integrating an artificial curiosity mechanism in PROPRE, a generic and modular neural architecture, to obtain online, open-ended and active learning of a sensory-motor space, where large areas can be unlearnable. PROPRE consists of the combination of the projection of the input motor flow, using a self-organizing map, with the regression of the sensory output flow from this projection representation, using a linear regression. The main feature of PROPRE is the use of a predictability module that provides an interestingness measure for the current motor stimulus depending on a simple evaluation of the sensory prediction quality. This measure modulates the projection learning so that to favor the representations that predict the output better than a local average. Especially, this leads to the learning of local representations where an input/output relationship is defined [1]. In this article, we propose an artificial curiosity mechanism based on the monitoring of learning progress, as proposed in [2], in the neighborhood of each local representation. Thus, PROPRE simultaneously learns interesting representations of the input flow (depending on their capacities to predict the output) and explores actively this input space where the learning progress is the higher. We illustrate our architecture on the learning of a direct model of an arm whose hand can only be perceived in a restricted visual space. The modulation of the projection learning leads to a better performance and the use of the curiosity mechanism provides quicker learning and even improves the final performance.

[1]  Alexander Gepperth,et al.  Multimodal space representation driven by self-evaluation of predictability , 2014, 4th International Conference on Development and Learning and on Epigenetic Robotics.

[2]  Nuttapong Chentanez,et al.  Intrinsically Motivated Learning of Hierarchical Collections of Skills , 2004 .

[3]  Pierre-Yves Oudeyer,et al.  Active learning of inverse models with intrinsically motivated goal exploration in robots , 2013, Robotics Auton. Syst..

[4]  E. Deci,et al.  Intrinsic and Extrinsic Motivations: Classic Definitions and New Directions. , 2000, Contemporary educational psychology.

[5]  Alexander Gepperth,et al.  PROPRE: PROjection and PREdiction for multimodal correlations learning. An application to pedestrians visual data discrimination , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).

[6]  Sebastian Thrun,et al.  Exploration in active learning , 1998 .

[7]  Alexander Gepperth,et al.  Using self-organizing maps for regression: the importance of the output function , 2015, ESANN.

[8]  Pierre-Yves Oudeyer,et al.  Intrinsic Motivation Systems for Autonomous Mental Development , 2007, IEEE Transactions on Evolutionary Computation.

[9]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[10]  Frank Guerin,et al.  Learning like a baby: a survey of artificial intelligence approaches , 2011, The Knowledge Engineering Review.

[11]  Giulio Sandini,et al.  Developmental robotics: a survey , 2003, Connect. Sci..

[12]  B. John Oommen,et al.  Topology-oriented self-organizing maps: a survey , 2014, Pattern Analysis and Applications.

[13]  P. L. Adams THE ORIGINS OF INTELLIGENCE IN CHILDREN , 1976 .

[14]  Gilles Pagès,et al.  Theoretical aspects of the SOM algorithm , 1998, Neurocomputing.

[15]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[16]  Yann Boniface,et al.  Dynamic self-organising map , 2011, Neurocomputing.

[17]  Juyang Weng,et al.  Developmental Robotics: Theory and Experiments , 2004, Int. J. Humanoid Robotics.

[18]  Alexander Gepperth,et al.  Learning of local predictable representations in partially learnable environments , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[19]  Jürgen Schmidhuber,et al.  Curious model-building control systems , 1991, [Proceedings] 1991 IEEE International Joint Conference on Neural Networks.

[20]  Pierre-Yves Oudeyer,et al.  R-IAC: Robust Intrinsically Motivated Exploration and Active Learning , 2009, IEEE Transactions on Autonomous Mental Development.