Incremental episodic segmentation and imitative learning of humanoid robot through self-exploration

Imitation learning through self-exploration is an essential mechanism in developing sensorimotor skills for human infants as well for robots. We assume that a primitive sense of self is the prerequisite for successful social interaction rather than an outcome of it. During imitation learning, a crucial element of conception involves segmenting the continuous flow of motion into simpler units - motion primitives - by identifying the boundaries of an action. Secondly, in realistic environment the robot must be able to learn the observed motion patterns incrementally in a stable adaptive manner without corrupting previously learned information. In this paper, we propose an on-line and unsupervised motion segmentation method allowing the robot to imitate and perform actions by observing the motion patterns performed by other partner through Incremental Kernel Slow Feature Analysis. The segmentation model directly operates on the images acquired from the robots vision sensor (camera) without requiring any kinematic model of the demonstrator. After segmentation, the spatio-temporal motion sequences are learned incrementally through Topological Gaussian Adaptive Resonance Hidden Markov Model. The learning model dynamically generates the topological structure in a self-organizing and self-stabilizing manner. Each node represents the encoded motion element (i.e. joint angles). The complete architecture was evaluated by simulation experiments performed on DARwIn-OP humanoid robot.

[1]  Dana Kulic,et al.  Incremental learning of full body motion primitives for humanoid robots , 2008, Humanoids 2008 - 8th IEEE-RAS International Conference on Humanoid Robots.

[2]  Aude Billard,et al.  Stochastic gesture production and recognition model for a humanoid robot , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[3]  P. Berkes Pattern Recognition with Slow Feature Analysis , 2005 .

[4]  Tat-Jun Chin,et al.  Incremental Kernel Principal Component Analysis , 2007, IEEE Transactions on Image Processing.

[5]  Odest Chadwicke Jenkins,et al.  Learning from demonstration using a multi-valued function regressor for time-series data , 2010, 2010 10th IEEE-RAS International Conference on Humanoid Robots.

[6]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[7]  Stefan Schaal,et al.  Movement Segmentation and Recognition for Imitation Learning , 2012, AISTATS.

[8]  Aude Billard,et al.  Incremental learning of gestures by imitation in a humanoid robot , 2007, 2007 2nd ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[9]  A. Meltzoff The 'like me' framework for recognizing and becoming an intentional agent. , 2007, Acta psychologica.

[10]  Yoshihiko Nakamura,et al.  Humanoid Robot's Autonomous Acquisition of Proto-Symbols through Motion Segmentation , 2006, 2006 6th IEEE-RAS International Conference on Humanoid Robots.

[11]  Christian Laugier,et al.  Incremental Learning of Statistical Motion Patterns With Growing Hidden Markov Models , 2007, IEEE Transactions on Intelligent Transportation Systems.

[12]  Robert H. Halstead,et al.  Matrix Computations , 2011, Encyclopedia of Parallel Computing.

[13]  Jürgen Schmidhuber,et al.  An intrinsic value system for developing multiple invariant representations with incremental slowness learning , 2013, Front. Neurorobot..

[14]  Toyoaki Nishida,et al.  Incremental learning of gestures for human–robot interaction , 2009, AI & SOCIETY.

[15]  A. Meltzoff,et al.  What imitation tells us about social cognition: a rapprochement between developmental psychology and cognitive neuroscience. , 2003, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[16]  Brian Scassellati,et al.  Using probabilistic reasoning over time to self-recognize , 2009, Robotics Auton. Syst..

[17]  Laurenz Wiskott,et al.  Slowness and Sparseness Lead to Place, Head-Direction, and Spatial-View Cells , 2007, PLoS Comput. Biol..

[18]  Gisa Aschersleben,et al.  Early development of action control , 2006 .

[19]  Klaus Obermayer,et al.  Regularized Sparse Kernel Slow Feature Analysis , 2011, ECML/PKDD.

[20]  Stefan Schaal,et al.  Is imitation learning the route to humanoid robots? , 1999, Trends in Cognitive Sciences.

[21]  Varun Raj Kompella,et al.  Hierarchical Incremental Slow Feature Analysis , 2012 .

[22]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2003, ICTAI.

[23]  Matthew Brand,et al.  Incremental Singular Value Decomposition of Uncertain Data with Missing Values , 2002, ECCV.

[24]  Toyoaki Nishida,et al.  Fluid Imitation , 2012, Int. J. Soc. Robotics.

[25]  Aude Billard,et al.  Discriminative and adaptive imitation in uni-manual and bi-manual tasks , 2006, Robotics Auton. Syst..

[26]  Terrence J. Sejnowski,et al.  Slow Feature Analysis: Unsupervised Learning of Invariances , 2002, Neural Computation.

[27]  Michael Lindenbaum,et al.  Sequential Karhunen-Loeve basis extraction and its application to images , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[28]  Yoshihiko Nakamura,et al.  Embodied Symbol Emergence Based on Mimesis Theory , 2004, Int. J. Robotics Res..

[29]  Jacqueline Nadel,et al.  Imitation in Infancy. Cambridge Studies in Cognitive and Perceptual Development. , 1999 .

[30]  Stefanos Zafeiriou,et al.  Incremental Slow Feature Analysis with Indefinite Kernel for Online Temporal Video Segmentation , 2012, ACCV.

[31]  Marko Tscherepanow,et al.  TopoART: A Topology Learning Hierarchical ART Network , 2010, ICANN.

[32]  Ferda Nur Alpaslan,et al.  Behavior categorization using Correlation Based Adaptive Resonance Theory , 2009, 2009 17th Mediterranean Conference on Control and Automation.

[33]  Ming-Hsuan Yang,et al.  Incremental Learning for Robust Visual Tracking , 2008, International Journal of Computer Vision.

[34]  Dana Kulic,et al.  Incremental on-line hierarchical clustering of whole body motion patterns , 2007, RO-MAN 2007 - The 16th IEEE International Symposium on Robot and Human Interactive Communication.

[35]  Luc Van Gool,et al.  Temporal Relations in Videos for Unsupervised Activity Analysis , 2011, BMVC.

[36]  Stefanos Zafeiriou,et al.  Efficient Online Subspace Learning With an Indefinite Kernel for Visual Tracking and Recognition , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[37]  Toyoaki Nishida,et al.  On comparing SSA-based change point discovery algorithms , 2011, 2011 IEEE/SICE International Symposium on System Integration (SII).

[38]  Toyoaki Nishida,et al.  Unsupervised simultaneous learning of gestures, actions and their associations for Human-Robot Interaction , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[39]  G. Butterworth Neonatal imitation: Existence, mechanisms and motives. , 1999 .

[40]  Niko Wilbert,et al.  Slow feature analysis , 2011, Scholarpedia.

[41]  Dana Kulic,et al.  Whole body motion primitive segmentation from monocular video , 2009, 2009 IEEE International Conference on Robotics and Automation.

[42]  Scott Niekum,et al.  Incremental Semantically Grounded Learning from Demonstration , 2013, Robotics: Science and Systems.

[43]  Dacheng Tao,et al.  Slow Feature Analysis for Human Action Recognition , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[44]  Thomas Martinetz,et al.  Topology representing networks , 1994, Neural Networks.

[45]  Ferda Nur Alpaslan,et al.  Simple and complex behavior learning using behavior hidden Markov model and CobART , 2013, Neurocomputing.