Developmental Approach for Behavior Learning Using Primitive Motion Skills

Imitation learning through self-exploration is essential in developing sensorimotor skills. Most developmental theories emphasize that social interactions, especially understanding of observed actions, could be first achieved through imitation, yet the discussion on the origin of primitive imitative abilities is often neglected, referring instead to the possibility of its innateness. This paper presents a developmental model of imitation learning based on the hypothesis that humanoid robot acquires imitative abilities as induced by sensorimotor associative learning through self-exploration. In designing such learning system, several key issues will be addressed: automatic segmentation of the observed actions into motion primitives using raw images acquired from the camera without requiring any kinematic model; incremental learning of spatio-temporal motion sequences to dynamically generates a topological structure in a self-stabilizing manner; organization of the learned data for easy and efficient retrieval using a dynamic associative memory; and utilizing segmented motion primitives to generate complex behavior by the combining these motion primitives. In our experiment, the self-posture is acquired through observing the image of its own body posture while performing the action in front of a mirror through body babbling. The complete architecture was evaluated by simulation and real robot experiments performed on DARwIn-OP humanoid robot.

[1]  Dana Kulic,et al.  Incremental learning of full body motion primitives for humanoid robots , 2008, Humanoids 2008 - 8th IEEE-RAS International Conference on Humanoid Robots.

[2]  Qingfu Zhang,et al.  Global path planning of wheeled robots using multi-objective memetic algorithms , 2015, Integr. Comput. Aided Eng..

[3]  M. Brass,et al.  Imitation: is cognitive neuroscience solving the correspondence problem? , 2005, Trends in Cognitive Sciences.

[4]  Terrence J. Sejnowski,et al.  Slow Feature Analysis: Unsupervised Learning of Invariances , 2002, Neural Computation.

[5]  Hojjat Adeli,et al.  Spiking Neural Networks , 2009, Int. J. Neural Syst..

[6]  Stefan Schaal,et al.  Movement Segmentation and Recognition for Imitation Learning , 2012, AISTATS.

[7]  H. Adeli,et al.  Brain-computer interface technologies: from signal to action , 2013, Reviews in the neurosciences.

[8]  Michael Lindenbaum,et al.  Sequential Karhunen-Loeve basis extraction and its application to images , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[9]  Aude Billard,et al.  Incremental learning of gestures by imitation in a humanoid robot , 2007, 2007 2nd ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[10]  Monica N. Nicolescu,et al.  Natural methods for robot task learning: instructive demonstrations, generalization and practice , 2003, AAMAS '03.

[11]  Hojjat Adeli,et al.  Enhanced probabilistic neural network with local decision circles: A robust classifier , 2010, Integr. Comput. Aided Eng..

[12]  R. W. Mitchell,et al.  Kinesthetic-visual matching and the self-concept as explanations of mirror-self-recognition. , 1997 .

[13]  Chu Kiong Loo,et al.  Robot behaviour learning using Topological Gaussian Adaptive Resonance Hidden Markov Model , 2016, Neural Computing and Applications.

[14]  Ezequiel López-Rubio,et al.  Learning Topologies with the Growing Neural Forest , 2016, Int. J. Neural Syst..

[15]  C Stanger,et al.  Self development and self-conscious emotions. , 1989, Child development.

[16]  Christian Laugier,et al.  Incremental Learning of Statistical Motion Patterns With Growing Hidden Markov Models , 2007, IEEE Transactions on Intelligent Transportation Systems.

[17]  Stefanos Zafeiriou,et al.  Incremental Slow Feature Analysis with Indefinite Kernel for Online Temporal Video Segmentation , 2012, ACCV.

[18]  Jacqueline Nadel,et al.  Imitation in infancy , 2011 .

[19]  Boaz Lerner,et al.  The Bayesian ARTMAP , 2007, IEEE Transactions on Neural Networks.

[20]  Philippe Rochat,et al.  The self in infancy: Theory and research. , 1995 .

[21]  Jürgen Schmidhuber,et al.  An intrinsic value system for developing multiple invariant representations with incremental slowness learning , 2013, Front. Neurorobot..

[22]  A. Meltzoff,et al.  What imitation tells us about social cognition: a rapprochement between developmental psychology and cognitive neuroscience. , 2003, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[23]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2004 .

[24]  Brian Scassellati,et al.  Using probabilistic reasoning over time to self-recognize , 2009, Robotics Auton. Syst..

[25]  Marko Tscherepanow,et al.  TopoART: A Topology Learning Hierarchical ART Network , 2010, ICANN.

[26]  Hojjat Adeli,et al.  Machine Learning: Neural Networks, Genetic Algorithms, and Fuzzy Systems , 1994 .

[27]  Heiko Wersing,et al.  Online Learning of Objects in a Biologically Motivated Visual Architecture , 2007, Int. J. Neural Syst..

[28]  T. Poggio,et al.  Cognitive neuroscience: Neural mechanisms for the recognition of biological movements , 2003, Nature Reviews Neuroscience.

[29]  Matthias W. Seeger,et al.  Gaussian Processes For Machine Learning , 2004, Int. J. Neural Syst..

[30]  Ming-Hsuan Yang,et al.  Incremental Learning for Robust Visual Tracking , 2008, International Journal of Computer Vision.

[31]  Stefan Schaal,et al.  Is imitation learning the route to humanoid robots? , 1999, Trends in Cognitive Sciences.

[32]  Chu Kiong Loo,et al.  Incremental episodic segmentation and imitative learning of humanoid robot through self-exploration , 2016, Neurocomputing.

[33]  C. Heyes,et al.  'Mirroring' Association and the Correspondence Problem , 2008 .

[34]  Varun Raj Kompella,et al.  Hierarchical Incremental Slow Feature Analysis , 2012 .

[35]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2003, ICTAI.

[36]  Caroline Catmur,et al.  Associative sequence learning: the role of experience in the development of imitation and the mirror system , 2009, Philosophical Transactions of the Royal Society B: Biological Sciences.

[37]  Yoshihiko Nakamura,et al.  Embodied Symbol Emergence Based on Mimesis Theory , 2004, Int. J. Robotics Res..

[38]  Anne M. P. Canuto,et al.  Improving Artmap Learning Through Variable Vigilance , 2001, Int. J. Neural Syst..

[39]  A. Meltzoff The 'like me' framework for recognizing and becoming an intentional agent. , 2007, Acta psychologica.

[40]  Yoshihiko Nakamura,et al.  Humanoid Robot's Autonomous Acquisition of Proto-Symbols through Motion Segmentation , 2006, 2006 6th IEEE-RAS International Conference on Humanoid Robots.

[41]  Stefanos Zafeiriou,et al.  Efficient Online Subspace Learning With an Indefinite Kernel for Visual Tracking and Recognition , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[42]  Dana Kulic,et al.  Whole body motion primitive segmentation from monocular video , 2009, 2009 IEEE International Conference on Robotics and Automation.

[43]  Scott Niekum,et al.  Incremental Semantically Grounded Learning from Demonstration , 2013, Robotics: Science and Systems.

[44]  Xiang Cao,et al.  Neural Network Based Temporal Video Segmentation , 2002, Int. J. Neural Syst..

[45]  Dana Kulic,et al.  Incremental on-line hierarchical clustering of whole body motion patterns , 2007, RO-MAN 2007 - The 16th IEEE International Symposium on Robot and Human Interactive Communication.

[46]  Olivier Michel,et al.  Cyberbotics Ltd. Webots™: Professional Mobile Robot Simulation , 2004 .

[47]  Ferda Nur Alpaslan,et al.  Simple and complex behavior learning using behavior hidden Markov model and CobART , 2013, Neurocomputing.

[48]  Gisa Aschersleben,et al.  Early development of action control , 2006 .

[49]  Gene H. Golub,et al.  Matrix computations (3rd ed.) , 1996 .

[50]  Tat-Jun Chin,et al.  Incremental Kernel Principal Component Analysis , 2007, IEEE Transactions on Image Processing.

[51]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[52]  Matthew Brand,et al.  Incremental Singular Value Decomposition of Uncertain Data with Missing Values , 2002, ECCV.

[53]  Stefan Schaal,et al.  Robot Programming by Demonstration , 2009, Springer Handbook of Robotics.

[54]  Aude Billard,et al.  Discriminative and adaptive imitation in uni-manual and bi-manual tasks , 2006, Robotics Auton. Syst..

[55]  J. Ariel Sirat,et al.  A Fast Neural Algorithm for Principal Component Analysis and Singular Value Decomposition , 1991, Int. J. Neural Syst..

[56]  Niko Wilbert,et al.  Slow feature analysis , 2011, Scholarpedia.

[57]  A. Slater,et al.  Innate and learned perceptual abilities in the newborn infant , 1998, Experimental Brain Research.

[58]  Odest Chadwicke Jenkins,et al.  Learning from demonstration using a multi-valued function regressor for time-series data , 2010, 2010 10th IEEE-RAS International Conference on Humanoid Robots.