Online Multimodal Ensemble Learning Using Self-Learned Sensorimotor Representations

Internal models play a key role in cognitive agents by providing on the one hand predictions of sensory consequences of motor commands (forward models), and on the other hand inverse mappings (inverse models) to realize tasks involving control loops, such as imitation tasks. The ability to predict and generate new actions in continuously evolving environments intrinsically requiring the use of different sensory modalities is particularly relevant for autonomous robots, which must also be able to adapt their models online. We present a learning architecture based on self-learned multimodal sensorimotor representations. To attain accurate forward models, we propose an online heterogeneous ensemble learning method that allows us to improve the prediction accuracy by leveraging differences of multiple diverse predictors. We further propose a method to learn inverse models on-the-fly to equip a robot with multimodal learning skills to perform imitation tasks using multiple sensory modalities. We have evaluated the proposed methods on an iCub humanoid robot. Since no assumptions are made on the robot kinematic/dynamic structure, the method can be applied to different robotic platforms.

[1]  Angelika Peer,et al.  Imitation learning of human grasping skills from motion and force data , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2]  Alexey Tsymbal,et al.  Dynamic Integration of Regression Models , 2004, Multiple Classifier Systems.

[3]  Gary R. Bradski,et al.  Learning OpenCV - computer vision with the OpenCV library: software that sees , 2008 .

[4]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[5]  Jan Peters,et al.  Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[6]  Yiannis Demiris,et al.  A Groovy Virtual Drumming Agent , 2009, IVA.

[7]  Michael I. Jordan,et al.  Forward Models: Supervised Learning with a Distal Teacher , 1992, Cogn. Sci..

[8]  Mitsuo Kawato,et al.  MOSAIC Model for Sensorimotor Learning and Control , 2001, Neural Computation.

[9]  Tae-Kyun Kim,et al.  A syntactic approach to robot imitation learning using probabilistic activity grammars , 2013, Robotics Auton. Syst..

[10]  Nikunj C. Oza,et al.  Online Ensemble Learning , 2000, AAAI/IAAI.

[11]  Aude Billard,et al.  Online Learning of the Body Schema , 2008, Int. J. Humanoid Robotics.

[12]  K. Dautenhahn,et al.  Imitation in Animals and Artifacts , 2002 .

[13]  Pierre-Yves Oudeyer,et al.  Intrinsic Motivation Systems for Autonomous Mental Development , 2007, IEEE Transactions on Evolutionary Computation.

[14]  Giorgio Valentini,et al.  Ensembles of Learning Machines , 2002, WIRN.

[15]  Aude Billard,et al.  LEARNING MOTOR SKILLS BY IMITATION: A BIOLOGICALLY INSPIRED ROBOTIC MODEL , 2001, Cybern. Syst..

[16]  A. Meltzoff,et al.  The Robot in the Crib: A Developmental Analysis of Imitation Skills in Infants and Robots. , 2008, Infant and child development.

[17]  Ludmila I. Kuncheva,et al.  Measures of Diversity in Classifier Ensembles and Their Relationship with the Ensemble Accuracy , 2003, Machine Learning.

[18]  Yiannis Demiris,et al.  Spatio-Temporal Learning With the Online Finite and Infinite Echo-State Gaussian Processes , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[19]  A. Meltzoff,et al.  Explaining Facial Imitation: A Theoretical Model. , 1997, Early development & parenting.

[20]  Tomoaki Nakamura,et al.  Integrated concept of objects and human motions based on multi-layered multimodal LDA , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[21]  Atsuo Takanishi,et al.  Online learning of humanoid robot kinematics under switching tools contexts , 2013, 2013 IEEE International Conference on Robotics and Automation.

[22]  Cornelius Weber,et al.  Object Learning with Natural Language in a Distributed Intelligent System { A Case Study of Human-Robot Interaction , 2014 .

[23]  Yiannis Demiris,et al.  When and how to help: An iterative probabilistic model for learning assistance by demonstration , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[24]  Stefan Schaal,et al.  Learning inverse kinematics , 2001, Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180).

[25]  Giulio Sandini,et al.  The iCub humanoid robot: An open-systems platform for research in cognitive development , 2010, Neural Networks.

[26]  Pieter Abbeel,et al.  Combining model-based policy search with online model learning for control of physical humanoids , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[27]  Darwin G. Caldwell,et al.  Reinforcement Learning in Robotics: Applications and Real-World Challenges , 2013, Robotics.

[28]  Sergey Levine,et al.  Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics , 2014, NIPS.

[29]  D M Wolpert,et al.  Multiple paired forward and inverse models for motor control , 1998, Neural Networks.

[30]  Carl E. Rasmussen,et al.  Gaussian Processes for Data-Efficient Learning in Robotics and Control , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Daniel M. Wolpert,et al.  Forward Models for Physiological Motor Control , 1996, Neural Networks.

[32]  Yiannis Demiris,et al.  Online quantum mixture regression for trajectory learning by demonstration , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[33]  Guido Schillaci,et al.  Sensorimotor learning and simulation of experience as a basis for the development of cognition in robotics , 2014 .

[34]  Yiannis Demiris,et al.  Hierarchical learning approach for one-shot action imitation in humanoid robots , 2010, 2010 11th International Conference on Control Automation Robotics & Vision.

[35]  Yiannis Demiris,et al.  Efficient template-based path imitation by invariant feature mapping , 2009, 2009 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[36]  Y. Demiris,et al.  From motor babbling to hierarchical learning by imitation: a robot developmental pathway , 2005 .

[37]  Yiannis Demiris,et al.  Markerless perspective taking for humanoid robots in unconstrained environments , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[38]  Gillian M. Hayes,et al.  A Robot Controller Using Learning by Imitation , 1994 .

[39]  Martina Zambelli,et al.  Multimodal imitation using self-learned sensorimotor representations , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[40]  Aude Billard,et al.  On Learning, Representing, and Generalizing a Task in a Humanoid Robot , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[41]  Christian Balkenius,et al.  Sense of Touch in Robots With Self-Organizing Maps , 2011, IEEE Transactions on Robotics.

[42]  Michael Beetz,et al.  PR2 looking at things — Ensemble learning for unstructured information processing with Markov logic networks , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[43]  Yiannis Demiris,et al.  Unsupervised learning of complex articulated kinematic structures combining motion and skeleton information , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Mark de Berg,et al.  Orthogonal Range Searching , 1997 .

[45]  Yiannis Demiris,et al.  Hierarchical attentive multiple models for execution and recognition of actions , 2006, Robotics Auton. Syst..

[46]  Rui Araújo,et al.  A dynamic and on-line ensemble regression for changing environments , 2015, Expert Syst. Appl..

[47]  Herbert Jaeger,et al.  Adaptive Nonlinear System Identification with Echo State Networks , 2002, NIPS.

[48]  Giulio Sandini,et al.  Autonomous Online Learning of Reaching Behavior in a humanoid Robot , 2012, Int. J. Humanoid Robotics.

[49]  Alexander Maye,et al.  Using Sensorimotor Contingencies for Prediction and Action Planning , 2012, SAB.

[50]  A. Clark Embodied Prediction , 2015 .

[51]  Stefan Schaal,et al.  Robot Programming by Demonstration , 2009, Springer Handbook of Robotics.

[52]  Jochen J. Steil,et al.  Online Goal Babbling for rapid bootstrapping of inverse models in high dimensions , 2011, 2011 IEEE International Conference on Development and Learning (ICDL).

[53]  Bruno Lara,et al.  Internal Simulations for Behaviour Selection and Recognition , 2012, HBU.

[54]  Jan Peters,et al.  Model learning for robot control: a survey , 2011, Cognitive Processing.

[55]  Tomoaki Nakamura,et al.  Autonomous acquisition of multimodal information for online object concept formation by robots , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[56]  José Santos-Victor,et al.  Open and closed-loop task space trajectory control of redundant robots using learned models , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[57]  Gavin Brown,et al.  Ensemble Learning , 2010, Encyclopedia of Machine Learning and Data Mining.

[58]  M. Hadders‐Algra,et al.  The Neuronal Group Selection Theory: a framework to explain variation in normal motor development , 2000, Developmental medicine and child neurology.

[59]  Alex Simpkins,et al.  System Identification: Theory for the User, 2nd Edition (Ljung, L.; 1999) [On the Shelf] , 2012, IEEE Robotics & Automation Magazine.

[60]  Stefan Schaal,et al.  Fast and efficient incremental learning for high-dimensional movement systems , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[61]  Yiannis Demiris,et al.  A Quantum-Statistical Approach Towards Robot Learning by Demonstration , 2012 .

[62]  Alexander Maye,et al.  A discrete computational model of sensorimotor contingencies for object perception and control of behavior , 2011, 2011 IEEE International Conference on Robotics and Automation.

[63]  Minoru Asada,et al.  Body Scheme Acquisition by Cross Modal Map Learning among Tactile, Visual, and Proprioceptive Spaces , 2002 .

[64]  Leo Breiman,et al.  Stacked regressions , 2004, Machine Learning.

[65]  Saso Dzeroski,et al.  Online tree-based ensembles and option trees for regression on evolving data streams , 2015, Neurocomputing.

[66]  Yiannis Demiris,et al.  From exploration to imitation: using learnt internal models to imitate others , 2007 .

[67]  Yiannis Demiris,et al.  Learning reusable task components using hierarchical activity grammars with uncertainties , 2012, 2012 IEEE International Conference on Robotics and Automation.

[68]  Alípio Mário Jorge,et al.  Ensemble approaches for regression: A survey , 2012, CSUR.

[69]  Bruno Lara,et al.  Online learning of visuo-motor coordination in a humanoid robot. A biologically inspired model , 2014, 4th International Conference on Development and Learning and on Epigenetic Robotics.

[70]  José Santos-Victor,et al.  Visual learning by imitation with motor representations , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[71]  Giulio Sandini,et al.  Learning task space control through goal directed exploration , 2011, 2011 IEEE International Conference on Robotics and Biomimetics.

[72]  Nolan Wagener,et al.  Learning contact-rich manipulation skills with guided policy search , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[73]  C. Hofsten An action perspective on motor development , 2004, Trends in Cognitive Sciences.

[74]  Tetsuya Ogata,et al.  Developmental Human-Robot Imitation Learning of Drawing with a Neuro Dynamical System , 2013, 2013 IEEE International Conference on Systems, Man, and Cybernetics.

[75]  Olivier Sigaud,et al.  Robot Skill Learning: From Reinforcement Learning to Evolution Strategies , 2013, Paladyn J. Behav. Robotics.

[76]  Yiannis Demiris,et al.  A morphable template framework for robot learning by demonstration: Integrating one-shot and incremental learning approaches , 2014, Robotics Auton. Syst..

[77]  Yiannis Demiris,et al.  A nonparametric Bayesian approach toward robot learning by demonstration , 2012, Robotics Auton. Syst..

[78]  Darwin G. Caldwell,et al.  Encoderless position control of a two-link robot manipulator , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[79]  Yiannis Demiris,et al.  Towards One Shot Learning by imitation for humanoid robots , 2010, 2010 IEEE International Conference on Robotics and Automation.

[80]  Antoine Cully,et al.  Robots that can adapt like animals , 2014, Nature.

[81]  Yiannis Demiris,et al.  Information Processing in the Mirror Neuron System in Primates and Machines , 2013, Neuroinformatics.

[82]  Mitsuo Kawato,et al.  Internal models for motor control and trajectory planning , 1999, Current Opinion in Neurobiology.