Online learning of humanoid robot kinematics under switching tools contexts

In this paper a novel approach to kinematics learning and task space control, under switching contexts, is presented. Such non-stationary contexts may appear in many robotic tasks: in particular, the changing of the context due to the use of tools with different lengths and shapes is herein studied. We model the robot forward kinematics as a multi-valued function, in which different outputs for the same input query are related to actual different hidden contexts. To do that, we employ IMLE, a recent online learning algorithm that fits an infinite mixture of linear experts to the online stream of training data. This algorithm can directly provide multi-valued regression in a online fashion, while having, for classic single-valued regression, a performance comparable to state-of-the-art online learning algorithms. The context varying forward kinematics is learned online through exploration, not relying on any kind of prior knowledge. Using the proposed approach, the robot can dynamically learn how to use different tools, without forgetting the kinematic mappings concerning previously manipulated tools. No information is given about such tool changes to the learning algorithm, nor any assumption is made about the tool kinematics. To our knowledge this is the most general and efficient approach to learning and control under discrete varying contexts. Some experimental results obtained on a high-dimensional simulated humanoid robot provide a strong support to our approach.

[1]  Giulio Sandini,et al.  Learning task space control through goal directed exploration , 2011, 2011 IEEE International Conference on Robotics and Biomimetics.

[2]  Giorgio Metta,et al.  YARP: Yet Another Robot Platform , 2006 .

[3]  Stefan Schaal,et al.  Learning Operational Space Control , 2006, Robotics: Science and Systems.

[4]  Stefan Schaal,et al.  Incremental Online Learning in High Dimensions , 2005, Neural Computation.

[5]  Yoshihiko Nakamura,et al.  Inverse kinematic solutions with singularity robustness for robot manipulator control , 1986 .

[6]  Olivier Sigaud,et al.  On-line regression algorithms for learning mechanical models of robots: A survey , 2011, Robotics Auton. Syst..

[7]  Mitsuo Kawato,et al.  MOSAIC Model for Sensorimotor Learning and Control , 2001, Neural Computation.

[8]  Angelo Cangelosi,et al.  An open-source simulator for cognitive robotics research: the prototype of the iCub humanoid robot simulator , 2008, PerMIS.

[9]  Kumpati S. Narendra,et al.  Adaptive control using multiple models , 1997, IEEE Trans. Autom. Control..

[10]  Daniel H. Grollman,et al.  Incremental learning of subtasks from unsegmented demonstration , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[11]  José Santos-Victor,et al.  An online algorithm for simultaneously learning forward and inverse kinematics , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[12]  Sethu Vijayakumar,et al.  Context Estimation and Learning Control through Latent Variable Extraction: From discrete to continuous contexts , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[13]  A. Liegeois,et al.  Automatic supervisory control of the configuration and behavior of multi-body mechanisms , 1977 .

[14]  Jan Peters,et al.  Local Gaussian process regression for real-time model-based robot control , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[15]  Olivier Sigaud,et al.  Control of redundant robots using learned models: An operational space control approach , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[16]  Yasuo Kuniyoshi,et al.  Adaptive body schema for robotic tool-use , 2006, Adv. Robotics.

[17]  Jochen J. Steil,et al.  Learning Flexible Full Body Kinematics for Humanoid Tool Use , 2010, 2010 International Conference on Emerging Security Technologies.

[18]  Geoffrey E. Hinton,et al.  An Alternative Model for Mixtures of Experts , 1994, NIPS.