Robust Sensorimotor Representation to Physical Interaction Changes in Humanoid Motion Learning

This paper proposes a learning from demonstration system based on a motion feature, called phase transfer sequence. The system aims to synthesize the knowledge on humanoid whole body motions learned during teacher-supported interactions, and apply this knowledge during different physical interactions between a robot and its surroundings. The phase transfer sequence represents the temporal order of the changing points in multiple time sequences. It encodes the dynamical aspects of the sequences so as to absorb the gaps in timing and amplitude derived from interaction changes. The phase transfer sequence was evaluated in reinforcement learning of sitting-up and walking motions conducted by a real humanoid robot and compatible simulator. In both tasks, the robotic motions were less dependent on physical interactions when learned by the proposed feature than by conventional similarity measurements. Phase transfer sequence also enhanced the convergence speed of motion learning. Our proposed feature is original primarily because it absorbs the gaps caused by changes of the originally acquired physical interactions, thereby enhancing the learning speed in subsequent interactions.

[1]  Danica Kragic,et al.  Learning Actions from Observations , 2010, IEEE Robotics & Automation Magazine.

[2]  Stefan Schaal,et al.  Learning variable impedance control , 2011, Int. J. Robotics Res..

[3]  Qiang Huang,et al.  Sensory reflex control for humanoid walking , 2005, IEEE Transactions on Robotics.

[4]  Jun Morimoto,et al.  Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning , 2000, Robotics Auton. Syst..

[5]  Jun Nakanishi,et al.  Learning Attractor Landscapes for Learning Motor Primitives , 2002, NIPS.

[6]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[7]  P. Zelazo,et al.  "Walking" in the Newborn , 1972, Science.

[8]  Monica N. Nicolescu,et al.  Natural methods for robot task learning: instructive demonstrations, generalization and practice , 2003, AAMAS '03.

[9]  Takashi Minato,et al.  Physical Human-Robot Interaction: Mutual Learning and Adaptation , 2012, IEEE Robotics & Automation Magazine.

[10]  Giulio Sandini,et al.  The iCub humanoid robot: an open platform for research in embodied cognition , 2008, PerMIS.

[11]  Yasuo Kuniyoshi,et al.  Embodied basis of invariant features in execution and perception of whole-body dynamic actions - knacks and focuses of Roll-and-Rise motion , 2004, Robotics Auton. Syst..

[12]  L. Bergroth,et al.  A survey of longest common subsequence algorithms , 2000, Proceedings Seventh International Symposium on String Processing and Information Retrieval. SPIRE 2000.

[13]  Kazuhito Yokoi,et al.  Generating whole body motions for a biped humanoid robot from captured human dances , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[14]  Jun Morimoto,et al.  Learning from demonstration and adaptation of biped locomotion , 2004, Robotics Auton. Syst..

[15]  Qingguo Wang,et al.  A Fast Multiple Longest Common Subsequence (MLCS) Algorithm , 2011, IEEE Transactions on Knowledge and Data Engineering.

[16]  Peter Dayan,et al.  Technical Note: Q-Learning , 2004, Machine Learning.

[17]  Toyoaki Nishida,et al.  Robust Singular Spectrum Transform , 2009, IEA/AIE.

[18]  S. Kato,et al.  Motion control for humanoid robots based on the motion phase decision tree learning , 2004, Micro-Nanomechatronics and Human Science, 2004 and The Fourth Symposium Micro-Nanomechatronics for Information-Based Society, 2004..

[19]  B. Ulrich,et al.  Treadmill training of infants with Down syndrome: evidence-based developmental outcomes. , 2001, Pediatrics.

[20]  Aude Billard,et al.  On Learning, Representing, and Generalizing a Task in a Humanoid Robot , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[21]  Miomir Vukobratovic,et al.  Zmp: a Review of Some Basic Misunderstandings , 2006, Int. J. Humanoid Robotics.

[22]  Giorgio Metta,et al.  YARP: Yet Another Robot Platform , 2006 .

[23]  Henrik Schiøler,et al.  Sociable Robots Through Self-Maintained Energy , 2006 .

[24]  Koh Hosoda,et al.  External rotation as morphological bootstrapping for emergence of biped walking , 2010, 2010 IEEE 9th International Conference on Development and Learning.

[25]  Chung Hyuk Park,et al.  Transfer of skills between human operators through haptic training with robot coordination , 2010, 2010 IEEE International Conference on Robotics and Automation.

[26]  Keisuke Inoue,et al.  Knowledge Discovery from Heterogeneous Dynamic Systems using Change-Point Correlations , 2005, SDM.