Motion Segmentation and Balancing for a Biped Robot's Imitation Learning

Techniques for transferring human behaviors to robots through learning by imitation/demonstration have been the subject of much study. However, direct transfer of human motion trajectories to humanoid robots does not result in dynamically stable robot movements because of the differences in human and humanoid robot kinematics and dynamics. An imitating algorithm called posture-based imitation with balance learning (Post-BL) is proposed in this paper. This Post-BL algorithm consists of three parts: a key posture identification method is used to capture key postures as knots to reconstruct the motion imitated; a clustering method classifies key postures with high similarity; and a learning method enhances the static stability of balance during imitation. In motion reproduction, the proposed system smoothly transits between key poses and the robot learns to maintain balance by slightly adjusting the leg joints. The developed balance controller uses a reinforcement learning mechanism, which is sufficient to stabilize the robot during online imitation. The experimental results for simulation and a real humanoid robot show that the Post-BL algorithm allows demonstrated motions to be imitated balance to be preserved.

[1]  Changyin Sun,et al.  Model Identification and Control Design for a Humanoid Robot , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[2]  Kazuhito Yokoi,et al.  Planning walking patterns for a biped robot , 2001, IEEE Trans. Robotics Autom..

[3]  Kao-Shing Hwang,et al.  Learning to Adjust and Refine Gait Patterns for a Biped Robot , 2015, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[4]  Tzuu-Hseng S. Li,et al.  Walking Motion Generation, Synthesis, and Control for Biped Robot by Using PGRL, LPI, and Fuzzy Logic , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[5]  Joo-Ho Lee,et al.  Full-body imitation of human motions with kinect and heterogeneous kinematic structure of humanoid robot , 2012, 2012 IEEE/SICE International Symposium on System Integration (SII).

[6]  Sukhan Lee,et al.  Energy-Efficient SVM Learning Control System for Biped Walking Robots , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[7]  Chia-Wen Lin,et al.  Automatic Key Posture Selection for Human Behavior Analysis , 2005, 2005 IEEE 7th Workshop on Multimedia Signal Processing.

[8]  Brett Browning,et al.  A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..

[9]  Tzuu-Hseng S. Li,et al.  A biped gait learning algorithm for humanoid robots based on environmental impact assessed artificial bee colony , 2015, IEEE Access.

[10]  Abhishek Kar,et al.  Skeletal Tracking using Microsoft Kinect , 2011 .

[11]  Kao-Shing Hwang,et al.  Gait Balance and Acceleration of a Biped Robot Based on Q-Learning , 2016, IEEE Access.

[12]  Kao-Shing Hwang,et al.  Humanoid robot gait imitation , 2014, 2014 Proceedings of the SICE Annual Conference (SICE).

[13]  Michail G. Lagoudakis,et al.  Complete Analytical Forward and Inverse Kinematics for the NAO Humanoid Robot , 2015, J. Intell. Robotic Syst..

[14]  Wei He,et al.  Adaptive Neural Network Control of an Uncertain Robot With Full-State Constraints , 2016, IEEE Transactions on Cybernetics.

[15]  Chih-Lyang Hwang,et al.  Hybrid learning model and MMSVM classification for on-line visual imitation of a human with 3-D motions , 2015, Robotics Auton. Syst..

[16]  Yoshihiko Nakamura,et al.  Physical human robot interaction in imitation learning , 2011, 2011 IEEE International Conference on Robotics and Automation.

[17]  Luis Paulo Reis,et al.  Biped locomotion methodologies applied to humanoid robotics , 2009 .

[18]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[19]  Tohru Matsumoto,et al.  The Extraction of Symbolic Postures to Transfer Social Cues into Robot , 2010 .

[20]  Katsu Yamane,et al.  Simultaneous tracking and balancing of humanoid robots for imitating human motion capture data , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[21]  Chris Watkins,et al.  Learning from delayed rewards , 1989 .

[22]  Ken Chen,et al.  Gait Synthesis and Sensory Control of Stair Climbing for a Humanoid Robot , 2008, IEEE Transactions on Industrial Electronics.

[23]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[24]  Kao-Shing Hwang,et al.  Decoupled Visual Servoing With Fuzzy Q-Learning , 2018, IEEE Transactions on Industrial Informatics.

[25]  Peter Dayan,et al.  Technical Note: Q-Learning , 2004, Machine Learning.

[26]  Changyin Sun,et al.  Adaptive Neural Network Control of Biped Robots , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[27]  Qiong Wu,et al.  Development of a Complete Dynamic Model of a Planar Five-Link Biped and Sliding Mode Control of Its Locomotion During the Double Support Phase , 2004 .

[28]  F. Zuher,et al.  Recognition of Human Motions for Imitation and Control of a Humanoid Robot , 2012, 2012 Brazilian Robotics Symposium and Latin American Robotics Symposium.

[29]  Jin Bae Park,et al.  Robust Control of Planar Biped Robots in Single Support Phase Using Intelligent Adaptive Backstepping Technique , 2007 .

[30]  Qiang Wu,et al.  Extracting key postures in a human action video sequence , 2008, 2008 IEEE 10th Workshop on Multimedia Signal Processing.

[31]  Kazuhisa Mitobe,et al.  Control of a Biped Walking Robot during the Double Support Phase , 1997, Auton. Robots.

[32]  C. L. Philip Chen,et al.  Adaptive Neural Control for Dual-Arm Coordination of Humanoid Robot With Unknown Nonlinearities in Output Mechanism , 2015, IEEE Transactions on Cybernetics.