Learning to Adjust and Refine Gait Patterns for a Biped Robot

In this paper, a reinforced learning method for biped walking is proposed, where the robot learns to appropriately modulate an observed walking pattern. The biped robot was equipped with two Q -learning mechanisms. First, the robot learns a policy to adjust a defective walking pattern, gait-by-gait, into a more stable one. To avoid the complexity of adjusting too many joints of a humanoid robot and to speed up the learning process, the dimensionality of the action space was reduced. In turn, the other learning mechanism trained the robot to walk in a refined pattern, allowing it to walk faster without the loss of other required criteria, such as walking straight. This approach was implemented with both a simulated robot model and an actual biped robot. The results from the simulations and experiments show that successful walking policies were obtained. The learning system works quickly enough so that the robot was able to continually adapt to the terrain as it walked.

[1]  Jun Morimoto,et al.  Learning Biped Locomotion , 2007, IEEE Robotics & Automation Magazine.

[2]  Yuan F. Zheng,et al.  Reinforcement learning for a biped robot to climb sloping surfaces , 1997 .

[3]  William A. Gruver,et al.  Optimization of the biped robot trajectory , 1991, Conference Proceedings 1991 IEEE International Conference on Systems, Man, and Cybernetics.

[4]  Zhi Liu,et al.  A Probabilistic Neural-Fuzzy Learning System for Stochastic Modeling , 2008, IEEE Transactions on Fuzzy Systems.

[5]  Eric Chown,et al.  Omnidirectional Walking Using ZMP and Preview Control for the NAO Humanoid Robot , 2009, RoboCup.

[6]  Masa-aki Sato,et al.  Reinforcement Learning for Biped Robot , 2003 .

[7]  Changjiu Zhou,et al.  Dynamic balance of a biped robot using fuzzy reinforcement learning agents , 2003, Fuzzy Sets Syst..

[8]  Jungho Lee,et al.  Stable Walking Pattern Generation for a Biped Robot Using Reinforcement Learning , 2009 .

[9]  Shuji Hashimoto,et al.  Balance control of robot and human-robot interaction with haptic sensing foots , 2009, 2009 2nd Conference on Human System Interactions.

[10]  Jun Morimoto,et al.  A simple reinforcement learning algorithm for biped walking , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[11]  Tzuu-Hseng S. Li,et al.  Dynamic Balance Control for Biped Robot Walking Using Sensor Fusion, Kalman Filter, and Fuzzy Logic , 2012, IEEE Transactions on Industrial Electronics.

[12]  Kazuhito Yokoi,et al.  Biped walking pattern generation by using preview control of zero-moment point , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[13]  Min Cheol Lee,et al.  Fuzzy Posture Control for Biped Walking Robot Based on Force Sensor for ZMP , 2006, 2006 SICE-ICASE International Joint Conference.

[14]  Ronald C. Arkin,et al.  Autonomous Mobile Robots , 2016 .

[15]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[16]  K. Mitobe,et al.  A ZMP control of a powered passive dynamic walking robot , 2010, 2010 World Automation Congress.

[17]  Seokmin Hong,et al.  An omni-directional walking pattern generation method for humanoid robots with quartic polynomials , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[18]  Shugen Ma,et al.  Omnidirectional static walking of a quadruped robot , 2005, IEEE Transactions on Robotics.

[19]  Shin Ishii,et al.  Reinforcement learning for a biped robot based on a CPG-actor-critic method , 2007, Neural Networks.

[20]  Sun Zengqi,et al.  Reinforcement learning method-based stable gait synthesis for biped robot , 2004, ICARCV 2004 8th Control, Automation, Robotics and Vision Conference, 2004..

[21]  S. Sitharama Iyengar,et al.  Autonomous Mobile Robots , 1991 .

[22]  Danwei Wang,et al.  Central Pattern Generator Inspired Control for Adaptive Walking of Biped Robots , 2013, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[23]  K. Erbatur,et al.  Humanoid Walking Robot Control with Natural ZMP References , 2006, IECON 2006 - 32nd Annual Conference on IEEE Industrial Electronics.

[24]  Shuuji Kajita,et al.  ZMP-Based Biped Running Control , 2007, IEEE Robotics & Automation Magazine.