论文信息 - Improving Biped Walk Stability Using Real-Time Corrective Human Feedback

Improving Biped Walk Stability Using Real-Time Corrective Human Feedback

Robust walking is one of the key requirements for soccer playing humanoid robots. Developing such a biped walk algorithm is non-trivial due to the complex dynamics of the walk process. In this paper, we first present a method for learning a corrective closed-loop policy to improve the walk stability for the Aldebaran Nao robot using real-time human feedback combined with an openloop walk cycle. The open-loop walk cycle is obtained from the recorded joint commands while the robot is walking using an existing walk algorithm as a blackbox unit. We capture the corrective feedback signals delivered by a human using a wireless feedback mechanism in the form of corrections to the particular joints and we present experimental results showing that a policy learned from a walk algorithm can be used to improve the stability of another walk algorithm. We then follow up with improving the open-loop walk cycle using advice operators before performing real-time human demonstration. During the demonstration, we then capture the sensory readings and the corrections in the form of displacements of the foot positions while the robot is executing improved open-loop walk cycle. We then translate the feet displacement values into individual correction signals for the leg joints using a simplified inverse kinematics calculation. We use a locally weighted linear regression method to learn a mapping from the recorded sensor values to the correction values. Finally, we use a simple anomaly detection method by modeling the changes in the sensory readings throughout the walk cycle during a stable walk as normal distributions and executing the correction policy only if a sensory reading goes beyond the modeled values. Experimental results demonstrate an improvement in the walk stability.

Manuela M. Veloso | Çetin Meriçli

[1] Brett Browning,et al. Learning robot motion control with demonstration and advice-operators , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2] Manuela M. Veloso,et al. Online ZMP sampling search for biped walking planning , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3] Daniel H. Grollman,et al. Dogged Learning for Robots , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[4] Eric Chown,et al. Omnidirectional Walking Using ZMP and Preview Control for the NAO Humanoid Robot , 2009, RoboCup.

[5] Manuela M. Veloso,et al. Teaching collaborative multi-robot tasks through demonstration , 2008, Humanoids 2008 - 8th IEEE-RAS International Conference on Humanoid Robots.

[6] Manuela M. Veloso,et al. Interactive Policy Learning through Confidence-Based Autonomy , 2014, J. Artif. Intell. Res..

[7] Stefan Schaal,et al. Robot Learning From Demonstration , 1997, ICML.

[8] Jacky Baltes,et al. RoboCup 2009: Robot Soccer World Cup XIII [papers from the 13th annual RoboCup International Symposium, Graz, Austria, June 29 - July 5, 2009] , 2010, RoboCup.

[9] D.H. Grollman,et al. Learning robot soccer skills from demonstration , 2007, 2007 IEEE 6th International Conference on Development and Learning.

[10] Manuela M. Veloso,et al. Biped Walk Learning Through Playback and Corrective Demonstration , 2010, AAAI.

[11] Andrew W. Moore,et al. Locally Weighted Learning , 1997, Artificial Intelligence Review.

[12] T. Röfer,et al. A Robust Closed-Loop Gait for the Standard Platform League Humanoid , 2009 .

[13] Brett Browning,et al. Learning by demonstration with critique from a human teacher , 2007, 2007 2nd ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[14] Jun Morimoto,et al. Learning from demonstration and adaptation of biped locomotion , 2004, Robotics Auton. Syst..

[15] Xiaoping Chen,et al. Simplified Walking: A New Way to Generate Flexible Biped Patterns , 2009 .

[16] Bariş Gökçe,et al. Parameter Optimization of a Signal-Based Biped Locomotion Approach Using Evolutionary Strategies , 2009 .

[17] Oliver Urbann,et al. Observer-based dynamic walking control for biped robots , 2009, Robotics Auton. Syst..