论文信息 - Fast biped walking with a reflexive controller and real-time policy searching

Fast biped walking with a reflexive controller and real-time policy searching

In this paper, we present our design and experiments of a planar biped robot ("RunBot") under pure reflexive neuronal control. The goal of this study is to combine neuronal mechanisms with biomechanics to obtain very fast speed and the on-line learning of circuit parameters. Our controller is built with biologically inspired sensor- and motor-neuron models, including local reflexes and not employing any kind of position or trajectory-tracking control algorithm. Instead, this reflexive controller allows RunBot to exploit its own natural dynamics during critical stages of its walking gait cycle. To our knowledge, this is the first time that dynamic biped walking is achieved using only a pure reflexive controller. In addition, this structure allows using a policy gradient reinforcement learning algorithm to tune the parameters of the reflexive controller in real-time during walking. This way RunBot can reach a relative speed of 3.5 leg-lengths per second after a few minutes of online learning, which is faster than that of any other biped robot, and is also comparable to the fastest relative speed of human walking. In addition, the stability domain of stable walking is quite large supporting this design strategy.

[1] Miomir Vukobratović,et al. Biped Locomotion: Dynamics, Stability, Control and Application , 1990 .

[2] M. Vukobratovic,et al. Biped Locomotion , 1990 .

[3] Patrik Larsson,et al. A Distributed Neural Network Architecture for Hexapod Robot Locomotion , 1992, Neural Computation.

[4] Randall D. Beer,et al. Application of evolved locomotion controllers to a hexapod robot , 1996, Robotics Auton. Syst..

[5] Randall D. Beer,et al. Biologically inspired approaches to robotics: what can we learn from insects? , 1997, CACM.

[6] Örjan Ekeberg,et al. A neuro-mechanical model of legged locomotion: single leg control , 1998, Biological Cybernetics.

[7] Richard Quint van der Linde,et al. Active Leg Compliance for Passive Walking , 1998, ICRA.

[8] Thomas Kindermann,et al. Walknet--a biologically inspired network to control six-legged walking , 1998, Neural Networks.

[9] Jerry E. Pratt,et al. Exploiting inherent robustness and natural dynamics in the control of bipedal walking robots , 2000 .

[10] M. Anthony Lewis,et al. Certain Principles of Biomorphic Robots , 2001, Auton. Robots.

[11] Yasuhiro Fukuoka,et al. Adaptive Dynamic Walking of a Quadruped Robot on Irregular Terrain Based on Biological Concepts , 2003, Int. J. Robotics Res..

[12] Peter Stone,et al. Policy gradient reinforcement learning for fast quadrupedal locomotion , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[13] Russ Tedrake,et al. Efficient Bipedal Robots Based on Passive-Dynamic Walkers , 2005, Science.