Motor control optimization of compliant one-legged locomotion in rough terrain

While underactuated robotic systems are capable of energy efficient and rapid dynamic behavior, we still do not fully understand how body dynamics can be actively used for adaptive behavior in complex unstructured environment. In particular, we can expect that the robotic systems could achieve high maneuverability by flexibly storing and releasing energy through the motor control of the physical interaction between the body and the environment. This paper presents a minimalistic optimization strategy of motor control policy for underactuated legged robotic systems. Based on a reinforcement learning algorithm, we propose an optimization scheme, with which the robot can exploit passive elasticity for hopping forward while maintaining the stability of locomotion process in the environment with a series of large changes of ground surface. We show a case study of a simple one-legged robot which consists of a servomotor and a passive elastic joint. The dynamics and learning performance of the robot model are tested in simulation, and then transferred the results to the real-world robot.

[1]  R. Blickhan The spring-mass model for running and hopping. , 1989, Journal of biomechanics.

[2]  T. McMahon,et al.  The mechanics of running: how does stiffness couple with speed? , 1990, Journal of biomechanics.

[3]  A. J. van den Bogert,et al.  Direct dynamics simulation of the impact phase in heel-toe running. , 1995, Journal of biomechanics.

[4]  Daniel E. Koditschek,et al.  RHex: A Simple and Highly Mobile Hexapod Robot , 2001, Int. J. Robotics Res..

[5]  H. Benjamin Brown,et al.  c ○ 2001 Kluwer Academic Publishers. Manufactured in The Netherlands. RHex: A Biologically Inspired Hexapod Runner ∗ , 2022 .

[6]  Reinhard Blickhan,et al.  A movement criterion for running. , 2002, Journal of biomechanics.

[7]  Yasuhiro Fukuoka,et al.  Adaptive Dynamic Walking of a Quadruped Robot on Irregular Terrain Based on Biological Concepts , 2003, Int. J. Robotics Res..

[8]  Hiroshi Shimizu,et al.  Self-organized control of bipedal locomotion by neural oscillators in unpredictable environment , 1991, Biological Cybernetics.

[9]  R. Pfeifer,et al.  Exploiting body dynamics for controlling a running quadruped robot , 2005, ICAR '05. Proceedings., 12th International Conference on Advanced Robotics, 2005..

[10]  Jun Morimoto,et al.  Learning CPG-based biped locomotion with a policy gradient method , 2005, 5th IEEE-RAS International Conference on Humanoid Robots, 2005..

[11]  Russ Tedrake,et al.  Efficient Bipedal Robots Based on Passive-Dynamic Walkers , 2005, Science.

[12]  Yasuhiro Fukuoka,et al.  Autonomously generating efficient running of a quadruped robot using delayed feedback control , 2006, Adv. Robotics.

[13]  Fumiya Iida,et al.  Finding Resonance: Adaptive Frequency Oscillators for Dynamic Legged Locomotion , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[14]  Fumiya Iida,et al.  One-Legged Locomotion with a Compliant Passive Joint , 2006, IAS.

[15]  Bernd Porr,et al.  A Reflexive Neural Network for Dynamic Biped Walking Control , 2006 .

[16]  Shin Ishii,et al.  Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[17]  Koh Hosoda,et al.  Controlling the Walking Period of a Pneumatic Muscle Walker , 2006, Int. J. Robotics Res..

[18]  Fumiya Iida,et al.  Bipedal Walking and Running with Compliant Legs , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.