Implementation of a trajectory library approach to controlling humanoid standing balance

This paper presents a nonlinear controller based on a trajectory library. To generate the library, we combine two trajectory optimization methods: a parametric trajectory optimization method that finds coarse initial trajectories and Differential Dynamic Programming (DDP) that further refines these trajectories and generates linear local models of the optimal control laws. To construct a controller from these local models, we maintain the consistency of adjacent trajectories. To keep the resultant library a reasonable size and also satisfy performance requirements, the library is generated based on the controller's predicted performance. It is applied to standing balance control of humanoid robots that explicitly handle pushes. Most previous work assumes that pushes are impulsive. The proposed controller also handles continuous pushes that change with time. We compared our approach with a Linear Quadratic Regulator (LQR) gain scheduling controller using the same optimization criterion. The effectiveness of the proposed method is explored with simulation and experiments.

[1]  Jovan Popovic,et al.  Multiobjective control with frictional contacts , 2007, SCA '07.

[2]  Christopher G. Atkeson,et al.  Using Local Trajectory Optimizers to Speed Up Global Optimization in Dynamic Programming , 1993, NIPS.

[3]  Benjamin J. Stephens,et al.  Humanoid push recovery , 2007, 2007 7th IEEE-RAS International Conference on Humanoid Robots.

[4]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[5]  W. Marsden I and J , 2012 .

[6]  Michael A. Saunders,et al.  SNOPT: An SQP Algorithm for Large-Scale Constrained Optimization , 2002, SIAM J. Optim..

[7]  A.D. Kuo,et al.  An optimal control model for analyzing human postural balance , 1995, IEEE Transactions on Biomedical Engineering.

[8]  Dinesh K. Pai,et al.  Data-driven Interactive Balancing Behaviors , 2005 .

[9]  David A. Forsyth,et al.  Generalizing motion edits with Gaussian processes , 2009, ACM Trans. Graph..

[10]  Hooshang Hemami,et al.  Postural stability of the two-degree-of-freedom biped by general linear feedback , 1976 .

[11]  Hooshang Hemami,et al.  Nonlinear feedback in simple locomotion systems , 1976 .

[12]  Anil V. Rao,et al.  Practical Methods for Optimal Control Using Nonlinear Programming , 1987 .

[13]  William D. Smart,et al.  Receding Horizon Differential Dynamic Programming , 2007, NIPS.

[14]  Russ Tedrake,et al.  LQR-trees: Feedback motion planning on sparse randomized trees , 2009, Robotics: Science and Systems.

[15]  Pierre-Brice Wieber,et al.  Online adaptation of reference trajectories for the control of walking systems , 2006, Robotics Auton. Syst..

[16]  Christopher G. Atkeson,et al.  Standing balance control using a trajectory library , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[17]  P. Harwood Michael , 1985 .

[18]  Christopher G. Atkeson,et al.  Multiple balance strategies from one optimization criterion , 2007, 2007 7th IEEE-RAS International Conference on Humanoid Robots.

[19]  Jun Morimoto,et al.  Nonparametric Representation of Policies and Value Functions: A Trajectory-Based Approach , 2002, NIPS.

[20]  Ambarish Goswami,et al.  Postural Stability of Biped Robots and the Foot-Rotation Indicator (FRI) Point , 1999, Int. J. Robotics Res..

[21]  Benjamin J. Stephens Integral control of humanoid balance , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[22]  Victor B. Zordan,et al.  Momentum control for balance , 2009, SIGGRAPH 2009.

[23]  David B. Doman,et al.  Integrated Adaptive Guidance and Control for Re-Entry Vehicles with Flight-Test Results , 2004 .

[24]  Christopher G. Atkeson,et al.  Compliant control of a hydraulic humanoid joint , 2007, 2007 7th IEEE-RAS International Conference on Humanoid Robots.

[25]  R. Bellman Dynamic programming. , 1957, Science.

[26]  Stephen R. McReynolds,et al.  The computation and theory of optimal control , 1970 .

[27]  David Q. Mayne,et al.  Differential dynamic programming , 1972, The Mathematical Gazette.