Minimum-time control of the Acrobot

The Acrobot, a two-link arm that is unactuated at the first joint, is challenging to control because it is an underactuated, nonlinear, continuous system. We describe a direct search algorithm for finding swingup trajectories for the Acrobot. The algorithm uses a lookahead search that maximizes the Acrobot's total energy in an N-step window. Because the controls are extremal and the number of switches in the window limited, the algorithm is fast. Nevertheless, the resulting trajectories are near optimal.

[1]  Arthur E. Bryson,et al.  Applied Optimal Control , 1969 .

[2]  Lino Guzzella,et al.  Time-optimal motions of robots in assembly tasks , 1985, 1985 24th IEEE Conference on Decision and Control.

[3]  Lino Guzzella,et al.  Time-optimal motions of robots in assembly tasks , 1986 .

[4]  Eugene I. Rivin,et al.  Mechanical Design of Robots , 1987 .

[5]  R. Murray,et al.  Nonlinear controllers for non-integrable systems: the Acrobot example , 1990, 1990 American Control Conference.

[6]  Ronald S. Fearing,et al.  Control of a two-link robot to achieve sliding and hopping gaits , 1992, Proceedings 1992 IEEE International Conference on Robotics and Automation.

[7]  Ronald S. Fearing,et al.  Control experiments on an underactuated robot with application to legged locomotion , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[8]  Mark W. Spong,et al.  Swinging up the Acrobot: an example of intelligent control , 1994, Proceedings of 1994 American Control Conference - ACC '94.

[9]  Mark W. Spong,et al.  The swing up control problem for the Acrobot , 1995 .

[10]  N. Sadegh,et al.  Minimum time trajectory learning , 1995, Proceedings of 1995 American Control Conference - ACC'95.

[11]  Richard S. Sutton,et al.  Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding , 1995, NIPS.

[12]  E. Daniel Davison,et al.  Regulation of the Acrobot , 1995 .

[13]  Mark W. Spong,et al.  Energy Based Control of a Class of Underactuated Mechanical Systems , 1996 .

[14]  Gary Boone,et al.  Efficient reinforcement learning: model-based Acrobot control , 1997, Proceedings of International Conference on Robotics and Automation.

[15]  Scott A. Bortoff,et al.  Stabilization using pseudolinearization and high-gain feedback , 1997, Autom..

[16]  Mark W. Spong,et al.  Control of underactuated mechanical systems using switching and saturation , 1997 .