Optimization-based iterative learning control for trajectory tracking

In this paper, an optimization-based iterative learning control approach is presented. Given a desired trajectory to be followed, the proposed learning algorithm improves the system performance from trial to trial by exploiting the experience gained from previous repetitions. Taking advantage of the a-priori knowledge about the systems dominating dynamics, a data-based update rule is derived which adapts the feedforward input signal after each trial. By combining traditional model-based optimal filtering methods with state-of-the-art optimization techniques such as convex programming, an effective and computationally highly efficient learning strategy is obtained. Moreover, the derived formalism allows for the direct treatment of input and state constraints. Different (nonlinear) performance objectives can be specified defining the overall learning behavior. Finally, the proposed algorithm is successfully applied to the benchmark problem of swinging up a pendulum using open-loop control only.

[1]  K. Lee,et al.  Semi-empirical model-based multivariable iterative learning control of an RTP system , 2005 .

[2]  Svante Gunnarsson,et al.  Time and frequency domain convergence properties in iterative learning control , 2002 .

[3]  R. Tousain,et al.  Design strategy for iterative learning control based on optimal control , 2001, Proceedings of the 40th IEEE Conference on Decision and Control (Cat. No.01CH37228).

[4]  Amit Patra,et al.  Swing-up and stabilization of a cart-pendulum system under restricted cart track length , 2002, Syst. Control. Lett..

[5]  Richard W. Longman,et al.  A mathematical theory of learning control for linear discrete multivariable systems , 1988 .

[6]  Richard W. Longman,et al.  Iterative learning control and repetitive control for engineering practice , 2000 .

[7]  Kirsten Morris,et al.  Friction and the Inverted Pendulum Stabilization Problem , 2008 .

[8]  E. Rogers,et al.  Iterative learning control using optimal feedback and feedforward actions , 1996 .

[9]  Kazunobu Yoshida,et al.  Swing-up control of an inverted pendulum by energy-based methods , 1999, Proceedings of the 1999 American Control Conference (Cat. No. 99CH36251).

[10]  Guanrong Chen,et al.  Kalman Filtering with Real-time Applications , 1987 .

[11]  D. Owens Iterative learning control-convergence using high gain feedback , 1992, [1992] Proceedings of the 31st IEEE Conference on Decision and Control.

[12]  Jay H. Lee,et al.  Control of Wafer Temperature Uniformity in Rapid Thermal Processing Using an Optimal Iterative Learning Control Technique , 2000 .

[13]  F. L. Chernousko,et al.  Time-optimal swing-up feedback control of a pendulum , 2006 .

[14]  A.G. Alleyne,et al.  A survey of iterative learning control , 2006, IEEE Control Systems.

[15]  Suguru Arimoto,et al.  Bettering operation of Robots by learning , 1984, J. Field Robotics.

[16]  B. Francis,et al.  A lifting technique for linear periodic systems with applications to sampled-data control , 1991 .

[17]  Katsuhisa Furuta,et al.  Swing up control of inverted pendulum , 1991, Proceedings IECON '91: 1991 International Conference on Industrial Electronics, Control and Instrumentation.

[18]  K. Moore Multi-loop control approach to designing iterative learning controllers , 1998, Proceedings of the 37th IEEE Conference on Decision and Control (Cat. No.98CH36171).

[19]  Martin A. Riedmiller Neural reinforcement learning to swing-up and balance a real pole , 2005, 2005 IEEE International Conference on Systems, Man and Cybernetics.

[20]  K. Moore,et al.  Iterative Learning Control: Robustness and Monotonic Convergence for Interval Systems , 2010 .

[21]  Si-Zhao Joe Qin,et al.  A two-stage iterative learning control technique combined with real-time feedback for independent disturbance rejection , 2004, Autom..