Design of an iterative learning control with a selective learning strategy for swinging up a pendulum

Swinging up a pendulum on a cart is a well-known demonstration example for trajectory tracking in a nonlinear system. The standard realtime feedback control approach fails if the plant output is not available in real time, e.g. due to large or variable measurement delays. However, the task can be solved in multiple trials by applying feedforward inputs that are improved from trial to trial by Iterative Learning Control (ILC). Our examination demonstrates that an ILC can be used for trajectory tracking close to the singularities and the unstable equilibrium of a non-linear system. Specifically, we present an ILC algorithm for pendulum swing-up by angle trajectory tracking. The controller design is based on a modified plant inversion approach that restricts the learning process to trajectory segments with small tracking errors and sufficient input sensitivity. We show that these restrictions lead to improved learning progress in contrast to conventional learning from the complete trajectory. Controller performance is evaluated in an experimental testbed. The ILC starts from a zero-input trajectory and learns to swing up the pendulum within six iterations. Robustness is analyzed experimentally, and the performance is compared to literature results. The convergence is at least two orders of magnitude faster than the one achieved by other methods that avoid feedback and do not rely on a suitable initial input trajectory.

[1]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[2]  Michael J. Grimble,et al.  Iterative Learning Control for Deterministic Systems , 1992 .

[3]  A.G. Alleyne,et al.  A survey of iterative learning control , 2006, IEEE Control Systems.

[4]  Kwang-Hyun Park,et al.  A generalized iterative learning controller against initial state error , 2000 .

[5]  Thomas Schauer,et al.  Iterative Learning Control and System Identification of the Antagonistic Knee Muscle Complex During Gait Using Functional Electrical Stimulation , 2017 .

[6]  E. Rogers,et al.  Experimentally verified generalized KYP Lemma based iterative learning control design , 2016 .

[7]  Thomas Seel,et al.  Iterative Learning Control with variable pass length applied to trajectory tracking on a crane with output constraints , 2013, 52nd IEEE Conference on Decision and Control.

[8]  David Silver,et al.  Memory-based control with recurrent neural networks , 2015, ArXiv.

[9]  K. Moore,et al.  Iterative learning control: brief survey and categorization 1998− 2004 , 2006 .

[10]  Engineering,et al.  Hybrid iterative learning control design : Application to inverted pendulum , 2014 .

[11]  Richard W. Longman,et al.  Simple learning control made practical by zero-phase filtering: applications to robotics , 2002 .

[12]  Jörg Raisch,et al.  Iterative learning control of a drop foot neuroprosthesis — Generating physiological foot motion in paretic gait by automatic feedback control , 2016 .

[13]  Martin A. Riedmiller,et al.  Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images , 2015, NIPS.

[14]  Peter Dayan,et al.  Bayes-Adaptive Simulation-based Search with Value Function Approximation , 2014, NIPS.

[15]  Krzysztof Galkowski,et al.  Control Systems Theory and Applications for Linear Repetitive Processes - Recent Progress and Open Research Questions , 2007 .

[16]  Thomas Seel,et al.  The adaptive drop foot stimulator - Multivariable learning control of foot pitch and roll motion in paretic gait. , 2016, Medical engineering & physics.

[17]  Eyal Dassau,et al.  Closed-Loop Control of Artificial Pancreatic $\beta$ -Cell in Type 1 Diabetes Mellitus Using Model Predictive Iterative Learning Control , 2010, IEEE Transactions on Biomedical Engineering.

[18]  Wojciech Paszke,et al.  On the design of ILC schemes for finite frequency range tracking specifications , 2010, 49th IEEE Conference on Decision and Control (CDC).

[19]  Thomas Seel,et al.  Monotonic convergence of iterative learning control systems with variable pass length , 2017, Int. J. Control.

[20]  Thomas Seel,et al.  Iterative Learning Cascade Control of Continuous Noninvasive Blood Pressure Measurement , 2013, 2013 IEEE International Conference on Systems, Man, and Cybernetics.

[21]  Svante Gunnarsson,et al.  Time and frequency domain convergence properties in iterative learning control , 2002 .

[22]  Katsuhisa Furuta,et al.  Swinging up a pendulum by energy control , 1996, Autom..

[23]  Eric Rogers,et al.  Iterative learning control of FES applied to the upper extremity for rehabilitation , 2009 .

[24]  E. Rogers,et al.  Iterative learning control using optimal feedback and feedforward actions , 1996 .

[25]  Raffaello D'Andrea,et al.  Optimization-based iterative learning control for trajectory tracking , 2009, 2009 European Control Conference (ECC).

[26]  B. Buckingham,et al.  Closed-loop control in type 1 diabetes. , 2016, The lancet. Diabetes & endocrinology.