A self-tuning optimal controller for affine nonlinear continuous-time systems with unknown internal dynamics

This paper presents a novel neural network (NN) - based self-tuning controller for the optimal regulation of affine nonlinear continuous-time systems. Knowledge of the internal system dynamics is not required whereas the control coefficient matrix is considered to be available. The proposed nonlinear optimal regulator tunes itself in order to simultaneously learn the optimal control input, optimal cost function, and the system internal dynamics using a single NN. A novel NN weight tuning algorithm is derived which ensures the internal system dynamics are learned while simultaneously minimizing a predefined cost function. An initial stabilizing controller is not required. Lyapunov methods are used to show that all signals are uniformly ultimately bounded (UUB). In the absence of NN reconstruction errors, the approximated control input is shown to converge to the optimal control asymptotically for the regulator design, and simulation results illustrate the effectiveness of the approach.

[1]  Kurt Hornik,et al.  Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks , 1990, Neural Networks.

[2]  Frank L. Lewis,et al.  Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations , 2011, Autom..

[3]  Jennie Si,et al.  Handbook of Learning and Approximate Dynamic Programming (IEEE Press Series on Computational Intelligence) , 2004 .

[4]  Warren B. Powell,et al.  Handbook of Learning and Approximate Dynamic Programming , 2006, IEEE Transactions on Automatic Control.

[5]  Anuradha M. Annaswamy,et al.  Stable Adaptive Systems , 1989 .

[6]  Sarangapani Jagannathan,et al.  Optimal tracking control of affine nonlinear discrete-time systems with unknown internal dynamics , 2009, Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference.

[7]  R. Stephenson A and V , 1962, The British journal of ophthalmology.

[8]  Sarangapani Jagannathan,et al.  Online Optimal Control of Affine Nonlinear Discrete-Time Systems With Unknown Internal Dynamics by Using Time-Based Policy Update , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[9]  Frank L. Lewis,et al.  Neural Network Control Of Robot Manipulators And Non-Linear Systems , 1998 .

[10]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[11]  John C. Doyle,et al.  Nonlinear Games: examples and counterexamples "It would be more impressive if it flowed the other way. " Oscar Wilde at Niagara Fulls , 1996 .

[12]  Frank L. Lewis,et al.  2009 Special Issue: Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems , 2009 .