State and output feedback-based adaptive optimal control of nonlinear continuous-time systems in strict feedback form

This paper focuses on neural network (NN) based adaptive optimal control of nonlinear continuous-time systems in strict feedback form with known dynamics. A single NN is utilized to learn the infinite horizon cost function which is the solution to the Hamilton-Jacobi-Bellman (HJB) equation in continuous-time. The corresponding optimal control input that minimizes the HJB equation is calculated in a forward-in-time manner without using value and policy iterations. First, the optimal control problem is solved in a generic multi input and multi output (MIMO) nonlinear system in strict feedback form with a state feedback approach. Then, the approach is extended to single input and single output (SISO) nonlinear system in strict feedback form by using output feedback via a nonlinear observer. Lyapunov techniques are used to show that all signals are uniformly ultimately bounded (UUB) and that the approximated control signals approach the optimal control inputs with small bounded error. In the absence of NN reconstruction errors, asymptotic convergence to the optimal control input is demonstrated. Finally, a simulation example is provided to validate the theoretical results for the output feedback controller design.

[1]  Warren B. Powell,et al.  Handbook of Learning and Approximate Dynamic Programming , 2006, IEEE Transactions on Automatic Control.

[2]  Frank L. Lewis,et al.  Adaptive optimal control for continuous-time linear systems based on policy iteration , 2009, Autom..

[3]  S. Jagannathan,et al.  Optimal control of affine nonlinear continuous-time systems using an online Hamilton-Jacobi-Isaacs formulation , 2010, 49th IEEE Conference on Decision and Control (CDC).

[4]  Draguna Vrabie,et al.  Adaptive optimal controllers based on Generalized Policy Iteration in a continuous-time framework , 2009, 2009 17th Mediterranean Conference on Control and Automation.

[5]  M. Krstić,et al.  Optimal design of adaptive tracking controllers for nonlinear systems , 1997, Proceedings of the 1997 American Control Conference (Cat. No.97CH36041).

[6]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[7]  T. Dierks,et al.  Optimal control of affine nonlinear discrete-time systems , 2009, 2009 17th Mediterranean Conference on Control and Automation.

[8]  John T. Wen,et al.  Improving the performance of stabilizing controls for nonlinear systems , 1996 .

[9]  Shuzhi Sam Ge,et al.  Adaptive neural network control for strict-feedback nonlinear systems using backstepping design , 1999, Proceedings of the 1999 American Control Conference (Cat. No. 99CH36251).

[10]  Miroslav Krstic,et al.  Optimal design of adaptive tracking controllers for non-linear systems , 1997, Autom..