Observer-Based Adaptive Control of Uncertain Nonlinear Systems Via Neural Networks

In this paper, a novel observer-based control strategy is proposed for a class of uncertain continuous-time nonlinear systems based on the Hamilton–Jacobi–Bellman (HJB) equation. Due to the complexity of nonlinear systems, the approximately optimal control for affine uncertain continuous-time nonlinear systems is pursued. Considering that only the output variables can be measured in the control practice, the state observer is designed to reconstruct all system states by using the output variables. The observer-based policy iteration algorithm can solve the HJB equation within the adaptive dynamic programming framework for the unknown-state uncertain nonlinear systems, where a critic neural network is constructed to approximate the optimal cost function, and then, the approximate expression of the optimal control policy can be directly derived from solving the HJB equation. In addition, the stability of the whole closed-loop system is provided based on the Lyapunov analysis.

[1]  Derong Liu,et al.  Neural-network-observer-based optimal control for unknown nonlinear systems using adaptive dynamic programming , 2013, Int. J. Control.

[2]  Haibo He,et al.  Air-Breathing Hypersonic Vehicle Tracking Control Based on Adaptive Dynamic Programming , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[3]  Changyin Sun,et al.  Adaptive Neural Network Control of a Flapping Wing Micro Aerial Vehicle With Disturbance Observer , 2017, IEEE Transactions on Cybernetics.

[4]  Xiong Yang,et al.  Online approximate solution of HJI equation for unknown constrained-input nonlinear continuous-time systems , 2016, Inf. Sci..

[5]  Zhongke Shi,et al.  DOB-Based Neural Control of Flexible Hypersonic Flight Vehicle Considering Wind Effects , 2017, IEEE Transactions on Industrial Electronics.

[6]  Haibo He,et al.  Fuzzy-Based Goal Representation Adaptive Dynamic Programming , 2016, IEEE Transactions on Fuzzy Systems.

[7]  Frank L. Lewis,et al.  A dynamic recurrent neural-network-based adaptive observer for a class of nonlinear systems , 1997, Autom..

[8]  Haibo He,et al.  Dynamic Behavior of Terminal Sliding Mode Control , 2018, IEEE Transactions on Industrial Electronics.

[9]  Zhongke Shi,et al.  Reinforcement Learning Output Feedback NN Control Using Deterministic Learning Technique , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[10]  Derong Liu,et al.  An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs , 2013, Inf. Sci..

[11]  Haibo He,et al.  Optimal Control for Unknown Discrete-Time Nonlinear Markov Jump Systems Using Adaptive Dynamic Programming , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[12]  Zhong-Ping Jiang,et al.  Robust Adaptive Dynamic Programming , 2017 .

[13]  Changyin Sun,et al.  Robust adaptive critic control design with network-based event-triggered formulation , 2017, Nonlinear Dynamics.

[14]  Paul J. Werbos,et al.  Approximate dynamic programming for real-time control and neural modeling , 1992 .

[15]  Frank L. Lewis,et al.  2009 Special Issue: Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems , 2009 .

[16]  Jennie Si,et al.  Online learning control by association and reinforcement , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[17]  Shaocheng Tong,et al.  Fuzzy Approximation-Based Adaptive Backstepping Optimal Control for a Class of Nonlinear Discrete-Time Systems With Dead-Zone , 2016, IEEE Transactions on Fuzzy Systems.

[18]  Roberto A. Santiago,et al.  Adaptive critic designs: A case study for neurocontrol , 1995, Neural Networks.

[19]  Frank L. Lewis,et al.  Online actor critic algorithm to solve the continuous-time infinite horizon optimal control problem , 2009, 2009 International Joint Conference on Neural Networks.

[20]  Qichao Zhang,et al.  Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics , 2016, IEEE Transactions on Cybernetics.

[21]  Haibo He,et al.  Goal Representation Heuristic Dynamic Programming on Maze Navigation , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[22]  Huaguang Zhang,et al.  Neural-Network-Based Near-Optimal Control for a Class of Discrete-Time Affine Nonlinear Systems With Control Constraints , 2009, IEEE Transactions on Neural Networks.

[23]  Huaguang Zhang,et al.  Near-Optimal Control for Nonzero-Sum Differential Games of Continuous-Time Nonlinear Systems Using Single-Network ADP , 2013, IEEE Transactions on Cybernetics.

[24]  Haibo He,et al.  Event-Driven Adaptive Robust Control of Nonlinear Systems With Uncertainties Through NDP Strategy , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[25]  Lyle Noakes,et al.  Continuous-Time Adaptive Critics , 2007, IEEE Transactions on Neural Networks.

[26]  Derong Liu,et al.  Adaptive Dynamic Programming for Control: Algorithms and Stability , 2012 .

[27]  Yu Guo,et al.  Online adaptive optimal control for continuous-time nonlinear systems with completely unknown dynamics , 2016, Int. J. Control.

[28]  Haibo He,et al.  A three-network architecture for on-line learning and optimization based on adaptive dynamic programming , 2012, Neurocomputing.

[29]  Kurt Hornik,et al.  Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks , 1990, Neural Networks.

[30]  Haibo He,et al.  Adaptive Critic Nonlinear Robust Control: A Survey , 2017, IEEE Transactions on Cybernetics.

[31]  Haibo He,et al.  Data-Driven Tracking Control With Adaptive Dynamic Programming for a Class of Continuous-Time Nonlinear Systems , 2017, IEEE Transactions on Cybernetics.

[32]  Heidar Ali Talebi,et al.  A stable neural network-based observer with application to flexible-joint manipulators , 2006, IEEE Transactions on Neural Networks.

[33]  Changyin Sun,et al.  Neural Network Control of a Flexible Robotic Manipulator Using the Lumped Spring-Mass Model , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[34]  Wei-Song Lin,et al.  Optimization of Train Regulation and Energy Usage of Metro Lines Using an Adaptive-Optimal-Control Algorithm , 2011, IEEE Transactions on Automation Science and Engineering.

[35]  Frank L. Lewis,et al.  Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach , 2005, Autom..

[36]  Zhong-Ping Jiang,et al.  Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics , 2012, Autom..

[37]  Changyin Sun,et al.  Adaptive Neural Impedance Control of a Robotic Manipulator With Input Saturation , 2016, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[38]  Huaguang Zhang,et al.  Asymptotic tracking control scheme for mechanical systems with external disturbances and friction , 2010, Neurocomputing.

[39]  Sarangapani Jagannathan,et al.  Online Optimal Control of Affine Nonlinear Discrete-Time Systems With Unknown Internal Dynamics by Using Time-Based Policy Update , 2012, IEEE Transactions on Neural Networks and Learning Systems.