Self-learning optimal guaranteed cost control of input-affine continuous-time nonlinear systems under uncertain environment

In this paper, we investigate the self-learning optimal guaranteed cost control problem of input-affine continuous-time nonlinear systems possessing dynamical uncertainty. The cost function related to the original uncertain system is discussed sufficiently, with the purpose of developing the optimal guaranteed cost and the corresponding feedback control input. Through theoretical analysis, the optimal guaranteed cost control problem is transformed into designing an optimal controller of the nominal system with a newly defined cost function. The policy iteration algorithm is employed to conduct the learning process and a critic neural network is built, serving as the approximator, to implement the algorithm conveniently. The main idea comes from adaptive dynamic programming (ADP), which is regarded as a self-learning optimal control approach with a certain degree of human brain intelligence. The performance of the control strategy is verified via a simulation example. The established method provides a combination of ADP and robust control design, which enhances the scope of ADP study to nonlinear systems under uncertain environment.

[1]  Qinglai Wei,et al.  Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming , 2012, Autom..

[2]  Li Yu,et al.  An LMI approach to guaranteed cost control of linear uncertain time-delay systems , 1999, Autom..

[3]  Frank L. Lewis,et al.  Adaptive Optimal Control of Unknown Constrained-Input Systems Using Policy Iteration and Neural Networks , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Feng Lin,et al.  Robust Control of Nonlinear Systems: Compensating for Uncertainty , 1990, 1990 American Control Conference.

[5]  Derong Liu,et al.  Neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming , 2014, Inf. Sci..

[6]  Guang-Hong Yang,et al.  Adaptive Actor–Critic Design-Based Integral Sliding-Mode Control for Partially Unknown Nonlinear Systems With Input Disturbances , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[7]  Frank L. Lewis,et al.  Online actor critic algorithm to solve the continuous-time infinite horizon optimal control problem , 2009, 2009 International Joint Conference on Neural Networks.

[8]  Frank L. Lewis,et al.  Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach , 2005, Autom..

[9]  Derong Liu,et al.  Neural-Network-Based Online HJB Solution for Optimal Robust Guaranteed Cost Control of Continuous-Time Uncertain Nonlinear Systems , 2014, IEEE Transactions on Cybernetics.

[10]  P. Werbos,et al.  Beyond Regression : "New Tools for Prediction and Analysis in the Behavioral Sciences , 1974 .

[11]  Haibo He,et al.  GrDHP: A General Utility Function Representation for Dual Heuristic Dynamic Programming , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[12]  Haibo He,et al.  Power System Stability Control for a Wind Farm Based on Adaptive Dynamic Programming , 2015, IEEE Transactions on Smart Grid.

[13]  Derong Liu,et al.  Data-based Self-learning Optimal Control: Research Progress and Prospects , 2013 .

[14]  C. Mu,et al.  A New Finite Time Convergence Condition for Super‐Twisting Observer Based on Lyapunov Analysis , 2015 .

[15]  Xiong Yang,et al.  Online approximate solution of HJI equation for unknown constrained-input nonlinear continuous-time systems , 2016, Inf. Sci..

[16]  Randal W. Beard,et al.  Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation , 1997, Autom..

[17]  T. K. C. Peng,et al.  Adaptive Guaranteed Cost of Control of Systems with Uncertain Parameters , 1970 .

[18]  Derong Liu,et al.  A neural-network-based online optimal control approach for nonlinear robust decentralized stabilization , 2016, Soft Comput..

[19]  Indra Narayan Kar,et al.  Bounded robust control of nonlinear systems using neural network–based HJB solution , 2011, IEEE Transactions on Automation Science and Engineering.

[20]  Derong Liu,et al.  Data-Based Adaptive Critic Designs for Nonlinear Robust Optimal Control With Uncertain Dynamics , 2016, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[21]  Dongbin Zhao,et al.  MEC—A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[22]  F. Lewis,et al.  Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers , 2012, IEEE Control Systems.

[23]  Changyin Sun,et al.  Fast sliding mode control on air-breathing hypersonic vehicles with transient response analysis , 2016, J. Syst. Control. Eng..

[24]  Tingwen Huang,et al.  Data-based approximate policy iteration for affine nonlinear continuous-time optimal control design , 2014, Autom..

[25]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[26]  Derong Liu,et al.  Policy Iteration Algorithm for Online Design of Robust Control for a Class of Continuous-Time Nonlinear Systems , 2014, IEEE Transactions on Automation Science and Engineering.

[27]  Qing-Long Han,et al.  Optimal guaranteed cost control of linear uncertain systems with input constraints , 2004, Fifth World Congress on Intelligent Control and Automation (IEEE Cat. No.04EX788).

[28]  W. Haddad,et al.  Nonlinear Dynamical Systems and Control: A Lyapunov-Based Approach , 2008 .

[29]  Haibo He,et al.  A neural network based online learning and control approach for Markov jump systems , 2015, Neurocomputing.