Paper Information - Overview of Robust Adaptive Critic Control Design

Overview of Robust Adaptive Critic Control Design

Adaptive dynamic programming (ADP) and reinforcement learning are closely related approaches to intelligent optimization. Both are regarded as promising methods built on the key components of evaluation and improvement, against a background of information technologies such as artificial intelligence, big data, and deep learning. Although great progress has been achieved and surveyed in addressing nonlinear optimal control problems, research on the robustness of ADP-based control strategies in uncertain environments has not been fully summarized. Hence, this chapter reviews the main recent results on adaptive-critic-based robust control design for continuous-time nonlinear systems. ADP-based nonlinear optimal regulation is reviewed first, followed by robust stabilization of nonlinear systems with matched uncertainties, guaranteed cost control design for unmatched plants, and decentralized stabilization of interconnected systems. Further discussions are then presented, covering event-based robust control design, improvement of the critic learning rule, nonlinear $H_{\infty}$ control design, and several notes on future perspectives. This overview is intended to promote the development of adaptive critic control methods with robustness guarantees and the construction of higher-level intelligent systems.

Chaoxu Mu | Ding Wang
