Critic-only adaptive dynamic programming algorithms' applications to the secure control of cyber-physical systems.

Industrial cyber-physical systems generally suffer from the malicious attacks and unmatched perturbation, and thus the security issue is always the core research topic in the related fields. This paper proposes a novel intelligent secure control scheme, which integrates optimal control theory, zero-sum game theory, reinforcement learning and neural networks. First, the secure control problem of the compromised system is converted into the zero-sum game issue of the nominal auxiliary system, and then both policy-iteration-based and value-iteration-based adaptive dynamic programming methods are introduced to solve the Hamilton-Jacobi-Isaacs equations. The proposed secure control scheme can mitigate the effects of actuator attacks and unmatched perturbation, and stabilize the compromised cyber-physical systems by tuning the system performance parameters, which is proved through the Lyapunov stability theory. Finally, the proposed approach is applied to the Quanser helicopter to verify the effectiveness.

[1]  Huai-Ning Wu,et al.  Neural Network Based Online Simultaneous Policy Update Algorithm for Solving the HJI Equation in Nonlinear $H_{\infty}$ Control , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[2]  Guang-Hong Yang,et al.  Improved adaptive resilient control against sensor and actuator attacks , 2018, Inf. Sci..

[3]  Haibo He,et al.  Learning Without External Reward [Research Frontier] , 2018, IEEE Computational Intelligence Magazine.

[4]  Haibo He,et al.  Optimal Control for Unknown Discrete-Time Nonlinear Markov Jump Systems Using Adaptive Dynamic Programming , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[5]  Derong Liu,et al.  Event-Based Constrained Robust Control of Affine Systems Incorporating an Adaptive Critic Mechanism , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[6]  Derong Liu,et al.  Output Tracking Control Based on Adaptive Dynamic Programming With Multistep Policy Evaluation , 2019, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[7]  Haibo He,et al.  Model-Free Adaptive Control for Unknown Nonlinear Zero-Sum Differential Game , 2018, IEEE Transactions on Cybernetics.

[8]  Derong Liu,et al.  Adaptive Constrained Optimal Control Design for Data-Based Nonlinear Discrete-Time Systems With Critic-Only Structure , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[9]  Frank L. Lewis,et al.  $ {H}_{ {\infty }}$ Tracking Control of Completely Unknown Continuous-Time Systems via Off-Policy Reinforcement Learning , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[10]  Huaguang Zhang,et al.  Optimal Guaranteed Cost Sliding Mode Control for Constrained-Input Nonlinear Systems With Matched and Unmatched Disturbances , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[11]  Huai‐Ning Wu,et al.  Computationally efficient simultaneous policy update algorithm for nonlinear H∞ state feedback control with Galerkin's method , 2013 .

[12]  Guang-Hong Yang,et al.  Secure State Estimation Against Sparse Sensor Attacks With Adaptive Switching Mechanism , 2018, IEEE Transactions on Automatic Control.

[13]  Sing Kiong Nguang,et al.  Distributed Filtering for Discrete-Time T–S Fuzzy Systems With Incomplete Measurements , 2018, IEEE Transactions on Fuzzy Systems.

[14]  Haibo He,et al.  Self-learning robust optimal control for continuous-time nonlinear systems with mismatched disturbances , 2018, Neural Networks.

[15]  Huaguang Zhang,et al.  Disturbance observer based fault estimation and dynamic output feedback fault tolerant control for fuzzy systems with local nonlinear models. , 2015, ISA transactions.

[16]  Tingwen Huang,et al.  Off-Policy Reinforcement Learning for $ H_\infty $ Control Design , 2013, IEEE Transactions on Cybernetics.

[17]  Fei-Yue Wang,et al.  Scanning the Issue and Beyond: Computational Transportation and Transportation 5.0 , 2014, IEEE Trans. Intell. Transp. Syst..

[18]  Haibo He,et al.  Adaptive Dynamic Programming for Robust Regulation and Its Application to Power Systems , 2018, IEEE Transactions on Industrial Electronics.

[19]  Yu Liu,et al.  Optimal constrained self-learning battery sequential management in microgrid via adaptive dynamic programming , 2017, IEEE/CAA Journal of Automatica Sinica.

[20]  Haibo He,et al.  Reinforcement learning for robust adaptive control of partially unknown nonlinear systems subject to unmatched uncertainties , 2018, Inf. Sci..

[21]  Qinglai Wei,et al.  Data-Driven Zero-Sum Neuro-Optimal Control for a Class of Continuous-Time Unknown Nonlinear Systems With Disturbance Using ADP , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[22]  Kun Zhang,et al.  Value iteration based integral reinforcement learning approach for H∞ controller design of continuous-time nonlinear systems , 2018, Neurocomputing.

[23]  Huai-Ning Wu,et al.  Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control , 2017, IEEE Transactions on Cybernetics.

[24]  Huaguang Zhang,et al.  General value iteration based single network approach for constrained optimal controller design of partially-unknown continuous-time nonlinear systems , 2018, J. Frankl. Inst..

[25]  Derong Liu,et al.  Data-Based Adaptive Critic Designs for Nonlinear Robust Optimal Control With Uncertain Dynamics , 2016, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[26]  Tingwen Huang,et al.  Model-Free Optimal Tracking Control via Critic-Only Q-Learning , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[27]  Haibo He,et al.  Data-Driven Tracking Control With Adaptive Dynamic Programming for a Class of Continuous-Time Nonlinear Systems , 2017, IEEE Transactions on Cybernetics.

[28]  Frank L. Lewis,et al.  Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous-Time Multiplayer Nonzero-Sum Games , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[29]  Qichao Zhang,et al.  Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics , 2016, IEEE Transactions on Cybernetics.

[30]  Dong Yue,et al.  Fault Estimation Observer Design for Discrete-Time Takagi–Sugeno Fuzzy Systems Based on Homogenous Polynomially Parameter-Dependent Lyapunov Functions , 2017, IEEE Transactions on Cybernetics.

[31]  Haibo He,et al.  Intelligent Critic Control With Disturbance Attenuation for Affine Dynamics Including an Application to a Microgrid System , 2017, IEEE Transactions on Industrial Electronics.

[32]  Tingwen Huang,et al.  Optimal Output Regulation for Model-Free Quanser Helicopter With Multistep Q-Learning , 2018, IEEE Transactions on Industrial Electronics.

[33]  F.L. Lewis,et al.  Reinforcement learning and adaptive dynamic programming for feedback control , 2009, IEEE Circuits and Systems Magazine.

[34]  Guang-Hong Yang,et al.  Adaptive Actor–Critic Design-Based Integral Sliding-Mode Control for Partially Unknown Nonlinear Systems With Input Disturbances , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[35]  Tansel Yucelen,et al.  An Adaptive Control Architecture for Mitigating Sensor and Actuator Attacks in Cyber-Physical Systems , 2017, IEEE Transactions on Automatic Control.

[36]  Haibo He,et al.  An Event-Triggered ADP Control Approach for Continuous-Time System With Unknown Internal States , 2017, IEEE Transactions on Cybernetics.

[37]  Dan Zhang,et al.  Robust Fuzzy-Model-Based Filtering for Nonlinear Cyber-Physical Systems With Multiple Stochastic Incomplete Measurements , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[38]  Frank L. Lewis,et al.  Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem , 2010, Autom..

[39]  Derong Liu,et al.  Adaptive $Q$ -Learning for Data-Based Optimal Output Regulation With Experience Replay , 2018, IEEE Transactions on Cybernetics.

[40]  F. Lewis,et al.  Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers , 2012, IEEE Control Systems.

[41]  Fei-Yue Wang,et al.  The Emergence of Intelligent Enterprises: From CPS to CPSS , 2010, IEEE Intelligent Systems.

[42]  Derong Liu,et al.  Adaptive Dynamic Programming for Discrete-Time Zero-Sum Games , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[43]  Dan Zhang,et al.  Asynchronous State Estimation for Discrete-Time Switched Complex Networks With Communication Constraints , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[44]  Derong Liu,et al.  Neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming , 2014, Inf. Sci..