Paper information - Critic-only adaptive dynamic programming algorithms' applications to the secure control of cyber-physical systems.

Industrial cyber-physical systems are commonly exposed to malicious attacks and unmatched perturbations, so security is a core research topic in the field. This paper proposes a novel intelligent secure control scheme that integrates optimal control theory, zero-sum game theory, reinforcement learning, and neural networks. First, the secure control problem of the compromised system is converted into a zero-sum game problem for the nominal auxiliary system; then both policy-iteration-based and value-iteration-based adaptive dynamic programming methods are introduced to solve the Hamilton-Jacobi-Isaacs equations. The proposed scheme mitigates the effects of actuator attacks and unmatched perturbations and, through tuning of the system performance parameters, stabilizes the compromised cyber-physical system, as proved via Lyapunov stability theory. Finally, the approach is applied to a Quanser helicopter to verify its effectiveness.
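
To make the policy-iteration idea concrete, the following is a minimal sketch on a toy problem, not the paper's critic neural network or its helicopter model. It assumes a scalar linear system x' = a*x + b*u + d*w, where u is the control and w plays the role of the attack or perturbation, with cost integrand q*x^2 + r*u^2 - gamma^2*w^2. Under these assumptions the value function is V(x) = p*x^2 and the Hamilton-Jacobi-Isaacs equation reduces to the scalar equation 2*a*p + q - (b*p)^2/r + (d*p)^2/gamma^2 = 0. All names and numerical values (solve_scalar_zero_sum_game, hji_residual, a, b, d, q, r, gamma) are hypothetical and chosen only for illustration.

```python
import math


def solve_scalar_zero_sum_game(a, b, d, q, r, gamma, iterations=20):
    """Simultaneous policy iteration for x' = a*x + b*u + d*w, V(x) = p*x**2.

    Illustrative assumption: a < 0, so the zero initial policies are admissible.
    """
    k = 0.0  # control gain, u = -k*x
    l = 0.0  # attack/disturbance gain, w = l*x
    p = 0.0
    for _ in range(iterations):
        a_cl = a - b * k + d * l  # closed-loop dynamics under current policies
        if a_cl >= 0:
            raise ValueError("closed loop is not stable; evaluation undefined")
        # Policy evaluation: 2*a_cl*p + q + r*k**2 - gamma**2*l**2 = 0
        p = -(q + r * k**2 - gamma**2 * l**2) / (2.0 * a_cl)
        # Policy improvement for both players
        k = b * p / r
        l = d * p / gamma**2
    return p, k, l


def hji_residual(p, a, b, d, q, r, gamma):
    """Left-hand side of the scalar HJI equation for V(x) = p*x**2."""
    return 2.0 * a * p + q - (b * p) ** 2 / r + (d * p) ** 2 / gamma**2


if __name__ == "__main__":
    params = dict(a=-1.0, b=1.0, d=1.0, q=1.0, r=1.0, gamma=2.0)
    p, k, l = solve_scalar_zero_sum_game(**params)
    print("p =", p, "k =", k, "l =", l)
    print("HJI residual =", hji_residual(p, **params))
```

For these particular values the scalar equation can be solved in closed form, p = (sqrt(7) - 2) / 1.5, which gives a simple check on the iteration. In the nonlinear setting described in the abstract, the scalar p would be replaced by a critic approximator of the value function, and convergence would depend on conditions this sketch does not address.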
