A Learning-Based Solution for an Adversarial Repeated Game in Cyber–Physical Power Systems

Due to the rapidly expanding complexity of the cyber–physical power systems, the probability of a system malfunctioning and failing is increasing. Most of the existing works combining smart grid (SG) security and game theory fail to replicate the adversarial events in the simulated environment close to the real-life events. In this article, a repeated game is formulated to mimic the real-life interactions between the adversaries of the modern electric power system. The optimal action strategies for different environment settings are analyzed. The advantage of the repeated game is that the players can generate actions independent of the previous actions’ history. The solution of the game is designed based on the reinforcement learning algorithm, which ensures the desired outcome in favor of the players. The outcome in favor of a player means achieving higher mixed strategy payoff compared to the other player. Different from the existing game-theoretic approaches, both the attacker and the defender participate actively in the game and learn the sequence of actions applying to the power transmission lines. In this game, we consider several factors (e.g., attack and defense costs, allocated budgets, and the players’ strengths) that could affect the outcome of the game. These considerations make the game close to real-life events. To evaluate the game outcome, both players’ utilities are compared, and they reflect how much power is lost due to the attacks and how much power is saved due to the defenses. The players’ favorable outcome is achieved for different attack and defense strengths (probabilities). The IEEE 39 bus system is used here as the test benchmark. Learned attack and defense strategies are applied in a simulated power system environment (PowerWorld) to illustrate the postattack effects on the system.

[1]  Haibo He,et al.  The sequential attack against power grid networks , 2014, 2014 IEEE International Conference on Communications (ICC).

[2]  Xiaozhe Wang,et al.  Synchrophasor-Based State Estimation for Voltage Stability Monitoring in Power Systems , 2018, 2018 North American Power Symposium (NAPS).

[3]  Lingfeng Wang,et al.  Incorporating Unidentifiable Cyberattacks into Power System Reliability Assessment , 2018, 2018 IEEE Power & Energy Society General Meeting (PESGM).

[4]  Zheng Xu,et al.  Voltage sensitivity analysis based bus voltage regulation in transmission systems with UPFC series converter , 2017, IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics Society.

[5]  Peter Vrancx,et al.  Game Theory and Multi-agent Reinforcement Learning , 2012, Reinforcement Learning.

[6]  Csaba Szepesvári,et al.  A Unified Analysis of Value-Function-Based Reinforcement-Learning Algorithms , 1999, Neural Computation.

[7]  Athanasios V. Vasilakos,et al.  Enhancing smart grid with microgrids: Challenges and opportunities , 2017 .

[8]  Aditya Ashok,et al.  Cyber-physical risk modeling and mitigation for the smart grid using a game-theoretic approach , 2015, 2015 IEEE Power & Energy Society Innovative Smart Grid Technologies Conference (ISGT).

[9]  Boris Bellalta,et al.  A distributed power sharing framework among households in microgrids: a repeated game approach , 2016, Computing.

[10]  Michael L. Littman,et al.  Friend-or-Foe Q-learning in General-Sum Games , 2001, ICML.

[11]  Jian Shen,et al.  Game-Theory-Based Active Defense for Intrusion Detection in Cyber-Physical Embedded Systems , 2016, ACM Trans. Embed. Comput. Syst..

[12]  Walid Saad,et al.  Stochastic Games for Power Grid Protection Against Coordinated Cyber-Physical Attacks , 2018, IEEE Transactions on Smart Grid.

[13]  Ashish Saini,et al.  Clustering based Voltage Control Areas for Localized Reactive Power Management in Deregulated Power System , 2011 .

[14]  Zhen Ni,et al.  A Study of Linear Programming and Reinforcement Learning for One-Shot Game in Smart Grid Security , 2018, 2018 International Joint Conference on Neural Networks (IJCNN).

[15]  Qi Wang,et al.  A two-layer game theoretical attack-defense model for a false data injection attack against power systems , 2019, International Journal of Electrical Power & Energy Systems.

[16]  Seref Sagiroglu,et al.  Smart grid security evaluation with a big data use case , 2018, 2018 IEEE 12th International Conference on Compatibility, Power Electronics and Power Engineering (CPE-POWERENG 2018).

[17]  Erfu Yang,et al.  Multiagent Reinforcement Learning for Multi-Robot Systems: A Survey , 2004 .

[18]  Derong Liu,et al.  Adaptive Dynamic Programming for Discrete-Time Zero-Sum Games , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[19]  Naresh Malla,et al.  Real-time cyber physical system testbed for power system security and control , 2017 .

[20]  Hao Xu,et al.  Deep reinforecement learning based optimal defense for cyber-physical system in presence of unknown cyber-attack , 2017, 2017 IEEE Symposium Series on Computational Intelligence (SSCI).

[21]  Michael L. Littman,et al.  Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.

[22]  Michael P. Wellman,et al.  Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm , 1998, ICML.

[23]  John N. Tsitsiklis,et al.  Convergence rate and termination of asynchronous iterative algorithms , 1989, ICS '89.

[24]  Sylvain Sorin,et al.  Stochastic Games and Applications , 2003 .

[25]  Deepa Kundur,et al.  A Game-Theoretic Analysis of Cyber Switching Attacks and Mitigation in Smart Grid Systems , 2016, IEEE Transactions on Smart Grid.

[26]  Rong Zheng,et al.  Detecting Stealthy False Data Injection Using Machine Learning in Smart Grid , 2017, IEEE Systems Journal.

[27]  Frank L. Lewis,et al.  Off-Policy Reinforcement Learning for Synchronization in Multiagent Graphical Games , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[28]  Athanasios V. Vasilakos,et al.  Impact Assessment of Hypothesized Cyberattacks on Interconnected Bulk Power Systems , 2018, IEEE Transactions on Smart Grid.

[29]  Lingfeng Wang,et al.  A game-theoretic study of load redistribution attack and defense in power systems , 2017 .

[30]  Yanfei Sun,et al.  Strategic Honeypot Game Model for Distributed Denial of Service Attacks in the Smart Grid , 2017, IEEE Transactions on Smart Grid.

[31]  Tianyou Chai,et al.  Online Solution of Two-Player Zero-Sum Games for Continuous-Time Nonlinear Systems With Completely Unknown Dynamics , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[32]  Mihaela van der Schaar,et al.  Demand Side Management in Smart Grids Using a Repeated Game Framework , 2013, IEEE Journal on Selected Areas in Communications.

[33]  Zhen Ni,et al.  Vulnerability analysis for simultaneous attack in smart grid security , 2017, 2017 IEEE Power & Energy Society Innovative Smart Grid Technologies Conference (ISGT).

[34]  Frank L. Lewis,et al.  Optimal and Autonomous Control Using Reinforcement Learning: A Survey , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[35]  M. R. Barzegaran,et al.  Cyber physical renewable energy microgrid: A novel approach to make the power system reliable, resilient and secure , 2016, 2016 IEEE Innovative Smart Grid Technologies - Asia (ISGT-Asia).

[36]  Kenneth A. Loparo,et al.  Cascading Failure Attacks in the Power System: A Stochastic Game Perspective , 2017, IEEE Internet of Things Journal.

[37]  Mohamad Musavi,et al.  Identification of Critical Locations of Power Systems , 2017, 2017 Ninth Annual IEEE Green Technologies Conference (GreenTech).

[38]  Dongbin Zhao,et al.  Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[39]  Wei Sun,et al.  Electrical Distance Approach for Searching Vulnerable Branches During Contingencies , 2018, IEEE Transactions on Smart Grid.

[40]  Haibo He,et al.  Q-Learning-Based Vulnerability Analysis of Smart Grid Against Sequential Topology Attacks , 2017, IEEE Transactions on Information Forensics and Security.

[41]  Zhao Yang Dong,et al.  A Framework for Cyber-Topology Attacks: Line-Switching and New Attack Scenarios , 2019, IEEE Transactions on Smart Grid.

[42]  Haibo He,et al.  Model-Free Adaptive Control for Unknown Nonlinear Zero-Sum Differential Game , 2018, IEEE Transactions on Cybernetics.

[43]  Michael I. Jordan,et al.  MASSACHUSETTS INSTITUTE OF TECHNOLOGY ARTIFICIAL INTELLIGENCE LABORATORY and CENTER FOR BIOLOGICAL AND COMPUTATIONAL LEARNING DEPARTMENT OF BRAIN AND COGNITIVE SCIENCES , 1996 .

[44]  Hao Jan Liu,et al.  Distributed secondary control for isolated microgrids under malicious attacks , 2016, 2016 North American Power Symposium (NAPS).

[45]  Krishnendu Chatterjee,et al.  On Nash Equilibria in Stochastic Games , 2004, CSL.

[46]  Nilanjan Ray Chaudhuri,et al.  Malicious Corruption-Resilient Wide-Area Oscillation Monitoring using Online Robust PCA , 2018, 2018 IEEE Power & Energy Society General Meeting (PESGM).

[47]  Samson Lasaulce,et al.  On the benefits of repeated game models for green cross-layer power control in small cells , 2013, 2013 First International Black Sea Conference on Communications and Networking (BlackSeaCom).

[48]  Zhen Ni,et al.  A Comparative Study of Smart Grid Security Based on Unsupervised Learning and Load Ranking , 2019, 2019 IEEE International Conference on Electro Information Technology (EIT).

[49]  Marcus Johnson,et al.  Approximate $N$ -Player Nonzero-Sum Game Solution for an Uncertain Continuous Nonlinear System , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[50]  Frank L. Lewis,et al.  Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous-Time Multiplayer Nonzero-Sum Games , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[51]  Wayes Tushar,et al.  Multi-Source–Destination Distributed Wireless Networks: Pareto-Efficient Dynamic Power Control Game With Rapid Convergence , 2014, IEEE Transactions on Vehicular Technology.

[52]  Zhen Ni,et al.  A Strategic Analysis of Attacker-Defender Repeated Game in Smart Grid Security , 2019, 2019 IEEE Power & Energy Society Innovative Smart Grid Technologies Conference (ISGT).

[53]  Kashem M. Muttaqi,et al.  A Decentralized Multiagent-Based Voltage Control for Catastrophic Disturbances in a Power System , 2013, IEEE Transactions on Industry Applications.

[54]  Mahmud Fotuhi-Firuzabad,et al.  Application of Game Theory in Reliability-Centered Maintenance of Electric Power Systems , 2017, IEEE Transactions on Industry Applications.

[55]  David K. Y. Yau,et al.  Markov Game Analysis for Attack-Defense of Power Networks Under Possible Misinformation , 2013, IEEE Transactions on Power Systems.

[56]  Haibo He,et al.  Smart Grid Vulnerability under Cascade-Based Sequential Line-Switching Attacks , 2014, 2015 IEEE Global Communications Conference (GLOBECOM).

[57]  Manuela Veloso,et al.  An Analysis of Stochastic Game Theory for Multiagent Reinforcement Learning , 2000 .

[58]  William H. Sanders,et al.  Evaluating Detectors on Optimal Attack Vectors That Enable Electricity Theft and DER Fraud , 2018, IEEE Journal of Selected Topics in Signal Processing.

[59]  Qinglai Wei,et al.  A reinforcement learning approach for sequential decision-making process of attacks in smart grid , 2017, 2017 IEEE Symposium Series on Computational Intelligence (SSCI).

[60]  Maria Lorena Tuballa,et al.  A review of the development of Smart Grid technologies , 2016 .

[61]  Zhen Ni,et al.  A Multistage Game in Smart Grid Security: A Reinforcement Learning Solution , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[62]  Zibin Zheng,et al.  Wide and Deep Convolutional Neural Networks for Electricity-Theft Detection to Secure Smart Grids , 2018, IEEE Transactions on Industrial Informatics.

[63]  Zhen Ni,et al.  Study of Learning of Power Grid Defense Strategy in Adversarial Stage Game , 2019, 2019 IEEE International Conference on Electro Information Technology (EIT).