Multi-Agent Correlated Equilibrium Q(λ) Learning for Coordinated Smart Generation Control of Interconnected Power Grids

This paper proposes an optimal coordinated control methodology based on the multi-agent reinforcement learning (MARL) for the multi-area smart generation control (SGC) under the control performance standards (CPS). A new MARL algorithm called correlated Q(λ) learning (CEQ(λ)) is presented to form an optimal joint equilibrium strategy for the coordinated load frequency control of interconnected control areas, and a SGC framework is proposed to facilitate information sharing and strategic interaction among multi-areas so as to enhance the overall long-run performance of the control areas. Furthermore, a novel time-varying equilibrium factor is introduced into the equilibrium selection function to identify the optimum equilibrium policies in various power system operation scenarios. The performance of CEQ(λ) based SGC strategy has been fully tested and benchmarked on a two-area power system and the China Southern Power Grid. Comparative studies have not only demonstrated the superior equilibrium optimization and dynamic performance of the proposed SGC strategy but also confirmed its fast convergence and high flexibility in designing the equilibrium factor for the desirable operating state of correlated equilibria.

[1]  A. R. Oneal,et al.  A simple method for improving control area performance: area control error (ACE) diversity interchange ADI , 1995 .

[2]  Luis Rouco Rodríguez,et al.  The Spanish AGC system: Description and analysis , 2009 .

[3]  B. Wigdorowitz,et al.  A methodology for the redesign of frequency control for AC networks , 2004, IEEE Transactions on Power Systems.

[4]  Manuela M. Veloso,et al.  Multiagent Systems: A Survey from a Machine Learning Perspective , 2000, Auton. Robots.

[5]  Bart De Schutter,et al.  A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[6]  N. Jaleeli,et al.  NERC's new control performance standards , 1999 .

[7]  Michael Wooldridge,et al.  Computational Aspects of Cooperative Game Theory , 2011, KES-AMSTA.

[8]  Keith B. Hall,et al.  Correlated Q-Learning , 2003, ICML.

[9]  Tao Yu,et al.  Stochastic Optimal Relaxed Automatic Generation Control in Non-Markov Environment Based on Multi-Step $Q(\lambda)$ Learning , 2011, IEEE Transactions on Power Systems.

[10]  Jing Peng,et al.  Incremental multi-step Q-learning , 1994, Machine Learning.

[11]  J. Nash NON-COOPERATIVE GAMES , 1951, Classics in Game Theory.

[12]  Fangxing Li,et al.  Next-Generation Monitoring, Analysis, and Control for the Future Smart Control Center , 2010, IEEE Transactions on Smart Grid.

[13]  D. Ernst,et al.  Power systems stability control: reinforcement learning framework , 2004, IEEE Transactions on Power Systems.

[14]  Agostino Poggi,et al.  Developing Multi-agent Systems with JADE , 2007, ATAL.

[15]  S.C. Srivastava,et al.  A decentralized automatic generation control scheme for competitive electricity markets , 2006, IEEE Transactions on Power Systems.

[16]  M. Yao,et al.  AGC logic based on NERC's new control performance standard and disturbance control standard , 2000, 2000 Power Engineering Society Summer Meeting (Cat. No.00CH37134).

[17]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 2005, IEEE Transactions on Neural Networks.

[18]  Hassan Bevrani,et al.  Load–frequency control : a GA-based multi-agent reinforcement learning , 2010 .

[19]  L. H. Fink,et al.  Understanding automatic generation control , 1992 .

[20]  T. Sasaki,et al.  Dynamic analysis of generation control performance standards , 2002, IEEE Power Engineering Society Summer Meeting,.

[21]  Tao Yu,et al.  R(λ) imitation learning for automatic generation control of interconnected power grids , 2012, Autom..

[22]  Ronald J. Williams,et al.  Incremental Multi-Step , 1996 .

[23]  P. S. Nagendra Rao,et al.  A reinforcement learning approach to automatic generation control , 2002 .

[24]  Y. V. Makarov,et al.  Possible improvements of the ACE diversity interchange methodology , 2010, IEEE PES General Meeting.

[25]  Bezalel Peleg,et al.  Correlated equilibria of games with many players , 2000, Int. J. Game Theory.

[26]  S. Mishra,et al.  Maiden application of bacterial foraging-based optimization technique in multiarea automatic generation control , 2012, 2012 IEEE Power and Energy Society General Meeting.

[27]  Ali Keyhani,et al.  Automatic Generation Control Structure for Smart Power Grids , 2012, IEEE Transactions on Smart Grid.

[28]  Ibraheem,et al.  Recent philosophies of automatic generation control strategies in power systems , 2005, IEEE Transactions on Power Systems.

[29]  Takashi Hiyama,et al.  A New Intelligent Agent-Based AGC Design With Real-Time Application , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).