论文信息 - Hierarchical correlated Q-learning for multi-layer optimal generation command dispatch

Hierarchical correlated Q-learning for multi-layer optimal generation command dispatch

Abstract This paper presents a novel hierarchical correlated Q-learning (HCEQ) algorithm to solve the dynamic optimization of generation command dispatch (GCD) in the Automatic Generation Control (AGC). The GCD problem is to dynamically allocate the total AGC generation command from the central to each individual AGC generator. The proposed HCEQ is a novel multi-agent Q-learning algorithm based on the concept of correlated equilibrium point, and each AGC generator with an agent is to optimize its regulation participation factor and coordinate its decision with others for the overall GCD performance enhancement. In order to cope with the curse of dimensionality in the GCD problem with the increased number of AGC plants involved, a multi-layer optimum GCD framework is developed in this paper. In this hierarchical framework, the multiobjective design and a time-varying coordination factor have been formulated into the reward functions to improve the optimization efficiency and convergence of HCEQ. The application of the proposed approach has been fully verified on the China southern power grid (CSG) model to demonstrate its superior performance and dynamic optimization capability in various power system scenarios.

Tao Yu | Bin Zhou | Ka Wing Chan | X. S. Zhang

[1] J. Nash. NON-COOPERATIVE GAMES , 1951, Classics in Game Theory.

[2] Tao Yu,et al. Equilibrium-Inspired Multiple Group Search Optimization With Synergistic Learning for Multiobjective Electric Power Dispatch , 2013, IEEE Transactions on Power Systems.

[3] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[4] M. Yao,et al. AGC logic based on NERC's new Control Performance Standard and Disturbance Control Standard , 2000 .

[5] N. Jaleeli,et al. NERC's new control performance standards , 1999 .

[6] Yu Xichang,et al. Practical implementation of the SCADA+AGC/ED system of the hunan power pool in the central China power network , 1994 .

[7] Dewen Hu,et al. Multiobjective Reinforcement Learning: A Comprehensive Overview , 2015, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[8] Bart De Schutter,et al. A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[9] Rabindra Kumar Sahu,et al. A novel hybrid PSO-PS optimized fuzzy PI controller for AGC in multi area interconnected power systems , 2015 .

[10] Tao Yu,et al. Distributed multi-step Q(λ) learning for Optimal Power Flow of large-scale power grids , 2012 .

[11] Rahmat-Allah Hooshmand,et al. A NEW PID CONTROLLER DESIGN FOR AUTOMATIC GENERATION CONTROL OF HYDRO POWER SYSTEMS , 2010 .

[12] Michael Wooldridge,et al. Computational Aspects of Cooperative Game Theory (Synthesis Lectures on Artificial Inetlligence and Machine Learning) , 2011 .

[13] Ieee Report,et al. Dynamic Models for Steam and Hydro Turbines in Power System Studies , 1973 .

[14] Tao Yu,et al. Stochastic Optimal Relaxed Automatic Generation Control in Non-Markov Environment Based on Multi-Step $Q(\lambda)$ Learning , 2011, IEEE Transactions on Power Systems.

[15] Lalit Chandra Saikia,et al. Automatic generation control of a multi area hydrothermal system using reinforced learning neural network controller , 2011 .

[16] Ibraheem,et al. Recent philosophies of automatic generation control strategies in power systems , 2005, IEEE Transactions on Power Systems.

[17] L. H. Fink,et al. Understanding automatic generation control , 1992 .

[18] Tao Yu,et al. R(λ) imitation learning for automatic generation control of interconnected power grids , 2012, Autom..

[19] Yongyut Manichaikul. Industrial electric load modeling , 1978 .

[20] Sridhar Mahadevan,et al. Recent Advances in Hierarchical Reinforcement Learning , 2003, Discret. Event Dyn. Syst..

[21] Kanendra Naidu,et al. Application of firefly algorithm with online wavelet filter in automatic generation control of an interconnected reheat thermal power system , 2014 .

[22] D. Ernst,et al. Power systems stability control: reinforcement learning framework , 2004, IEEE Transactions on Power Systems.

[23] Tao Yu,et al. Stochastic optimal generation command dispatch based on improved hierarchical reinforcement learning approach , 2011 .

[24] John N. Tsitsiklis,et al. Asynchronous Stochastic Approximation and Q-Learning , 1994, Machine Learning.