Distributed Economic Dispatch in Microgrids Based on Cooperative Reinforcement Learning

Microgrids that integrate distributed generation (DG) units and energy storage (ES) devices are expected to play an increasingly important role in future power systems. Yet achieving efficient distributed economic dispatch in microgrids is challenging because of the randomness and nonlinear characteristics of DG units and loads. This paper proposes a cooperative reinforcement learning algorithm for distributed economic dispatch in microgrids. Using a learning algorithm avoids both the difficulty of stochastic modeling and high computational complexity. Within the cooperative reinforcement learning algorithm, function approximation is used to handle large, continuous state spaces, and a diffusion strategy is incorporated to coordinate the actions of the DG units and ES devices. With the proposed algorithm, each node in the microgrid needs to communicate only with its local neighbors, without relying on any centralized controller. The convergence of the algorithm is analyzed, and simulations based on real-world meteorological and load data are conducted to validate its performance.
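
The two ingredients named above, function approximation and a diffusion strategy, can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes a plain TD(0) update with a linear value function, Gaussian radial basis features over a scalar state, a three-node line graph, uniform combination weights, and a toy random-walk state with a quadratic cost. All function names, parameters, and numeric values are hypothetical and chosen only for illustration.

```python
import math
import random

def rbf_features(state, centers, width=0.25):
    """Gaussian radial basis features of a scalar state (assumed feature map)."""
    return [math.exp(-((state - c) ** 2) / (2 * width ** 2)) for c in centers]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def adapt(w, phi, phi_next, reward, alpha=0.1, gamma=0.9):
    """Local TD(0) step for a linear value estimate v(s) = w . phi(s)."""
    td_error = reward + gamma * dot(w, phi_next) - dot(w, phi)
    return [wi + alpha * td_error * pi for wi, pi in zip(w, phi)]

def combine(intermediate, neighbors):
    """Diffusion step: each node averages its own intermediate weights
    with those of its direct neighbors (uniform weights, an assumption)."""
    combined = []
    for k, nbrs in enumerate(neighbors):
        group = [k] + list(nbrs)
        dim = len(intermediate[k])
        combined.append(
            [sum(intermediate[l][i] for l in group) / len(group) for i in range(dim)]
        )
    return combined

def diffusion_td(num_steps=200, seed=0):
    """Adapt-then-combine loop on a toy problem; no central controller is used."""
    rng = random.Random(seed)
    centers = [0.0, 0.5, 1.0]
    neighbors = [[1], [0, 2], [1]]  # hypothetical line graph: 0 - 1 - 2
    weights = [[0.0] * len(centers) for _ in neighbors]
    state = 0.5
    for _ in range(num_steps):
        # Toy dynamics: a bounded random walk stands in for the microgrid state.
        next_state = min(1.0, max(0.0, state + rng.uniform(-0.1, 0.1)))
        phi = rbf_features(state, centers)
        phi_next = rbf_features(next_state, centers)
        intermediate = []
        for w in weights:
            # Each node sees its own noisy reward (negative toy cost).
            reward = -((state - 0.5) ** 2) + rng.gauss(0.0, 0.01)
            intermediate.append(adapt(w, phi, phi_next, reward))
        weights = combine(intermediate, neighbors)
        state = next_state
    return weights
```

Calling `diffusion_td()` returns one weight vector per node. Because each node only reads the intermediate weights of its listed neighbors, the communication pattern matches the neighbor-only exchange described in the abstract, even though the learning rule here is a simplified stand-in.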
