Fully Decentralized Reinforcement Learning-Based Control of Photovoltaics in Distribution Grids for Joint Provision of Real and Reactive Power

In this paper, we introduce a new framework to address the problem of voltage regulation in unbalanced distribution grids with deep photovoltaic penetration. In this framework, both real and reactive power setpoints are explicitly controlled at each solar panel smart inverter, and the objective is to simultaneously minimize system-wide voltage deviation and maximize solar power output. We formulate the problem as a Markov decision process with continuous action spaces and use proximal policy optimization, a reinforcement learning-based approach, to solve it, without the need for any forecast or explicit knowledge of network topology or line parameters. By representing the system in a quasi-steady state manner, and by carefully formulating the Markov decision process, we reduce the complexity of the problem and allow for fully decentralized (communication-free) policies, all of which make the trained policies much more practical and interpretable. Numerical simulations on a 240-node unbalanced distribution grid, based on a real network in Midwest U.S., are used to validate the proposed framework and reinforcement learning approach.

[1]  Santiago Grijalva,et al.  Improving distribution network PV hosting capacity via smart inverter reactive power support , 2015, 2015 IEEE Power & Energy Society General Meeting.

[2]  Hanchen Xu,et al.  Optimal Tap Setting of Voltage Regulation Transformers Using Batch Reinforcement Learning , 2018, IEEE Transactions on Power Systems.

[3]  Wei Zhang,et al.  Multiagent-Based Reinforcement Learning for Optimal Reactive Power Dispatch , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[4]  Steven H. Low,et al.  Optimal inverter VAR control in distribution systems with high PV penetration , 2011, 2012 IEEE Power and Energy Society General Meeting.

[5]  Sergey Levine,et al.  Trust Region Policy Optimization , 2015, ICML.

[6]  Kaveh Dehghanpour,et al.  A Time-Series Distribution Test System Based on Real Utility Data , 2019, 2019 North American Power Symposium (NAPS).

[7]  Wei Wang,et al.  Volt-VAR Control in Power Distribution Systems with Deep Reinforcement Learning , 2019, 2019 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm).

[8]  Alec Radford,et al.  Proximal Policy Optimization Algorithms , 2017, ArXiv.

[9]  Carlo Fischione,et al.  Voltage Control Using Limited Communication , 2019, IEEE Transactions on Control of Network Systems.

[10]  Hao Jan Liu,et al.  Fast Local Voltage Control Under Limited Reactive Power: Optimality and Stability Analysis , 2015, IEEE Transactions on Power Systems.

[11]  Eilyan Bitar,et al.  Real-time Voltage Regulation in Distribution Systems via Decentralized PV Inverter Control , 2018, HICSS.

[12]  Yi Wu,et al.  Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments , 2017, NIPS.

[13]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[14]  Gang Wang,et al.  Stochastic Reactive Power Management in Microgrids With Renewables , 2014, IEEE Transactions on Power Systems.

[15]  Albert Y. S. Lam,et al.  An Optimal and Distributed Method for Voltage Regulation in Power Distribution Systems , 2012, IEEE Transactions on Power Systems.

[16]  Dileep Kalathil,et al.  Communication-free Voltage Regulation in Distribution Networks with Deep PV Penetration , 2020, HICSS.

[17]  Na Li,et al.  Optimal Distributed Feedback Voltage Control Under Limited Reactive Power , 2018, IEEE Transactions on Power Systems.

[18]  Ratnesh Sharma,et al.  Coordination of PV Smart Inverters Using Deep Reinforcement Learning for Grid Voltage Regulation , 2019, 2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA).

[19]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[20]  Mohammad A. S. Masoum,et al.  Optimal PV Inverter Reactive Power Control and Real Power Curtailment to Improve Performance of Unbalanced Four-Wire LV Distribution Networks , 2014, IEEE Transactions on Sustainable Energy.

[21]  M.R. Iravani,et al.  Power Management Strategies for a Microgrid With Multiple Distributed Generation Units , 2006, IEEE Transactions on Power Systems.

[22]  Wei Wang,et al.  Safe Off-Policy Deep Reinforcement Learning Algorithm for Volt-VAR Control in Power Distribution Systems , 2020, IEEE Transactions on Smart Grid.