A sequential resource investment planning framework using reinforcement learning and simulation-based optimization: A case study on microgrid storage expansion

A model and expansion plan have been developed to optimally determine microgrid designs as they evolve to dynamically react to changing conditions and to exploit energy storage capabilities. In the wake of the highly electrified future ahead of us, the role of energy storage is crucial wherever distributed generation is abundant, such as microgrid settings. Given the variety of storage options that are recently becoming more economical, determining which type of storage technology to invest in, along with the appropriate timing and capacity becomes a critical research question. In problems where the investment timing is of high priority, like this one, developing analytical and systematic frameworks for rigorously considering these issues is indispensable. From a business perspective, these strategic frameworks will aim to optimize the process of investment planning, by leveraging novel approaches and by capturing all the problem details that traditional approaches are unable to. Reinforcement learning algorithms have recently proven to be successful in problems where sequential decision-making is inherent. In the operations planning area, these algorithms are already used but mostly in short-term problems with well-defined constraints and low levels of uncertainty modeling. On the contrary, in this work, we expand and tailor these techniques to long-term investment planning by utilizing model-free approaches, like the Q-learning algorithm, combined with simulation-based models. We find that specific types of energy storage units, including the vanadium-redox battery, can be expected to be at the core of the future microgrid applications, and therefore, require further attention. Another key finding is that the optimal storage capacity threshold for a system depends heavily on the price movements of the available storage units in the market.

[1]  Eduardo F. Morales,et al.  An Introduction to Reinforcement Learning , 2011 .

[2]  Frede Blaabjerg,et al.  Review of Energy Storage System Technologies in Microgrid Applications: Issues and Challenges , 2018, IEEE Access.

[3]  Josep M. Guerrero,et al.  Aalborg Universitet Optimal Power Flow in Microgrids with Energy Storage , 2013 .

[4]  I. Kamwa,et al.  Causes of the 2003 major grid blackouts in North America and Europe, and recommended means to improve system dynamic performance , 2005, IEEE Transactions on Power Systems.

[5]  Keith Robert Pullen,et al.  A Review of Flywheel Energy Storage System Technologies and Their Applications , 2017 .

[6]  David W. Coit,et al.  Combined effects of load dynamics and dependence clusters on cascading failures in network systems , 2018, Reliab. Eng. Syst. Saf..

[7]  Nataraj Venkataramanan,et al.  Synthetic Data Generation , 2016 .

[8]  Wenlong Jing,et al.  Dynamic power allocation of battery-supercapacitor hybrid energy storage for standalone PV microgrid applications , 2017 .

[9]  Serguei Netessine,et al.  Strategic Investment in Renewable Energy Sources: The Effect of Supply Intermittency , 2017, Manuf. Serv. Oper. Manag..

[10]  Vitaly Levdik,et al.  Time Limits in Reinforcement Learning , 2017, ICML.

[11]  Yue Yuan,et al.  International Conference on Electricity Distribution Prague , 8-11 June 2009 Paper 0358 BENEFITS OF ENERGY STORAGE IN POWER SYSTEMS WITH HIGH LEVEL OF INTERMITTENT GENERATION , 2009 .

[12]  G.C. Goodwin,et al.  A battery storage control scheme for AC microgrids , 2017, 2017 20th International Conference on Electrical Machines and Systems (ICEMS).

[13]  Rahmat-Allah Hooshmand,et al.  Comprehensive review of generation and transmission expansion planning , 2013 .

[14]  Jian Zhou,et al.  An improved model for cascading failures in complex networks , 2012, 2012 IEEE 2nd International Conference on Cloud Computing and Intelligence Systems.

[15]  Nima Amjady,et al.  Robust Transmission and Energy Storage Expansion Planning in Wind Farm-Integrated Power Systems Considering Transmission Switching , 2016, IEEE Transactions on Sustainable Energy.

[16]  Richard J. Campbell,et al.  Weather-Related Power Outages and Electric System Resiliency , 2012 .

[17]  Shaghayegh Bahramirad,et al.  Reliability-Constrained Optimal Sizing of Energy Storage System in a Microgrid , 2012, IEEE Transactions on Smart Grid.

[18]  Enrico Zio,et al.  Reinforcement learning for microgrid energy management , 2013 .

[19]  R. S. Milton,et al.  Distributed Optimization of Solar Micro-grid Using Multi Agent Reinforcement Learning☆ , 2015 .

[20]  Daishi Harada,et al.  Reinforcement Learning with Time , 1997, AAAI/IAAI.

[21]  Geert Deconinck,et al.  Battery Energy Management in a Microgrid Using Batch Reinforcement Learning , 2017 .

[22]  Jian Zhou,et al.  Economic trends and comparisons for optimizing grid-outage resilient photovoltaic and battery systems , 2019 .

[23]  N. D. Hatziargyriou,et al.  Multi-agent reinforcement learning for microgrids , 2010, IEEE PES General Meeting.

[24]  Gary E. Bolton,et al.  Learning-by-Doing in the Newsvendor Problem: A Laboratory Investigation of the Role of Experience and Feedback , 2008, Manuf. Serv. Oper. Manag..

[25]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[26]  R. S. Milton,et al.  Reinforcement learning for optimal energy management of a solar microgrid , 2014, 2014 IEEE Global Humanitarian Technology Conference - South Asia Satellite (GHTC-SAS).

[27]  Jose A. Guajardo Third-Party Ownership Business Models and the Operational Performance of Solar Energy Systems , 2018, Manuf. Serv. Oper. Manag..

[28]  David W. Coit,et al.  Battery selection for optimal grid-outage resilient photovoltaic and battery systems , 2019, 1901.11389.

[29]  Dunbar P. Birnie,et al.  Economic and resilience benefit analysis of incorporating battery storage to photovoltaic array generation , 2019, Renewable Energy.

[30]  Qing Li,et al.  Integrated optimization of offshore wind farm layout design and turbine opportunistic condition-based maintenance , 2018, Comput. Ind. Eng..

[31]  Yong He,et al.  Optimal control in microgrid using multi-agent reinforcement learning. , 2012, ISA transactions.

[32]  Roman Kapuscinski,et al.  Curtailing Intermittent Generation in Electrical Systems , 2013, Manuf. Serv. Oper. Manag..

[33]  Stuart J. Russell,et al.  Bayesian Q-Learning , 1998, AAAI/IAAI.

[34]  Y. Parag,et al.  Microgrids: A review of technologies, key drivers, and outstanding issues , 2018, Renewable and Sustainable Energy Reviews.

[35]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[36]  Mahmud Fotuhi-Firuzabad,et al.  Stochastic Capacity Expansion Planning of Remote Microgrids With Wind Farms and Energy Storage , 2015, IEEE Transactions on Sustainable Energy.

[37]  Devanandham Henry,et al.  On the Impacts of Power Outages during Hurricane Sandy—A Resilience‐Based Analysis , 2016, Syst. Eng..

[38]  Hado van Hasselt,et al.  Double Q-learning , 2010, NIPS.

[39]  Amin Khodaei,et al.  A Comprehensive Battery Energy Storage Optimal Sizing Model for Microgrid Applications , 2018, IEEE Transactions on Power Systems.

[40]  Sohrab Asgarpoor,et al.  Reinforcement Learning Approach for Optimal Distributed Energy Management in a Microgrid , 2018, IEEE Transactions on Power Systems.

[41]  Wenbin Wang,et al.  Capacity Investment in Renewable Energy Technology with Supply Intermittency: Data Granularity Matters! , 2015, Manuf. Serv. Oper. Manag..