Deep Reinforcement Learning-Based Energy Storage Arbitrage With Accurate Lithium-Ion Battery Degradation Model

Accurate estimation of battery degradation cost is one of the main barriers for battery participating on the energy arbitrage market. This paper addresses this problem by using a model-free deep reinforcement learning (DRL) method to optimize the battery energy arbitrage considering an accurate battery degradation model. Firstly, the control problem is formulated as a Markov Decision Process (MDP). Then a noisy network based deep reinforcement learning approach is proposed to learn an optimized control policy for storage charging/discharging strategy. To address the uncertainty of electricity price, a hybrid Convolutional Neural Network (CNN) and Long Short Term Memory (LSTM) model is adopted to predict the price for the next day. Finally, the proposed approach is tested on the historical U.K. wholesale electricity market prices. The results compared with model based Mixed Integer Linear Programming (MILP) have demonstrated the effectiveness and performance of the proposed framework.

[1]  Tom Schaul,et al.  Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[2]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[3]  Philip Bachman,et al.  Deep Reinforcement Learning that Matters , 2017, AAAI.

[4]  Kankar Bhattacharya,et al.  Battery Energy Storage Systems in Energy and Reserve Markets , 2020, IEEE Transactions on Power Systems.

[5]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[6]  Ning Zhang,et al.  Probabilistic individual load forecasting using pinball loss guided LSTM , 2019, Applied Energy.

[7]  R. Spotnitz Simulation of capacity fade in lithium-ion batteries , 2003 .

[8]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[9]  Hao Wang,et al.  Energy Storage Arbitrage in Real-Time Markets via Reinforcement Learning , 2017, 2018 IEEE Power & Energy Society General Meeting (PESGM).

[10]  Nima Amjady,et al.  Affinely Adjustable Robust Bidding Strategy for a Solar Plant Paired With a Battery Storage , 2019, IEEE Transactions on Smart Grid.

[11]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[12]  Nathaniel S. Borenstein,et al.  IBM ® , 2009 .

[13]  Jihong Wang,et al.  Capacity fade modelling of lithium-ion battery under cyclic loading conditions , 2016 .

[14]  Darrell F. Socie,et al.  Simple rainflow counting algorithms , 1982 .

[15]  Hadi Khani,et al.  Real-Time Optimal Dispatch and Economic Viability of Cryogenic Energy Storage Exploiting Arbitrage Opportunities in an Electricity Market , 2015, IEEE Transactions on Smart Grid.

[16]  Zhi Zhou,et al.  Energy Storage Arbitrage Under Day-Ahead and Real-Time Price Uncertainty , 2018, IEEE Transactions on Power Systems.

[17]  Vassilios G. Agelidis,et al.  Model Predictive Control for Distributed Microgrid Battery Energy Storage Systems , 2017, IEEE Transactions on Control Systems Technology.

[18]  Wojciech Zaremba,et al.  OpenAI Gym , 2016, ArXiv.

[19]  João Tomé Saraiva,et al.  Use of battery storage systems for price arbitrage operations in the 15- and 60-min German intraday markets , 2018, Electric Power Systems Research.

[20]  Haibo He,et al.  Model-Free Real-Time EV Charging Scheduling Based on Deep Reinforcement Learning , 2019, IEEE Transactions on Smart Grid.

[21]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[22]  Hamed Mohsenian Rad,et al.  Optimal Operation of Independent Storage Systems in Energy and Reserve Markets With High Wind Penetration , 2014, IEEE Transactions on Smart Grid.

[23]  Yuan Zhang,et al.  Short-Term Residential Load Forecasting Based on LSTM Recurrent Neural Network , 2019, IEEE Transactions on Smart Grid.

[24]  Daniel S. Kirschen,et al.  Modeling of Lithium-Ion Battery Degradation for Cell Life Assessment , 2018, IEEE Transactions on Smart Grid.

[25]  Le Xie,et al.  Risk Measure Based Robust Bidding Strategy for Arbitrage Using a Wind Farm and Energy Storage , 2013, IEEE Transactions on Smart Grid.

[26]  Kevin G. Gallagher,et al.  Impact of battery degradation on energy arbitrage revenue of grid-level energy storage , 2017 .

[27]  José R. Vázquez-Canteli,et al.  Reinforcement learning for demand response: A review of algorithms and modeling techniques , 2019, Applied Energy.

[28]  M. Wohlfahrt‐Mehrens,et al.  Ageing mechanisms in lithium-ion batteries , 2005 .

[29]  Jihong Wang,et al.  Overview of current development in electrical energy storage technologies and the application potential in power system operation , 2015 .

[30]  Shane Legg,et al.  Noisy Networks for Exploration , 2017, ICLR.