Deep Learning-based Predictive Control of Battery Management for Frequency Regulation

This paper proposes a deep learning-based optimal battery management scheme for frequency regulation (FR) by integrating model predictive control (MPC), supervised learning (SL), reinforcement learning (RL), and high-fidelity battery models. By taking advantage of deep neural networks (DNNs), the derived DNN-approximated policy is computationally efficient in online implementation. The design procedure of the proposed scheme consists of two sequential processes: (1) the SL process, in which we first run a simulation with an MPC embedding a low-fidelity battery model to generate a training data set and then use SL algorithms to optimize a DNN-approximated policy on that data set; and (2) the RL process, in which we use RL algorithms to improve the performance of the DNN-approximated policy by balancing short-term economic incentives against long-term battery degradation. The SL process speeds up the subsequent RL process by providing a good initialization. Because it uses RL algorithms, the proposed scheme can learn from data generated by simulating the FR policy on the high-fidelity battery simulator and use that data to adjust the DNN-approximated policy, which is originally based on a low-fidelity battery model. A case study using real-world data of FR signals and prices is performed. Simulation results show that, compared to conventional MPC schemes, the proposed deep learning-based scheme achieves higher economic benefits from FR participation at a lower online computational cost.

arXiv:2201.01166v1 [eess.SY], 4 Jan 2022
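The two-stage procedure described above can be illustrated with a toy sketch. Everything in it is an assumption made for illustration and not the paper's implementation: a linear policy stands in for the DNN, the hypothetical `expert_action` rule stands in for the MPC with a low-fidelity model, `episode_return` is a made-up simulator with a quadratic degradation penalty standing in for the high-fidelity battery model, and a simple random-search policy improvement step is swapped in for the RL algorithm.

```python
import random

random.seed(0)

def expert_action(soc, signal):
    """Hypothetical stand-in for the MPC on a low-fidelity model:
    follow the FR signal and gently pull state of charge toward 0.5."""
    return signal + 0.2 * (0.5 - soc)

def policy(theta, soc, signal):
    """Linear policy standing in for the DNN: theta = [w_soc, w_signal, bias]."""
    return theta[0] * soc + theta[1] * signal + theta[2]

# --- SL process: generate (state, action) pairs from the expert, then fit ---
data = []
for _ in range(500):
    soc, signal = random.uniform(0.1, 0.9), random.uniform(-1.0, 1.0)
    data.append((soc, signal, expert_action(soc, signal)))

theta_sl = [0.0, 0.0, 0.0]
for _ in range(200):  # epochs of plain SGD on the squared imitation error
    for soc, signal, target in data:
        err = policy(theta_sl, soc, signal) - target
        grads = (err * soc, err * signal, err)
        theta_sl = [w - 0.1 * g for w, g in zip(theta_sl, grads)]

def episode_return(theta, seed=1, steps=200):
    """Toy simulator (assumed): revenue for tracking the FR signal minus a
    degradation penalty that the expert above does not account for."""
    sim = random.Random(seed)  # fixed scenario so comparisons are fair
    soc, total = 0.5, 0.0
    for _ in range(steps):
        signal = sim.uniform(-1.0, 1.0)
        power = max(-1.0, min(1.0, policy(theta, soc, signal)))
        soc = max(0.0, min(1.0, soc + 0.01 * power))
        total += 1.0 - abs(power - signal) - 0.8 * power ** 2
    return total

def improve(theta, iters=200, sigma=0.05):
    """Random-search policy improvement (a simple substitute for RL):
    keep a perturbed parameter vector only if the simulated return rises."""
    best, best_ret = list(theta), episode_return(theta)
    for _ in range(iters):
        cand = [w + sigma * random.gauss(0.0, 1.0) for w in best]
        ret = episode_return(cand)
        if ret > best_ret:
            best, best_ret = cand, ret
    return best, best_ret

# --- RL process (substituted): start from the SL policy and fine-tune it ---
ret_sl = episode_return(theta_sl)
theta_rl, ret_rl = improve(theta_sl)
print(f"return after SL: {ret_sl:.2f}, after fine-tuning: {ret_rl:.2f}")
```

In this sketch the fine-tuning stage starts from the imitated parameters rather than from scratch, which mirrors the role the abstract assigns to the SL process as an initialization. The return is evaluated on a single fixed scenario, so the improvement shown is specific to that scenario.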
