论文信息 - Deep Learning-based Predictive Control of Battery Management for Frequency Regulation

Deep Learning-based Predictive Control of Battery Management for Frequency Regulation

This paper proposes a deep learning-based optimal battery management scheme for frequency regulation (FR) by integrating model predictive control (MPC), supervised learning (SL), reinforcement learning (RL), and high-fidelity battery models. By taking advantage of deep neural networks (DNNs), the derived DNN-approximated policy is computationally efficient in online implementation. The design procedure of the proposed scheme consists of two sequential processes: (1) the SL process, in which we first run a simulation with an MPC embedding a low-fidelity battery model to generate a training data set, and then, based on the generated data set, we optimize a DNN-approximated policy using SL algorithms; and (2) the RL process, in which we 1 ar X iv :2 20 1. 01 16 6v 1 [ ee ss .S Y ] 4 J an 2 02 2 utilize RL algorithms to improve the performance of the DNN-approximated policy by balancing short-term economic incentives and long-term battery degradation. The SL process speeds up the subsequent RL process by providing a good initialization. By utilizing RL algorithms, one prominent property of the proposed scheme is that it can learn from the data generated by simulating the FR policy on the high-fidelity battery simulator to adjust the DNN-approximated policy, which is originally based on lowfidelity battery model. A case study using real-world data of FR signals and prices is performed. Simulation results show that, compared to conventional MPC schemes, the proposed deep learning-based scheme can effectively achieve higher economic benefits of FR participation while maintaining lower online computational cost.

[1] Victor M. Zavala,et al. Benchmarking stochastic and deterministic MPC: A case study in stationary battery systems , 2019, AIChE Journal.

[2] A. Oudalov,et al. Value Analysis of Battery Energy Storage Applications in Power Systems , 2006, 2006 IEEE PES Power Systems Conference and Exposition.

[3] Ralph E. White,et al. Review of Models for Predicting the Cycling Performance of Lithium Ion Batteries , 2006 .

[4] D. Bertsekas. Reinforcement Learning and Optimal ControlA Selective Overview , 2018 .

[5] Victor M. Zavala,et al. A Stochastic Model Predictive Control Framework for Stationary Battery Systems , 2018, IEEE Transactions on Power Systems.

[6] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[7] Chongqing Kang,et al. Optimal Bidding Strategy of Battery Storage in Power Markets Considering Performance-Based Regulation and Battery Cycle Life , 2016, IEEE Transactions on Smart Grid.

[8] Guy Lever,et al. Deterministic Policy Gradient Algorithms , 2014, ICML.

[9] Victor M. Zavala,et al. Data Centers as Dispatchable Loads to Harness Stranded Power , 2016, IEEE Transactions on Sustainable Energy.

[10] Hosam K. Fathy,et al. Genetic identification and fisher identifiability analysis of the Doyle–Fuller–Newman model from experimental cycling of a LiFePO4 cell , 2012 .

[11] P. Balbuena,et al. Lithium-ion batteries : solid-electrolyte interphase , 2004 .

[12] Victor M. Zavala,et al. Multiscale model predictive control of battery systems for frequency regulation markets using physics-based models , 2020, Journal of Process Control.

[13] Frank Allgöwer,et al. Learning an Approximate Model Predictive Controller With Guarantees , 2018, IEEE Control Systems Letters.

[14] Ali Ahmadian,et al. Optimal bidding strategy of a virtual power plant in day-ahead energy and frequency regulation markets: A deep learning-based approach , 2021 .

[15] Daniel S. Kirschen,et al. Optimal Battery Participation in Frequency Regulation Markets , 2017, IEEE Transactions on Power Systems.

[16] E. T. Maddalena,et al. A Neural Network Architecture to Learn Explicit MPC Controllers from Data , 2019, ArXiv.

[17] Emanuel Peled,et al. The Electrochemical Behavior of Alkali and Alkaline Earth Metals in Nonaqueous Battery Systems—The Solid Electrolyte Interphase Model , 1979 .

[18] R. B. Gopaluni,et al. Deep Neural Network Approximation of Nonlinear Model Predictive Control , 2020 .

[19] Monimoy Bujarbaruah,et al. Near-Optimal Rapid MPC Using Neural Networks: A Primal-Dual Policy Learning Framework , 2019, IEEE Transactions on Control Systems Technology.

[20] Hosam K. Fathy,et al. Optimal Control of Film Growth in Lithium-Ion Battery Packs via Relay Switches , 2011, IEEE Transactions on Industrial Electronics.

[21] Alexander Mitsos,et al. Accelerating nonlinear model predictive control through machine learning , 2020 .

[22] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.