Learning the Optimal Energy Supply Plan with Online Convex Optimization

In this paper, we propose an online learning approach, utilizing the framework of Online Convex Optimization (OCO) to tackle the problem of learning the optimal energy supply plan, in terms of total incurred cost, from the perspective of an electricity retailer that also owns power generators and renewable energy sources. The retailer does not have prior knowledge about key dynamic processes that affect the problem, such as future demand, renewable energy supply, and wholesale market prices. The retailer sequentially learns how to adjust the power generation plan so as to minimize the power generation cost plus the cost of buying additional power from the wholesale market, in case the planned generated amount is not enough to cover the demand. We use Online Mirror Descent (OMD) and Online Gradient Descent (OGD), and we verify that they both achieve sublinear static and dynamic regret, which compare the cumulative cost of each algorithm against that of the optimal offline static and dynamic solution respectively. In particular, dynamic regret appears to be well aligned with the considered setting since it captures the fact that a power generator can dynamically change its power output between consecutive time slots based on only small adjustments and not in an arbitrary fashion, due to ramp constraints. Our model can capture different settings of electricity markets. Simulations with real data verify that OMD precisely learns the optimal dynamic supply policy for small power adjustments between consecutive time slots.

[1]  Lijun Zhang,et al.  Adaptive Online Learning in Dynamic Environments , 2018, NeurIPS.

[2]  Adam Wierman,et al.  Online convex optimization with ramp constraints , 2015, 2015 54th IEEE Conference on Decision and Control (CDC).

[3]  T. S. Jayram,et al.  Online optimization for the smart (micro) grid , 2012, 2012 Third International Conference on Future Systems: Where Energy, Computing and Communication Meet (e-Energy).

[4]  Brian M. Sadler,et al.  Nonstationary Nonparametric Online Learning: Balancing Dynamic Regret and Model Parsimony , 2019, ArXiv.

[5]  Vincent W. S. Wong,et al.  An Autonomous Demand Response Algorithm based on Online Convex Optimization , 2018, 2018 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm).

[6]  Martin Zinkevich,et al.  Online Convex Programming and Generalized Infinitesimal Gradient Ascent , 2003, ICML.

[7]  Shai Shalev-Shwartz,et al.  Online Learning and Online Convex Optimization , 2012, Found. Trends Mach. Learn..

[8]  Rebecca Willett,et al.  Online Convex Optimization in Dynamic Environments , 2015, IEEE Journal of Selected Topics in Signal Processing.

[9]  Vijay Gupta,et al.  Distributed Energy Management for Networked Microgrids Using Online ADMM With Regret , 2018, IEEE Transactions on Smart Grid.

[10]  Lazaros Gkatzikis,et al.  The Role of Aggregators in Smart Grid Demand Response Markets , 2013, IEEE Journal on Selected Areas in Communications.

[11]  Yi Wang,et al.  Review of Smart Meter Data Analytics: Applications, Methodologies, and Challenges , 2018, IEEE Transactions on Smart Grid.

[12]  Elad Hazan,et al.  Introduction to Online Convex Optimization , 2016, Found. Trends Optim..

[13]  Leandros Tassiulas,et al.  Optimal Control Policies for Power Demand Scheduling in the Smart Grid , 2012, IEEE Journal on Selected Areas in Communications.

[14]  Georgios B. Giannakis,et al.  Online optimal power flow with renewables , 2014, 2014 48th Asilomar Conference on Signals, Systems and Computers.

[15]  P. Cramton Electricity market design , 2017 .

[16]  Shahin Shahrampour,et al.  Online Optimization : Competing with Dynamic Comparators , 2015, AISTATS.