Electricity Demand Response under Real-Time Pricing: A Multi-armed Bandit Game

Real-time electricity pricing (RTP) for consumers has long been argued to be key to realize the many envisioned benefits of a smart energy grid. However, there has not been a consensus on how to best implement RTP in an organized, competitive wholesale market with active demand participation. Since most of such markets implement a two-settlement system, with day-ahead electricity price forecasts guiding physical transactions in the next day and real-time ex post prices settling any realtime imbalances, it is a natural idea to let consumers respond to the day-ahead prices. We show in this paper through simulation that naive responsive behaviors to the day-ahead price signals can lead to high price volatility, which will increase not only the risk of system instability, but also the financial risks faced by the consumers. To overcome this issue, we propose a game-theoretic framework in which each consumer solves a multi-armed bandit problem; that is, each consumer learns from the history of the game and attempts to minimize the cumulative regrets. We show through simulation that such a framework leads to drastically reduced volatility on real-time prices and much flatter load curves for the entire grid.

[1]  Jia Yuan Yu,et al.  Mean Field Analysis of Multi-Armed Bandit Games , 2013 .

[2]  Andrew L. Liu,et al.  Intelligent demand response for electricity consumers: A multi-armed bandit game approach , 2017, 2017 19th International Conference on Intelligent System Application to Power Systems (ISAP).

[3]  Jingjie Xiao,et al.  Grid integration and smart grid implementation of emerging technologies in electric power systems through approximate dynamic programming , 2013 .

[4]  Munther A. Dahleh,et al.  Volatility of Power Grids Under Real-Time Pricing , 2011, IEEE Transactions on Power Systems.

[5]  A. Papavasiliou,et al.  Reserve Requirements for Wind Power Integration: A Scenario-Based Stochastic Programming Framework , 2011, IEEE Transactions on Power Systems.

[6]  Demosthenis Teneketzis,et al.  Multi-Armed Bandit Problems , 2008 .

[7]  Fred C. Schweppe Power: Power systems `2000¿: Hierarchical control strategies: Multilevel controls and home minis will enable utilities to buy and sell power at `real time¿ rates determined by supply and demand , 1978 .

[8]  Drew Fudenberg,et al.  Game theory (3. pr.) , 1991 .

[9]  Sarvapali D. Ramchurn,et al.  Agent-based control for decentralised demand side management in the smart grid , 2011, AAMAS.

[10]  Warren B. Powell,et al.  Optimal Learning: Powell/Optimal , 2012 .

[11]  Peter Auer,et al.  Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[12]  Dheepak Krishnamurthy,et al.  psst: An open-source power system simulation toolbox in Python , 2016, 2016 North American Power Symposium (NAPS).