Deep Reinforcement Learning for Joint Bidding and Pricing of Load Serving Entity

In this paper, we address the problem of jointly determining the energy bid submitted to the wholesale electricity market (WEM) and the energy price charged in the retail electricity market (REM) for a load serving entity (LSE). The joint bidding and pricing problem is formulated as a Markov decision process (MDP) with continuous state and action spaces, in which the energy bid and the energy price are two actions that share a common objective. We apply the deep deterministic policy gradient (DDPG) algorithm to solve this MDP for the optimal bidding and pricing policies. However, the DDPG algorithm typically requires a large number of state transition samples, which are costly to obtain in this application. To address this, we use neural networks to learn dynamical bid and price response functions from historical data; these model the WEM and the collective behavior of the end-use customers (EUCs), respectively. These response functions explicitly capture the inter-temporal correlations of the WEM clearing results and the EUC responses, and can be used to generate state transition samples at no cost. More importantly, the response functions also inform the choice of states in the MDP formulation. Numerical simulations illustrate the effectiveness of the proposed methodology.
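
As a rough illustration of how response functions can stand in for the real markets when generating state transition samples, the following is a minimal sketch and not the method of the paper. Everything in it is an assumption made for illustration: the names `bid_response`, `price_response`, `step`, and `generate_transitions` are hypothetical, the response functions are fixed toy formulas rather than neural networks learned from historical data, and the reward (retail revenue minus wholesale cost minus a mismatch penalty) is a guess at a plausible objective. The state simply carries the previous clearing result and the previous demand, mirroring the idea that the response functions inform the choice of states.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-ins for the learned response functions. In the paper these
# are neural networks fit to historical data; here they are fixed toy formulas.
def bid_response(prev_cleared, bid):
    """Toy WEM model: returns (cleared_energy, wholesale_price)."""
    cleared = 0.5 * prev_cleared + 0.5 * bid  # depends on the previous clearing result
    price = 20.0 + 0.1 * cleared
    return cleared, price

def price_response(prev_demand, retail_price):
    """Toy EUC model: demand depends on previous demand and the retail price."""
    return max(0.0, 0.6 * prev_demand + 40.0 - 0.5 * retail_price)

def step(state, action):
    """Simulate one transition. state = (prev_cleared, prev_demand); action = (bid, retail_price)."""
    prev_cleared, prev_demand = state
    bid, retail_price = action
    cleared, wholesale_price = bid_response(prev_cleared, bid)
    demand = price_response(prev_demand, retail_price)
    # Assumed reward: retail revenue minus wholesale cost minus a mismatch penalty.
    reward = (retail_price * demand
              - wholesale_price * cleared
              - 50.0 * abs(demand - cleared))
    return (cleared, demand), reward

def generate_transitions(n, state=(50.0, 50.0)):
    """Roll the toy models forward under random actions and collect (s, a, r, s') tuples."""
    buffer = []
    for _ in range(n):
        action = (rng.uniform(0.0, 100.0), rng.uniform(20.0, 60.0))
        next_state, reward = step(state, action)
        buffer.append((state, action, reward, next_state))
        state = next_state
    return buffer
```

Tuples collected this way could fill the replay buffer of an off-policy learner such as DDPG. In a real setup the actions would come from the actor network with exploration noise rather than from uniform sampling.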
