Dynamic Compensation Pricing Scheme for Demand Resources Based on Deep Reinforcement Learning

As demand response is playing a more and more important role in power system operation, the pricing of those kinds of service becomes critical to the economic operation. In this paper, a pricing method based on deep reinforcement learning is proposed to maximize the cumulative gain of the load aggregator. The responsive behavior of demand resources is affected by the price generated by the load aggregator which dynamically adjusts its compensation price in an optimal way to maximize its return in a longer time horizon. Experimental simulation is conducted to illustrate the effectiveness of the proposed pricing model.

[1]  Sesh Commuri,et al.  An Artificial Neural Network based Approach to Electric Demand Response Implementation , 2018, 2018 North American Power Symposium (NAPS).

[2]  Seung Ho Hong,et al.  Incentive-based demand response for smart grid with reinforcement learning and deep neural network , 2019, Applied Energy.

[3]  S. Raj,et al.  Reference Price Research: Review and Propositions , 2005 .

[4]  Tao Yu,et al.  Deep transfer Q-learning with virtual leader-follower for supply-demand Stackelberg game of smart grid , 2017 .

[5]  Peter L. Langbein Demand response participation in PJM wholesale markets , 2012, 2012 IEEE PES Innovative Smart Grid Technologies (ISGT).

[6]  Rafael Luis Wagner Lowering consumers’ price image without lowering their internal reference price: the role of pay-what-you-want pricing mechanism , 2019, Journal of Revenue and Pricing Management.

[7]  Tyrone Fernando,et al.  Sequential quadratic programming particle swarm optimization for wind power system operations considering emissions , 2013 .

[8]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[9]  Rauli Svento,et al.  Towards Flexible Energy Demand – Preferences for Dynamic Contracts, Services and Emissions Reductions , 2018 .

[10]  Anna S. Mattila,et al.  The role of reference prices in the lodging industry: the moderating effect of an individual’s psychological state , 2019, Journal of Travel & Tourism Marketing.

[11]  Rauli Svento,et al.  Towards Flexible Energy Demand – Preferences for Dynamic Contracts, Services and Emissions Reductions , 2018, Energy Economics.

[12]  John P. Charlton,et al.  The role of internal reference prices in consumers' willingness to pay judgments: Thaler's Beer Pricing Task revisited. , 2001, Acta psychologica.