Deep Reinforcement Learning-Based Trading Strategy for Load Aggregators on Price-Responsive Demand

With the development of the Internet of things and smart grid technologies, modern electricity markets seamlessly connect demand response to the spot market through price-responsive loads, in which the trading strategy of load aggregators plays a crucial role in profit capture. In this study, we propose a deep reinforcement learning-based strategy for purchasing and selling electricity based on real-time electricity prices and real-time demand data in the spot market, which maximizes the revenue of load aggregators. The deep deterministic policy gradient (DDPG) is applied through a bidirectional long- and short-term memory (BiLSTM) network to extract the market state features that are used to make trading decisions. The effectiveness of the method is validated using datasets from the New England electricity market and Australian electricity market by introducing a bidirectional LSTM structure into the actor-critic network structure to learn hidden states in partially observable Markov states through memory inference. Comparative experiments of the method show that the method can provide greater yield results.

[1]  M. Ibrahim,et al.  Resiliency Assessment of Power Systems Using Deep Reinforcement Learning , 2022, Computational intelligence and neuroscience.

[2]  Hongbin Sun,et al.  Deep Reinforcement Learning Microgrid Optimization Strategy Considering Priority Flexible Demand Side , 2022, Sensors.

[3]  Qiuwei Wu,et al.  Deep Reinforcement Learning-Based Charging Pricing for Autonomous Mobility-on-Demand System , 2022, IEEE Transactions on Smart Grid.

[4]  Liming Feng,et al.  Diversity Evolutionary Policy Deep Reinforcement Learning , 2021, Comput. Intell. Neurosci..

[5]  Maxim A. Dulebenets,et al.  An Adaptive Polyploid Memetic Algorithm for scheduling trucks at a cross-docking terminal , 2021, Inf. Sci..

[6]  Guangdong Tian,et al.  An integrated optimization method for tactical-level planning in liner shipping with heterogeneous ship fleet and environmental considerations , 2021, Adv. Eng. Informatics.

[7]  Amin Shojaeighadikolaei,et al.  Demand Responsive Dynamic Pricing Framework for Prosumer Dominated Microgrids using Multiagent Reinforcement Learning , 2020, 2020 52nd North American Power Symposium (NAPS).

[8]  Morteza Nazari-Heris,et al.  Economic-environmental effect of power to gas technology in coupled electricity and gas systems with price-responsive shiftable loads , 2020, Journal of Cleaner Production.

[9]  Changsheng Zhang,et al.  An online-learning-based evolutionary many-objective algorithm , 2020, Inf. Sci..

[10]  Hongmei Chi,et al.  An augmented self-adaptive parameter control in evolutionary computation: A case study for the berth scheduling problem , 2019, Adv. Eng. Informatics.

[11]  Pierre Pinson,et al.  Evaluating Price-Based Demand Response in Practice—With Application to the EcoGrid EU Experiment , 2018, IEEE Transactions on Smart Grid.

[12]  Nando de Freitas,et al.  Sample Efficient Actor-Critic with Experience Replay , 2016, ICLR.

[13]  Shahaboddin Shamshirband,et al.  Estimating building energy consumption using extreme learning machine method , 2016 .

[14]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[15]  Geoffrey E. Hinton,et al.  A Simple Way to Initialize Recurrent Networks of Rectified Linear Units , 2015, ArXiv.

[16]  Guy Lever,et al.  Deterministic Policy Gradient Algorithms , 2014, ICML.

[17]  J. Cardell,et al.  Implementing a grid state indicator for responsive retail demand , 2013, 2013 North American Power Symposium (NAPS).

[18]  Martha White,et al.  Linear Off-Policy Actor-Critic , 2012, ICML.

[19]  Ying Xiao,et al.  Impact of price responsive demand on PJM Real-time/Look-ahead markets , 2011, 2011 IEEE Power and Energy Society General Meeting.

[20]  Jürgen Schmidhuber,et al.  Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.

[21]  Jürgen Schmidhuber,et al.  Recurrent nets that time and count , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[22]  Masoud Rabbani,et al.  Ambulance routing in disaster response considering variable patient condition: NSGA-II and MOPSO algorithms , 2021, Journal of Industrial & Management Optimization.

[23]  Dunnan Liu,et al.  Dynamic Pricing Strategy of Electric Vehicle Aggregators Based on DDPG Reinforcement Learning Algorithm , 2021, IEEE Access.

[24]  Dunnan Liu,et al.  Research on Cooperation Between Wind Farm and Electric Vehicle Aggregator Based on A3C Algorithm , 2021, IEEE Access.

[25]  Kevin Tomsovic,et al.  Bidding Strategy for Microgrid in Day-Ahead Market Based on Hybrid Stochastic/Robust Optimization , 2016, IEEE Transactions on Smart Grid.