Scalable coordinated management of peer-to-peer energy trading: A multi-cluster deep reinforcement learning approach

Abstract The increasing penetration of small-scale distributed energy resources (DER) has the potential to support cost-efficient energy balancing in emerging electricity systems, but is also fundamentally affecting the conventional operation paradigm of the latter. In this context, innovative market mechanisms need to be devised to better coordinate and provide incentives for DER to utilize their flexibility. Peer-to-Peer (P2P) energy trading has emerged as an alternative approach to facilitate direct trading between consumers and prosumers interacting in an energy collective and fosters more efficient local demand–supply balancing. While previous research has primarily focused on the technical and economic benefits of P2P trading, little effort has been made towards the incorporation of prosumers’ heterogeneous characteristics in the P2P trading problem. Here, we address this research gap by classifying the participating prosumers into multiple clusters with regard to their portfolio of DER, and analyzing their trading decisions in a simulated P2P trading platform. The latter employs the mid-market rate (MMR) local pricing mechanism to enable energy trading among prosumers and penalizes the contribution to the system demand peak of each prosumer. We formulate the P2P trading problem as a multi-agent coordination problem and propose a novel multi-agent deep reinforcement learning (MADRL) method to address it. The proposed method is founded on the combination of the multi-agent deep deterministic policy gradient (MADDPG) algorithm and the technique of parameter sharing (PS), which not only enables accelerating the training speed by sharing experiences and learned policies between all agents in each cluster, but also sustains the policies’ diversity between multiple clusters. To address the non-stationarity and computational complexity of MADRL as well as persevering the privacy of prosumers, the P2P trading platform acts as a trusted third party which augments the market collective trading information to help training of prosumer agents. Experiments with a large-scale real-world data-set involving 300 residential households demonstrate that the proposed MADRL method exhibits a strong generalization capability in the test data-set and outperforms the state-of-the-art MADRL methods with regard to the system operation cost, demand peak as well as computational time.

[1]  R. Shorten,et al.  Analytics for the Sharing Economy: Mathematics, Engineering and Business Perspectives , 2020 .

[2]  Johan A. K. Suykens,et al.  Multi-agent reinforcement learning for modeling and control of thermostatically controlled loads , 2019, Applied Energy.

[3]  Michael Kaisers,et al.  Automated Peer-to-peer Negotiation for Energy Contract Settlements in Residential Cooperatives , 2019, Applied Energy.

[4]  Goran Strbac,et al.  Model-Free Real-Time Autonomous Control for a Residential Multi-Energy System Using Deep Reinforcement Learning , 2020, IEEE Transactions on Smart Grid.

[5]  Yuemin Ding,et al.  Multi-agent deep reinforcement learning based demand response for discrete manufacturing systems energy management , 2020, Applied Energy.

[6]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[7]  J. Ravishankar,et al.  Rebound behaviour of uncoordinated EMS and their impact minimisation , 2020 .

[8]  Tengyu Ma,et al.  Peer-to-peer electricity trading in grid-connected residential communities with household distributed photovoltaic , 2020, Applied Energy.

[9]  Cheng Wang,et al.  Energy-Sharing Model With Price-Based Demand Response for Microgrids of Peer-to-Peer Prosumers , 2017, IEEE Transactions on Power Systems.

[10]  José R. Vázquez-Canteli,et al.  Multi-agent reinforcement learning for adaptive demand response in smart cities , 2019 .

[11]  H. Vincent Poor,et al.  Peer-to-Peer Energy Trading With Sustainable User Participation: A Game Theoretic Approach , 2018, IEEE Access.

[12]  Yan Xu,et al.  A Multi-Agent Reinforcement Learning-Based Data-Driven Method for Home Energy Management , 2020, IEEE Transactions on Smart Grid.

[13]  Zhao Yang Dong,et al.  Decision-Making for Electricity Retailers: A Brief Survey , 2018, IEEE Transactions on Smart Grid.

[14]  Thomas Morstyn,et al.  Multiclass Energy Management for Peer-to-Peer Energy Trading Driven by Prosumer Preferences , 2019, IEEE Transactions on Power Systems.

[15]  Archie C. Chapman,et al.  Peer-to-Peer Energy Systems for Connected Communities: A Review of Recent Advances and Emerging Challenges , 2020, Applied Energy.

[16]  Tao Chen,et al.  Local Energy Trading Behavior Modeling With Deep Reinforcement Learning , 2018, IEEE Access.

[17]  Mohammad Shahidehpour,et al.  Coalitional Game-Based Transactive Energy Management in Local Energy Communities , 2020, IEEE Transactions on Power Systems.

[18]  Saeid Nahavandi,et al.  Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications , 2018, IEEE Transactions on Cybernetics.

[19]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[20]  H. Vincent Poor,et al.  A Motivational Game-Theoretic Approach for Peer-to-Peer Energy Trading in the Smart Grid , 2019, Applied Energy.

[21]  Junwei Cao,et al.  Optimal energy management strategies for energy Internet via deep reinforcement learning approach , 2019, Applied Energy.

[22]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[23]  Meng Cheng,et al.  Peer-to-Peer energy trading in a Microgrid , 2018, Applied Energy.

[24]  Yue Zhou,et al.  Evaluation of peer-to-peer energy sharing mechanisms based on a multiagent simulation framework , 2018, Applied Energy.

[25]  George A. Vouros,et al.  Fuzzy Q-Learning for multi-agent decentralized energy management in microgrids , 2018, Applied Energy.

[26]  Jianzhong Wu,et al.  Framework design and optimal bidding strategy for ancillary service provision from a peer-to-peer energy trading community , 2020, Applied Energy.

[27]  H. Vincent Poor,et al.  Energy Storage Sharing in Smart Grid: A Modified Auction-Based Approach , 2015, IEEE Transactions on Smart Grid.

[28]  Jeff Smith,et al.  Distributed Energy Resources Takes Center Stage: A Renewed Spotlight on the Distribution Planning Process , 2018, IEEE Power and Energy Magazine.

[29]  Dimitrios Papadaskalopoulos,et al.  Exploring the effects of local energy markets on electricity retailers and customers , 2020 .

[30]  Steven R. Weller,et al.  Residential load and rooftop PV generation: an Australian distribution network dataset , 2017 .

[31]  Goran Strbac,et al.  Computationally Efficient Pricing and Benefit Distribution Mechanisms for Incentivizing Stable Peer-to-Peer Energy Trading , 2021, IEEE Internet of Things Journal.

[32]  Bowon Lee,et al.  Automatic P2P Energy Trading Model Based on Reinforcement Learning Using Long Short-Term Delayed Reward , 2020, Energies.

[33]  Amjad Anvari-Moghaddam,et al.  A multi-agent based energy management solution for integrated buildings and microgrid system , 2017 .

[34]  Thomas Morstyn,et al.  Using peer-to-peer energy-trading platforms to incentivize prosumers to form federated power plants , 2018, Nature Energy.

[35]  Goran Strbac,et al.  Nonlinear and Randomized Pricing for Distributed Management of Flexible Loads , 2016, IEEE Transactions on Smart Grid.

[36]  L. Jiang,et al.  Energy Consumption Scheduling of HVAC Considering Weather Forecast Error Through the Distributionally Robust Approach , 2018, IEEE Transactions on Industrial Informatics.

[37]  Jianzhong Wu,et al.  State-of-the-Art Analysis and Perspectives for Peer-to-Peer Energy Trading , 2020, Engineering.

[38]  Thomas Kunz,et al.  Peer-to-peer energy trading among smart homes , 2019, Applied Energy.

[39]  Pierre Pinson,et al.  Consensus-Based Approach to Peer-to-Peer Electricity Markets With Product Differentiation , 2018, IEEE Transactions on Power Systems.

[40]  Haibo He,et al.  Model-Free Real-Time EV Charging Scheduling Based on Deep Reinforcement Learning , 2019, IEEE Transactions on Smart Grid.

[41]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[42]  Pablo Hernandez-Leal,et al.  A Survey of Learning in Multiagent Environments: Dealing with Non-Stationarity , 2017, ArXiv.

[43]  Ulf J. J. Hahnel,et al.  Becoming prosumer: Revealing trading preferences and decision-making strategies in peer-to-peer energy communities , 2020 .

[44]  Alfonso Capozzoli,et al.  Deep reinforcement learning to optimise indoor temperature control and heating energy consumption in buildings , 2020 .

[45]  Jianzhong Wu,et al.  Peer-to-peer energy trading in a community microgrid , 2017, 2017 IEEE Power & Energy Society General Meeting.