Decentralized Control of DR Using a Multi-agent Method

Demand response (DR) is one of the most cost-effective elements of residential and small industrial building for the purpose of reducing the cost of energy. Today with broadening of the smart grid, electricity market and especially smart home, using DR can reduce cost and even make profits for consumers. On the other hand, utilizing centralized controls and have bidirectional communications Bi-directional communication between DR aggregators and consumers make many problems such as scalability and privacy violation. In this chapter, we propose a multi-agent method based on a Q-learning algorithm Q-learning algorithm for decentralized control of DR. Q-learning is a model-free reinforcement learning Reinforcement learning technique and a simple way for agents to learn how to act optimally in controlled Markovian domains. With this method, each consumer adapts its bidding and buying strategy over time according to the market outcomes. We consider energy supply for consumers such as small-scale renewable energy generators. We compare the result of the proposed method with a centralized aggregator-based approach that shows the effectiveness of the proposed decentralized DR market Decentralized DR market.

[1]  Hanchen Xu,et al.  The values of market-based demand response on improving power system reliability under extreme circumstances , 2017 .

[2]  Jhi-Young Joo,et al.  Option Valuation Applied to Implementing Demand Response via Critical Peak Pricing , 2007, 2007 IEEE Power Engineering Society General Meeting.

[3]  Dragan Maksimovic,et al.  Electric vehicle charge optimization including effects of lithium-ion battery degradation , 2011, 2011 IEEE Vehicle Power and Propulsion Conference.

[4]  J. K. Kok,et al.  Intelligence in Electricity Networks for Embedding Renewables and Distributed Generation , 2010 .

[5]  M. Ilic,et al.  Optimal Charge Control of Plug-In Hybrid Electric Vehicles in Deregulated Electricity Markets , 2011, IEEE Transactions on Power Systems.

[6]  M. Hadi Amini,et al.  A Decentralized Framework for Real-Time Energy Trading in Distribution Networks with Load and Generation Uncertainty , 2017, ArXiv.

[7]  Ufuk Topcu,et al.  Optimal decentralized protocol for electric vehicle charging , 2011, IEEE Transactions on Power Systems.

[8]  Fei Wang,et al.  Dynamic Price Vector Formation Model-Based Automatic Demand Response Strategy for PV-Assisted EV Charging Stations , 2017, IEEE Transactions on Smart Grid.

[9]  S. Ashok,et al.  Optimal operation of industrial cogeneration for load management , 2003 .

[10]  Ying Li,et al.  Automated Residential Demand Response: Algorithmic Implications of Pricing Models , 2012, IEEE Transactions on Smart Grid.

[11]  Mahmoud-Reza Haghifam,et al.  Load management using multi-agent systems in smart distribution network , 2013, 2013 IEEE Power & Energy Society General Meeting.

[12]  Gabriela Hug,et al.  Agent-Based Distributed Security Constrained Optimal Power Flow , 2018, IEEE Transactions on Smart Grid.

[13]  Göran Andersson,et al.  Optimal bidding of plug-in electric vehicles in a market-based control setup , 2014, 2014 Power Systems Computation Conference.

[14]  G. Andersson,et al.  Centralized and decentralized approaches to smart charging of plug-in Vehicles , 2012, 2012 IEEE Power and Energy Society General Meeting.

[15]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[16]  Farhad Kamyab,et al.  Demand Response Program in Smart Grid Using Supply Function Bidding Mechanism , 2016, IEEE Transactions on Smart Grid.

[17]  João P. S. Catalão,et al.  A Decentralized Electricity Market Scheme Enabling Demand Response Deployment , 2018, IEEE Transactions on Power Systems.

[18]  A. Philpott,et al.  Optimizing demand-side bids in day-ahead electricity markets , 2006, IEEE Transactions on Power Systems.

[19]  Hosam K. Fathy,et al.  Plug-in hybrid electric vehicle charge pattern optimization for energy cost and battery longevity , 2011 .

[20]  Wilfried Elmenreich,et al.  Residential demand response scheme based on adaptive consumption level pricing , 2016 .

[21]  Chris Watkins,et al.  Learning from delayed rewards , 1989 .

[22]  Damien Ernst,et al.  A comparison of Nash equilibria analysis and agent-based modelling for power markets , 2006 .

[23]  H. Aalami,et al.  Optimum Time of Use program proposal for Iranian Power Systems , 2009, 2009 International Conference on Electric Power and Energy Conversion Systems, (EPECS).

[24]  Miguel Azenha,et al.  Optimal behavior of responsive residential demand considering hybrid phase change materials , 2016 .

[25]  Gabriela Hug,et al.  Fully distributed corrective security constrained optimal power flow , 2017, 2017 IEEE Manchester PowerTech.