Using reinforcement learning for demand response of domestic hot water buffers: A real-life demonstration

This paper demonstrates a data-driven control approach for demand response in real-life residential buildings. The objective is to optimally schedule the heating cycles of the Domestic Hot Water (DHW) buffer to maximize the self-consumption of the local photovoltaic (PV) production. A model-based reinforcement learning technique is used to tackle the underlying sequential decision-making problem. The proposed algorithm learns the stochastic occupant behavior, predicts the PV production and takes into account the dynamics of the system. A real-life experiment with six residential buildings is performed using this algorithm. The results show that the self-consumption of the PV production is significantly increased, compared to the default thermostat control.

[1]  W. H. Dines Weather Forecasting , 1914, Nature.

[2]  Geert Deconinck,et al.  Beyond theory: Experimental results of a self-learning air conditioning unit , 2016, 2016 IEEE International Energy Conference (ENERGYCON).

[3]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[4]  Chiara Delmastro,et al.  Generalizable occupant-driven optimization model for domestic hot water production in NZEB , 2016 .

[5]  Koen Vanthournout,et al.  LINEAR breakthrough project: Large-scale implementation of smart grid technologies in distribution grids , 2012, 2012 3rd IEEE PES Innovative Smart Grid Technologies Europe (ISGT Europe).

[6]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[7]  D. Ernst,et al.  Approximate Value Iteration in the Reinforcement Learning Context. Application to Electrical Power System Control. , 2005 .

[8]  Bart De Schutter,et al.  Residential Demand Response of Thermostatically Controlled Loads Using Batch Reinforcement Learning , 2017, IEEE Transactions on Smart Grid.

[9]  M. Webber,et al.  The impacts of storing solar energy in the home to reduce reliance on the utility , 2017, Nature Energy.

[10]  Pierre Geurts,et al.  Tree-Based Batch Mode Reinforcement Learning , 2005, J. Mach. Learn. Res..

[11]  Johan Driesen,et al.  LV distribution network feeders in Belgium and power quality issues due to increasing PV penetration levels , 2012, 2012 3rd IEEE PES Innovative Smart Grid Technologies Europe (ISGT Europe).

[12]  Louis Wehenkel,et al.  Reinforcement Learning Versus Model Predictive Control: A Comparison on a Power System Problem , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[13]  Bart De Schutter,et al.  Residential Demand Response Applications Using Batch Reinforcement Learning , 2015, ArXiv.