论文信息 - Balancing comfort and energy consumption of a heat pump using batch reinforcement learning with fitted Q-iteration

Balancing comfort and energy consumption of a heat pump using batch reinforcement learning with fitted Q-iteration

Abstract In this study, a heat pump satisfies the heating and cooling needs of a building, and two water tanks store heat and cold respectively. Reinforcement learning (RL) is a model-free control approach that can learn from the behaviour of the occupants, weather conditions, and the thermal behaviour of the building in order to make near-optimal decisions. In this work we use of a specific RL technique called batch Q-learning, and integrate it into the urban building energy simulator CitySim. The goal of the controller is to reduce the energy consumption while maintaining adequate comfort temperatures.

José R. Vázquez-Canteli | Zoltán Nagy | Jérôme Kämpf | Z. Nagy | J. Kämpf

[1] Peter Dayan,et al. Q-learning , 1992, Machine Learning.

[2] Lei Yang,et al. Reinforcement learning for optimal control of low exergy buildings , 2015 .

[3] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.

[4] Bart De Schutter,et al. Reinforcement Learning and Dynamic Programming Using Function Approximators , 2010 .