Balancing comfort and energy consumption of a heat pump using batch reinforcement learning with fitted Q-iteration
暂无分享,去创建一个
Abstract In this study, a heat pump satisfies the heating and cooling needs of a building, and two water tanks store heat and cold respectively. Reinforcement learning (RL) is a model-free control approach that can learn from the behaviour of the occupants, weather conditions, and the thermal behaviour of the building in order to make near-optimal decisions. In this work we use of a specific RL technique called batch Q-learning, and integrate it into the urban building energy simulator CitySim. The goal of the controller is to reduce the energy consumption while maintaining adequate comfort temperatures.
[1] Peter Dayan,et al. Q-learning , 1992, Machine Learning.
[2] Lei Yang,et al. Reinforcement learning for optimal control of low exergy buildings , 2015 .
[3] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.
[4] Bart De Schutter,et al. Reinforcement Learning and Dynamic Programming Using Function Approximators , 2010 .