Energy-Efficient Thermal Comfort Control in Smart Buildings via Deep Reinforcement Learning

Heating, Ventilation, and Air Conditioning (HVAC) is extremely energy-consuming, accounting for 40% of total building energy consumption. Therefore, it is crucial to design some energy-efficient building thermal control policies which can reduce the energy consumption of HVAC while maintaining the comfort of the occupants. However, implementing such a policy is challenging, because it involves various influencing factors in a building environment, which are usually hard to model and may be different from case to case. To address this challenge, we propose a deep reinforcement learning based framework for energy optimization and thermal comfort control in smart buildings. We formulate the building thermal control as a cost-minimization problem which jointly considers the energy consumption of HVAC and the thermal comfort of the occupants. To solve the problem, we first adopt a deep neural network based approach for predicting the occupants' thermal comfort, and then adopt Deep Deterministic Policy Gradients (DDPG) for learning the thermal control policy. To evaluate the performance, we implement a building thermal control simulation system and evaluate the performance under various settings. The experiment results show that our method can improve the thermal comfort prediction accuracy, and reduce the energy consumption of HVAC while improving the occupants' thermal comfort.

[1]  William J. Batty,et al.  Fuzzy control strategies to provide cost and energy efficient high quality indoor environments in buildings with high occupant densities , 2003 .

[2]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[3]  Mohammad Bagher Menhaj,et al.  Training feedforward networks with the Marquardt algorithm , 1994, IEEE Trans. Neural Networks.

[4]  Raj Jain,et al.  An Internet of Things Framework for Smart Energy in Buildings: Designs, Prototype, and Experiments , 2015, IEEE Internet of Things Journal.

[5]  Anastasios I. Dounis,et al.  Advanced control systems engineering for energy and comfort management in a building environment--A review , 2009 .

[6]  Jianlei Niu,et al.  Thermal comfort models: A review and numerical investigation , 2012 .

[7]  Philip Haves,et al.  Model predictive control for the operation of building cooling systems , 2010, Proceedings of the 2010 American Control Conference.

[8]  Tianshu Wei,et al.  Deep reinforcement learning for building HVAC control , 2017, 2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC).

[9]  Guy Lever,et al.  Deterministic Policy Gradient Algorithms , 2014, ICML.

[10]  Kalyan Veeramachaneni,et al.  Using reinforcement learning to optimize occupant comfort and energy usage in HVAC systems , 2014, J. Ambient Intell. Smart Environ..

[11]  Gianfranco Rizzo,et al.  The control of indoor thermal comfort conditions: introducing a fuzzy adaptive controller , 2004 .

[12]  Li Xia,et al.  A multi-grid reinforcement learning method for energy conservation and comfort of HVAC in buildings , 2015, 2015 IEEE International Conference on Automation Science and Engineering (CASE).

[13]  P. André,et al.  Optimal heating control in a passive solar commercial building , 2001 .

[14]  Biao Huang,et al.  A Long-Short Term Memory Recurrent Neural Network Based Reinforcement Learning Controller for Office Heating Ventilation and Air Conditioning Systems , 2017 .

[15]  Myra,et al.  Autonomous HVAC Control, A Reinforcement Learning Approach , 2018 .

[16]  Marco Wiering,et al.  Reinforcement Learning and Markov Decision Processes , 2012, Reinforcement Learning.

[17]  Kazem Sohraby,et al.  IoT Considerations, Requirements, and Architectures for Smart Buildings—Energy Optimization and Next-Generation Building Management Systems , 2017, IEEE Internet of Things Journal.

[18]  Marco Filippi,et al.  Perception of the thermal environment in high school and university classrooms: Subjective preferences and thermal comfort , 2007 .

[19]  D. Kolokotsa,et al.  Reinforcement learning for energy conservation and comfort in buildings , 2007 .

[20]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[21]  Jochen Schmidt,et al.  Towards the Intelligent Home: Using Reinforcement-Learning for Optimal Heating Control , 2013, KI.

[22]  G. J. Levermore,et al.  Building energy management systems : application to heating and control , 1992 .

[23]  Tianshu Wei,et al.  Co-scheduling of HVAC control, EV charging and battery usage for building energy efficiency , 2014, 2014 IEEE/ACM International Conference on Computer-Aided Design (ICCAD).

[24]  Anastasios I. Dounis,et al.  Comparison of Conventional and Fuzzy Control of Indoor Air Quality in Buildings , 1996, J. Intell. Fuzzy Syst..

[25]  G. Uhlenbeck,et al.  On the Theory of the Brownian Motion , 1930 .

[26]  Alberto Sangiovanni-Vincentelli,et al.  Model-Based Hierarchical Optimal Control Design for HVAC Systems , 2014 .

[27]  Martin T. Hagan,et al.  Gauss-Newton approximation to Bayesian learning , 1997, Proceedings of International Conference on Neural Networks (ICNN'97).

[28]  D. Nikovski,et al.  A Method for Computing Optimal SetPoint Schedule for HVAC Systems , 2013 .

[29]  M Morari,et al.  Energy efficient building climate control using Stochastic Model Predictive Control and weather predictions , 2010, Proceedings of the 2010 American Control Conference.

[30]  Jennie Si,et al.  Robust Reinforcement Learning for Heating, Ventilation, and Air Conditioning Control of Buildings , 2004 .

[31]  Shengwei Wang,et al.  Model-based optimal control of VAV air-conditioning system using genetic algorithm , 2000 .

[32]  Jerald D. Parker,et al.  Heating, Ventilating, and Air Conditioning: Analysis and Design , 1977 .

[33]  Luis Pérez-Lombard,et al.  A review on buildings energy consumption information , 2008 .