An uncertainty-aware deep reinforcement learning framework for residential air conditioning energy management

Abstract Most existing methods for controlling the energy consumption of air conditioning (AC), focus on either scheduling the switching (on/off) of compressors or optimizing the overall energy consumption of AC system of an entire building. Unlike commercial buildings, residential apartments typically house separate ACs in individual rooms occupied by people with different thermal comfort preferences. Fortunately, the advancement of Internet-of-Things (IoT) technology has enabled the exploitation of sensory data to intelligently control the set-point temperature of ACs in individual rooms based on environmental conditions and occupant’s preferences, improving the energy efficiency of residential buildings. Indeed, control decisions based on sensory data may suffer from uncertainties due to error in data measurement and contribute to model uncertainty. This work proposes a data-driven uncertainty-aware approach to control split-type inverter ACs of residential buildings. First, information from similar AC and residential units are aggregated to reduce data imbalances, and Bayesian-Convolutional-Neural-Networks (BCNNs) are utilized to model the performance and uncertainty of the ACs from the aggregated data. Second, a Q-learning based reinforcement learning algorithm for set-point decision making is designed for setpoint optimization with transitions sampled from the BCNN models. Third, a case study is simulated based on such a framework to show that the control actions taken by the uncertainty-aware agent perform better in terms of discomfort management and energy savings compared to the uncertainty unaware agent. Further, the agent could also be adjusted to capture the trade-off between energy savings and comfort levels for varying degrees of energy and discomfort savings.

[1]  Paul Cooper,et al.  Hybrid model predictive control of a residential HVAC system with on-site thermal energy generation and storage , 2017 .

[2]  Kaamran Raahemifar,et al.  Artificial neural network (ANN) based model predictive control (MPC) and optimization of HVAC systems: A state of the art review and case study of a residential HVAC system , 2017 .

[3]  Bart De Schutter,et al.  Residential Demand Response of Thermostatically Controlled Loads Using Batch Reinforcement Learning , 2017, IEEE Transactions on Smart Grid.

[4]  Qing-Shan Jia,et al.  Optimal Control of Multiroom HVAC System: An Event-Based Approach , 2016, IEEE Transactions on Control Systems Technology.

[5]  Yishay Mansour,et al.  Learning Rates for Q-learning , 2004, J. Mach. Learn. Res..

[6]  Dimitris Kanellopoulos,et al.  Handling imbalanced datasets: A review , 2006 .

[7]  Mohsen Guizani,et al.  Semisupervised Deep Reinforcement Learning in Support of IoT and Smart City Services , 2018, IEEE Internet of Things Journal.

[8]  Yuren Zhou,et al.  A survey of data fusion in smart city applications , 2019, Inf. Fusion.

[9]  Frauke Oldewurtel,et al.  Experimental analysis of model predictive control for an energy efficient building heating system , 2011 .

[10]  Zhenjun Ma,et al.  Supervisory and Optimal Control of Building HVAC Systems: A Review , 2008 .

[11]  Ching Man Chan,et al.  Energy conservation through smart homes in a smart city: A lesson for Singapore households , 2017 .

[12]  Kristin L. Wood,et al.  Data Driven Electricity Management for Residential Air Conditioning Systems: An Experimental Approach , 2019, IEEE Transactions on Emerging Topics in Computing.

[13]  Leslie K. Norford,et al.  Optimal control of HVAC and window systems for natural ventilation through reinforcement learning , 2018, Energy and Buildings.

[14]  Jianhui Wang,et al.  Review of real-time electricity markets for integrating Distributed Energy Resources and Demand Response , 2015 .

[15]  Mahdi Shahbakhti,et al.  Building Efficiency and Sustainability in the Tropics ( SinBerBEST ) Title Handling model uncertainty in model predictive control for energy efficient buildings Permalink , 2014 .

[16]  Shen Li,et al.  RDeepSense: Reliable Deep Mobile Computing Models with Uncertainty Estimations , 2017, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[17]  Tom Schaul,et al.  Rainbow: Combining Improvements in Deep Reinforcement Learning , 2017, AAAI.

[18]  Henk Visscher,et al.  Performance gaps in energy consumption: household groups and building characteristics , 2018 .

[19]  Paris A. Fokaides,et al.  European smart cities: The role of zero energy buildings , 2015 .

[20]  Talal Rahwan,et al.  Automatic HVAC Control with Real-time Occupancy Recognition and Simulation-guided Model Predictive Control in Low-cost Embedded System , 2017, ArXiv.

[21]  Roland Siegwart,et al.  Control of a Quadrotor With Reinforcement Learning , 2017, IEEE Robotics and Automation Letters.

[22]  Michael Stadler,et al.  Quantifying Flexibility of Commercial and Residential Loads for Demand Response using Setpoint Changes , 2016 .

[23]  Farrokh Janabi-Sharifi,et al.  Gray-box modeling and validation of residential HVAC system for control system design , 2015 .

[24]  Can Cui,et al.  An energy-saving oriented air balancing method for demand controlled ventilation systems with branch and black-box model , 2020, Applied Energy.

[25]  Ning Zhang,et al.  Probabilistic individual load forecasting using pinball loss guided LSTM , 2019, Applied Energy.

[26]  Eva Žáčeková,et al.  Towards the real-life implementation of MPC for an office building: Identification issues , 2014 .

[27]  Mugen Peng,et al.  Deep Reinforcement Learning-Based Mode Selection and Resource Management for Green Fog Radio Access Networks , 2018, IEEE Internet of Things Journal.

[28]  Lingyang Song,et al.  Reinforcement Learning for Decentralized Trajectory Design in Cellular UAV Networks With Sense-and-Send Protocol , 2018, IEEE Internet of Things Journal.

[29]  Taher Niknam,et al.  Probabilistic Load Forecasting Using an Improved Wavelet Neural Network Trained by Generalized Extreme Learning Machine , 2018, IEEE Transactions on Smart Grid.

[30]  Yuren Zhou,et al.  Benchmarking air-conditioning energy performance of residential rooms based on regression and clustering techniques , 2019, Applied Energy.

[31]  Wei Zhang,et al.  Aggregated Modeling and Control of Air Conditioning Loads for Demand Response , 2013 .