Automated Control of Transactive HVACs in Energy Distribution Systems

Heating, Ventilation, and Air Conditioning (HVAC) systems contribute significantly to a building’s energy consumption. In the recent years, there is an increased interest in developing transactive approaches which could enable automated and flexible scheduling of HVAC systems based on the customer demand and the electricity prices decided by the suppliers. Flexible and automated scheduling of the HVAC systems make it a prime source for participation in residential demand response or transactive energy systems. Therefore, it is of significant interest to identify an optimal strategy to control the HVAC systems. In this article, reducing the energy cost while keeping the comfort level acceptable to the users, we argue that such a control strategy should consider both the energy cost and user comfort simultaneously. Accordingly, we develop the control strategy through the solution of an optimization problem that balances between the energy cost and consumer’s dissatisfaction. This optimization enables us to solve a decision-making problem through first price prediction and then choosing HVAC temperature settings throughout the day based on the predicted price, history of the price and HVAC settings, and outside temperature. More specifically, we formulate the control design as a Markov decision process (MDP) using deep neural networks and use Deep Deterministic Policy Gradients (DDPG)-based deep reinforcement learning algorithm to find the optimal control strategy for HVAC systems that balances between electricity cost and user comfort.

[1]  Guy Lever,et al.  Deterministic Policy Gradient Algorithms , 2014, ICML.

[2]  Sergey Levine,et al.  Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.

[3]  Bart De Schutter,et al.  Residential Demand Response of Thermostatically Controlled Loads Using Batch Reinforcement Learning , 2017, IEEE Transactions on Smart Grid.

[4]  Peter B. Luh,et al.  Building Energy Management: Integrated Control of Active and Passive Heating, Cooling, Lighting, Shading, and Ventilation Systems , 2013, IEEE Transactions on Automation Science and Engineering.

[5]  Lei Yang,et al.  Reinforcement learning for optimal control of low exergy buildings , 2015 .

[6]  R. Belmans,et al.  Reinforcement Learning Applied to an Electric Water Heater: From Theory to Practice , 2015, IEEE Transactions on Smart Grid.

[7]  Hamed Mohsenian Rad,et al.  Optimal Residential Load Control With Price Prediction in Real-Time Electricity Pricing Environments , 2010, IEEE Transactions on Smart Grid.

[8]  R. Weron Electricity price forecasting: A review of the state-of-the-art with a look into the future , 2014 .

[9]  Gerard J. M. Smit,et al.  Management and Control of Domestic Smart Grid Technology , 2010, IEEE Transactions on Smart Grid.

[10]  D. Chassin,et al.  Analysis of Residential Demand Response and double-auction markets , 2011, 2011 IEEE Power and Energy Society General Meeting.

[11]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[12]  Simeng Liu,et al.  Evaluation of reinforcement learning for optimal control of building active and passive thermal storage inventory , 2007 .

[13]  José R. Vázquez-Canteli,et al.  Reinforcement learning for demand response: A review of algorithms and modeling techniques , 2019, Applied Energy.

[14]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[15]  Xing Yan,et al.  Mid-term electricity market clearing price forecasting: A multiple SVM approach , 2014 .

[16]  Qing-Shan Jia,et al.  Optimal Control of Multiroom HVAC System: An Event-Based Approach , 2016, IEEE Transactions on Control Systems Technology.

[17]  Tao Jiang,et al.  Online Energy Management for a Sustainable Smart Home With an HVAC Load and Random Occupancy , 2017, IEEE Transactions on Smart Grid.

[18]  Steven T. Bushby,et al.  NIST Transactive Energy Modeling and Simulation Challenge Phase II Final Report , 2019 .

[19]  Siliang Lu,et al.  A DEEP REINFORCEMENT LEARNING APPROACH TO USINGWHOLE BUILDING ENERGYMODEL FOR HVAC OPTIMAL CONTROL , 2018 .

[20]  J. Braun,et al.  Load Control Using Building Thermal Mass , 2003 .

[21]  Damien Picard,et al.  Cloud-based implementation of white-box model predictive control for a GEOTABS office building: A field test demonstration , 2020 .

[22]  M. P. Moghaddam,et al.  Optimal real time pricing in an agent-based retail market using a comprehensive demand response model , 2011 .

[23]  Junjie Wu,et al.  Event-Based HVAC Control—A Complexity-Based Approach , 2018, IEEE Transactions on Automation Science and Engineering.

[24]  Laura Sacerdote,et al.  The Ornstein–Uhlenbeck neuronal model with signal-dependent noise , 2001 .

[25]  Haoran Zhao,et al.  Transactive energy: A review of state of the art and implementation , 2017, 2017 IEEE Manchester PowerTech.

[26]  Pierluigi Siano,et al.  Demand response and smart grids—A survey , 2014 .

[27]  Philip Haves,et al.  Model predictive control for the operation of building cooling systems , 2010, Proceedings of the 2010 American Control Conference.

[28]  Christos T. Maravelias,et al.  Mixed-integer optimization methods for online scheduling in large-scale HVAC systems , 2020, Optim. Lett..

[29]  Adriana Chis,et al.  Reinforcement Learning-Based Plug-in Electric Vehicle Charging With Forecasted Price , 2017, IEEE Transactions on Vehicular Technology.

[30]  Chongqing Kang,et al.  Review and prospect of integrated demand response in the multi-energy system , 2017 .

[31]  Herke van Hoof,et al.  Addressing Function Approximation Error in Actor-Critic Methods , 2018, ICML.

[32]  Seung Ho Hong,et al.  Demand Response for Home Energy Management Using Reinforcement Learning and Artificial Neural Network , 2019, IEEE Transactions on Smart Grid.

[33]  Andrew Fisher,et al.  FNCS: a framework for power system and communication networks co-simulation , 2014, SpringSim.

[34]  Donald J. Hammerstrom,et al.  Simulation-Based Valuation of Transactive Energy Systems , 2019, IEEE Transactions on Power Systems.

[35]  Per Heiselberg,et al.  Energy flexibility of a nearly zero-energy building with weather predictive control on a convective building energy system and evaluated with different metrics , 2019 .

[36]  Martin Kozek,et al.  A general approach for mixed-integer predictive control of HVAC systems using MILP , 2018 .

[37]  Marco Levorato,et al.  Residential Demand Response Using Reinforcement Learning , 2010, 2010 First IEEE International Conference on Smart Grid Communications.