Containerized framework for building control performance comparisons: model predictive control vs deep reinforcement learning control

While both model predictive control (MPC) and deep reinforcement learning control (DRL) have shown significant energy cost savings for building systems, there is a lack of in-depth quantitative study on the comparison between the two. One major obstacle is the lack of a holistic evaluation environment for the building community and the control community to integrate their expertise in studying both model-based and learning-based control methods. To address this challenge, this paper presents a scalable containerized software framework for building control performance comparisons, with a special focus on enabling both optimal model-based control and deep learning-based control. The framework provides a standardized building environment for the control community to benchmark different advanced control strategies, and a flexible software architecture for the building community to standardize their own customized building environments. A preliminary performance comparison of MPC and DRL on a single zone building is performed in the case study. Both MPC and DRL can outperform the rule-based baseline controllers in terms of reducing energy cost and maintaining thermal discomfort. DRL can outperform MPC after a long training time with a predefined reward policy.

[1]  Andreas Junghanns,et al.  The Functional Mockup Interface for Tool independent Exchange of Simulation Models , 2011 .

[2]  Zheng O'Neill,et al.  A flexible and generic functional mock-up unit based threat injection framework for grid-interactive efficient buildings: A case study in Modelica , 2021 .

[3]  Manfred Morari,et al.  Model Predictive Climate Control of a Swiss Office Building: Implementation, Results, and Cost–Benefit Analysis , 2016, IEEE Transactions on Control Systems Technology.

[4]  Wei Liang,et al.  MPC control for improving energy efficiency of a building air handler for multi-zone VAVs , 2015 .

[5]  Draguna Vrabie,et al.  Simulation and experimental demonstration of model predictive control in a building HVAC system , 2015 .

[6]  Manfred Morari,et al.  Increasing energy efficiency in building climate control using weather forecasts and model predictive control , 2010 .

[7]  Moritz Diehl,et al.  CasADi: a software framework for nonlinear optimization and optimal control , 2018, Mathematical Programming Computation.

[8]  Yuval Tassa,et al.  MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  Tianshu Wei,et al.  Deep reinforcement learning for building HVAC control , 2017, 2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC).

[10]  2013 , 2018, Eu minha tía e o golpe do atraso.

[11]  Dong Yan,et al.  Tianshou: a Highly Modularized Deep Reinforcement Learning Library , 2021, ArXiv.

[12]  Dirk Müller,et al.  Application of two promising Reinforcement Learning algorithms for load shifting in a cooling supply system , 2020, Energy and Buildings.

[13]  전세경 2015 , 2018, Eu minha tía e o golpe do atraso.

[14]  Thierry S. Nouidui,et al.  Modelica Buildings library , 2014 .

[15]  Maher Kayal,et al.  Comparison of MPC Formulations for Building Control under Commercial Time-of-Use Tariffs , 2019, 2019 IEEE Milan PowerTech.

[16]  Charles Anderson,et al.  Docker [Software engineering] , 2015 .

[17]  Michael Wetter,et al.  Software Architecture and Implementation of Modelica Buildings Library Coupling for Spawn of EnergyPlus , 2021, Proceedings of 14th Modelica Conference 2021, Linköping, Sweden, September 20-24, 2021.

[18]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[19]  Daniel E. Fisher,et al.  EnergyPlus: creating a new-generation building energy simulation program , 2001 .

[20]  Khee Poh Lam,et al.  Whole building energy model for HVAC optimal control: A practical framework based on deep reinforcement learning , 2019, Energy and Buildings.

[21]  Zheng O'Neill,et al.  One for Many: Transfer Learning for Building HVAC Control , 2020, BuildSys@SenSys.

[22]  Tetiana Bogodorova,et al.  ModelicaGym: applying reinforcement learning to Modelica models , 2019, EOOLT.

[23]  Zheng O'Neill,et al.  Model-assisted Learning-based Framework for Sensor Fault-Tolerant Building HVAC Control , 2021, ArXiv.

[24]  Hilding Elmqvist,et al.  Physical system modeling with Modelica , 1998 .

[25]  José A. Candanedo,et al.  Model-based predictive control of an ice storage device in a building cooling system , 2013 .

[26]  Draguna Vrabie,et al.  Prototyping the BOPTEST Framework for Simulation-Based Testing of Advanced Control Strategies in Buildings , 2019 .

[27]  Zheng O'Neill,et al.  Learning-based framework for sensor fault-tolerant building HVAC control with model-assisted learning , 2021, Proceedings of the 8th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation.

[28]  Colin N. Jones,et al.  Experimental Implementation of Frequency Regulation Services Using Commercial Buildings , 2017, IEEE Transactions on Smart Grid.

[29]  Duncan S. Callaway,et al.  Experimental Demonstration of Frequency Regulation by Commercial Buildings—Part I: Modeling and Hierarchical Control Design , 2016, IEEE Transactions on Smart Grid.

[30]  Sergio Bruno,et al.  A Demand Response Implementation in Tertiary Buildings Through Model Predictive Control , 2019, IEEE Transactions on Industry Applications.