A systematic feature extraction and selection framework for data-driven whole-building automated fault detection and diagnostics in commercial buildings

Abstract In data-driven automated fault detection and diagnostics (AFDD) modeling for building energy systems, feature engineering is a critical process of extracting information from high-dimensional and noisy sensor measurement and turning it into informative and representative inputs or features for data-driven modeling. However, few studies specifically discuss the feature engineering, especially the interactions between feature extraction and feature selection in whole-building AFDD. We developed a systematic feature extraction and selection framework for whole-building AFDD. In this framework, features are aggressively extracted from raw sensor data using statistical feature extraction techniques with various window sizes and statistics. With many features extracted, a hybrid feature selection algorithm that combines the filter and wrapper method then selects the best feature set. The framework considers diversity in the duration of fault behavior among fault types in whole-building AFDD, thus achieving high model generalization. We implemented our developed framework in a virtual testbed calibrated with measured data from Oak Ridge National Laboratory's Flexible Research Platform designed to mimic the operation of a typical small commercial building. The AFDD model is trained by the simulation data generated from the virtual testbed. The results show that (1) the developed framework improves the generalization of the AFDD model by 10.7% compared with literature-reported feature extraction and selection methods and (2) features with diverse window sizes and statistics are selected, providing insight into physical systems beyond the current understanding of buildings and faults and improving the detection and diagnostics of multiple fault types.

[1]  Wen Shen,et al.  ARX model based fault detection and diagnosis for chillers using support vector machines , 2014 .

[2]  Liang Zhang,et al.  Metrics and Methods to Assess Building Fault Detection and Diagnosis Tools , 2019 .

[3]  Michael A. Gerber,et al.  EnergyPlus Energy Simulation Software , 2014 .

[4]  James E. Braun,et al.  Representing Small Commercial Building Faults in EnergyPlus, Part I: Model Development , 2019, Buildings.

[5]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[6]  Paul Raftery,et al.  Review of automated fault detection and diagnostic tools in air handling units , 2014 .

[7]  John E. Seem,et al.  Using intelligent data analysis to detect abnormal energy consumption in buildings , 2007 .

[8]  Marko Robnik-Sikonja,et al.  Theoretical and Empirical Analysis of ReliefF and RReliefF , 2003, Machine Learning.

[9]  Hua Han,et al.  Study on a hybrid SVM model for chiller FDD applications , 2011 .

[10]  Woohyun Kim,et al.  A review of fault detection and diagnostics methods for building systems , 2018 .

[11]  Joshua Ryan New,et al.  Multiyear Plan for Validation of EnergyPlus Multi-Zone HVAC System Modeling using ORNL's Flexible Research Platform , 2016 .

[12]  James E. Braun,et al.  Representing Small Commercial Building Faults in EnergyPlus, Part II: Model Validation , 2019 .

[13]  Fu Xiao,et al.  Data mining in building automation system for improving building operational performance , 2014 .

[14]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[15]  Guoqiang Hu,et al.  Fault detection and diagnosis for building cooling system with a tree-structured learning method , 2016 .

[16]  Arno Schlueter,et al.  Automated daily pattern filtering of measured building performance data , 2015 .

[17]  Leslie K. Norford,et al.  Robust model-based fault diagnosis for air handling units , 2015 .

[18]  Tak-Chung Fu,et al.  A review on time series data mining , 2011, Eng. Appl. Artif. Intell..

[19]  Ting Wang,et al.  Important sensors for chiller fault detection and diagnosis (FDD) from the perspective of feature selection and machine learning , 2011 .

[20]  Yanfei Li,et al.  A critical review of fault modeling of HVAC systems in buildings , 2018, Building Simulation.

[21]  Jin Wen,et al.  A systematic feature selection procedure for short-term data-driven building energy forecasting model development , 2019, Energy and Buildings.

[22]  Joseph Robertson,et al.  Hybrid Model-Based and Data-Driven Fault Detection and Diagnostics for Commercial Buildings: Preprint , 2016 .

[23]  Krishna R. Pattipati,et al.  Data-Driven Modeling, Fault Diagnosis and Optimal Sensor Selection for HVAC Chillers , 2007, IEEE Transactions on Automation Science and Engineering.

[24]  James E. Braun,et al.  Common Faults and Their Prioritization in Small Commercial Buildings: February 2017 - December 2017 , 2018 .

[25]  Zhenjun Ma,et al.  A decision tree based data-driven diagnostic strategy for air handling units , 2016 .

[26]  Michael R. Brambley,et al.  Advanced Sensors and Controls for Building Applications: Market Assessment and Potential R&D Pathways , 2005 .

[27]  Jessica Granderson,et al.  Building fault detection and diagnostics: Achieved savings, and methods to evaluate algorithm performance , 2020, Building and Environment.

[28]  Srinivas Katipamula,et al.  Review Article: Methods for Fault Detection, Diagnostics, and Prognostics for Building Systems—A Review, Part I , 2005 .

[29]  Guoqiang Hu,et al.  Optimal Sensor Configuration and Feature Selection for AHU Fault Detection and Diagnosis , 2017, IEEE Transactions on Industrial Informatics.