Performance Evaluation of Techniques for Identifying Abnormal Energy Consumption in Buildings

Energy consumption in buildings has steadily increased. Buildings consume more energy than necessary due to suboptimal design and operation. Apart from retro-fitting, not much can be done with the design of the existing building, but the operation of the building can be improved. Ignoring or failing to fix the faults can lead to problems like the higher cost in excess energy usage or premature component failure. At the same time understanding, identifying, and addressing abnormal energy consumption in buildings can lead to energy savings and detection of faulty appliances. This paper investigates two key challenges found in energy anomaly detection research: 1) the lack of labeled ground truth and 2) the lack of consistent performance accuracy metrics. In the first challenge, labeled ground truth is imperative for training and benchmarking algorithms to detect anomalies. In the second challenge, consistent performance accuracy metrics are crucial to quantifying how well algorithms perform against each other. There exists no publicly available energy consumption dataset with labeled anomaly events. Therefore, we propose two approaches that help in the automatic annotation of the ground truth data from publicly available datasets: a statistical approach for short-term data and a piecewise linear regression method for long-term data. We demonstrate these approaches using two publicly available datasets called Dataport (Pecan Street) and HUE. Using different existing accuracy metrics, we run a series of experiments on anomaly detection algorithms and discuss what metrics can be best used for consistent accuracy testing amongst researchers. In addition, while providing the source code, we also release an anomaly annotated dataset produced by this source code.

[1]  Sameer Singh,et al.  Novelty detection: a review - part 1: statistical approaches , 2003, Signal Process..

[2]  Zhi-Hua Zhou,et al.  Ensemble Methods: Foundations and Algorithms , 2012 .

[3]  Alex Rogers,et al.  Dataport and NILMTK: A building data set designed for non-intrusive load monitoring , 2015, 2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[4]  Fiorella Lauro,et al.  Fault detection analysis using data mining techniques for a cluster of smart office buildings , 2015, Expert Syst. Appl..

[5]  Fu Xiao,et al.  Analytical investigation of autoencoder-based methods for unsupervised anomaly detection in building energy data , 2018 .

[6]  Priyanka Singh,et al.  Integration of new evolutionary approach with artificial neural network for solving short term load forecast problem , 2018 .

[7]  Victoria J. Hodge,et al.  A Survey of Outlier Detection Methodologies , 2004, Artificial Intelligence Review.

[8]  M. Shyu,et al.  A Novel Anomaly Detection Scheme Based on Principal Component Classifier , 2003 .

[9]  Andreas Dengel,et al.  Histogram-based Outlier Score (HBOS): A fast Unsupervised Anomaly Detection Algorithm , 2012 .

[10]  David J. Hill,et al.  Anomaly detection in streaming environmental sensor data: A data-driven modeling approach , 2010, Environ. Model. Softw..

[11]  A. Kialashaki,et al.  Modeling of the energy demand of the residential sector in the United States using regression models and artificial neural networks , 2013 .

[12]  Matthew Brown,et al.  Kernel regression for real-time building energy analysis , 2012 .

[13]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[14]  Jian Tang,et al.  Capabilities of outlier detection schemes in large datasets, framework and methodologies , 2006, Knowledge and Information Systems.

[15]  Harshad Khadilkar,et al.  Multi-User Energy Consumption Monitoring and Anomaly Detection with Partial Context Information , 2015, BuildSys@SenSys.

[16]  Victor C. M. Leung,et al.  Electricity Theft Detection in AMI Using Customers’ Consumption Patterns , 2016, IEEE Transactions on Smart Grid.

[17]  Douglas M. Hawkins Identification of Outliers , 1980, Monographs on Applied Probability and Statistics.

[18]  M. J. Desforges,et al.  Applications of probability density estimation to the detection of abnormal conditions in engineering , 1998 .

[19]  V. Geros,et al.  Modeling and predicting building's energy use with artificial neural networks: Methods and results , 2006 .

[20]  Charu C. Aggarwal,et al.  Outlier ensembles: position paper , 2013, SKDD.

[21]  Tania Cerquitelli,et al.  Fault Detection Analysis of Building Energy Consumption Using Data Mining Techniques , 2013 .

[22]  Fu Xiao,et al.  AHU sensor fault diagnosis using principal component analysis method , 2004 .

[23]  R. Pincus Barnett, V., and Lewis T.: Outliers in Statistical Data. 3rd edition. J. Wiley & Sons 1994, XVII. 582 pp., £49.95 , 1995 .

[24]  Soteris A. Kalogirou,et al.  Applications of artificial neural-networks for energy systems , 2000 .

[25]  Hiroshi Esaki,et al.  Strip, Bind, and Search: A method for identifying abnormal energy consumption in buildings , 2013, 2013 ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN).

[26]  Jui-Sheng Chou,et al.  Real-time detection of anomalous power consumption , 2014 .

[27]  Brett J. Borghetti,et al.  A Review of Anomaly Detection in Automated Surveillance , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[28]  Rajesh K. Gupta,et al.  Data driven investigation of faults in HVAC systems with model, cluster and compare (MCC) , 2014, BuildSys@SenSys.

[29]  Thomas G. Dietterich,et al.  A study of distance-based machine learning algorithms , 1994 .

[30]  Weiwei Chen,et al.  Anomaly detection in premise energy consumption data , 2011, 2011 IEEE Power and Energy Society General Meeting.

[31]  William DuMouchel,et al.  A Fast Computer Intrusion Detection Algorithm Based on Hypothesis Testing of Command Transition Probabilities , 1998, KDD.

[32]  Stephen Makonin,et al.  HUE: The hourly usage of energy dataset for buildings in British Columbia , 2019, Data in brief.

[33]  Sridhar Ramaswamy,et al.  Efficient algorithms for mining outliers from large data sets , 2000, SIGMOD '00.

[34]  Manish Marwah,et al.  Towards an understanding of campus-scale power consumption , 2011, BuildSys '11.

[35]  T. Ferryman,et al.  Data outlier detection using the Chebyshev theorem , 2005, 2005 IEEE Aerospace Conference.

[36]  David A. Clifton,et al.  A review of novelty detection , 2014, Signal Process..

[37]  Carlos León,et al.  Rule-based system to detect energy efficiency anomalies in smart buildings, a data mining approach , 2016, Expert Syst. Appl..

[38]  B. Rosner Percentage Points for a Generalized ESD Many-Outlier Procedure , 1983 .

[39]  John E. Seem,et al.  Using intelligent data analysis to detect abnormal energy consumption in buildings , 2007 .

[40]  Tom Fawcett,et al.  Adaptive Fraud Detection , 1997, Data Mining and Knowledge Discovery.

[41]  Raymond T. Ng,et al.  Algorithms for Mining Distance-Based Outliers in Large Datasets , 1998, VLDB.