Development of prediction models for next-day building energy consumption and peak power demand using data mining techniques

This paper presents a data mining (DM) based approach to developing ensemble models for predicting next-day energy consumption and peak power demand, with the aim of improving the prediction accuracy. This approach mainly consists of three steps. Firstly, outlier detection, which merges feature extraction, clustering analysis, and the generalized extreme studentized deviate (GESD), is performed to remove the abnormal daily energy consumption profiles. Secondly, the recursive feature elimination (RFE), an embedded variable selection method, is applied to select the optimal inputs to the base prediction models developed separately using eight popular predictive algorithms. The parameters of each model are then obtained through leave-group-out cross validation (LGOCV). Finally, the ensemble model is developed and the weights of the eight predictive models are optimized using genetic algorithm (GA).

[1]  Roger N. Anderson,et al.  Forecasting Energy Demand in Large Commercial Buildings Using Support Vector Machine Regression , 2011 .

[2]  V. Ismet Ugursal,et al.  Comparison of neural network, conditional demand analysis, and engineering approaches for modeling end-use energy consumption in the residential sector , 2008 .

[3]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[4]  E. Gonzalez-Romera,et al.  Monthly Electric Energy Demand Forecasting Based on Trend Extraction , 2006, IEEE Transactions on Power Systems.

[5]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[6]  Xiaoli Li,et al.  This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. 1 Classification of Energy Consumption in Buildings with Outlier Detection , 2022 .

[7]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[8]  Silja Meyer-Nieberg,et al.  Electric load forecasting methods: Tools for decision making , 2009, Eur. J. Oper. Res..

[9]  Michael K. Ng,et al.  An Entropy Weighting k-Means Algorithm for Subspace Clustering of High-Dimensional Sparse Data , 2007, IEEE Transactions on Knowledge and Data Engineering.

[10]  Teri A. Crosby,et al.  How to Detect and Handle Outliers , 1993 .

[11]  Zhenjun Ma,et al.  Online supervisory and optimal control of complex building central chilling systems , 2008 .

[12]  Rahmat-Allah Hooshmand,et al.  A new hybrid day-ahead peak load forecasting method for Iran’s National Grid , 2013 .

[13]  B. Rosner Percentage Points for a Generalized ESD Many-Outlier Procedure , 1983 .

[14]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[15]  Sedat Akkurt,et al.  Artificial neural networks applications in building energy predictions and a case study for tropical climates , 2005 .

[16]  R. E. Abdel-Aal,et al.  Modeling and forecasting electric daily peak loads using abductive networks , 2006 .

[17]  K. Steemers,et al.  A method of formulating energy load profile for domestic buildings in the UK , 2005 .

[18]  David Hsu Identifying key variables and interactions in statistical models of building energy consumption using regularization , 2015 .

[19]  Lei Wang,et al.  Study on Power Energy Consumption Model for Large-Scale Public Building , 2010, 2010 2nd International Workshop on Intelligent Systems and Applications.

[20]  B. Dong,et al.  Applying support vector machines to predict building energy consumption in tropical region , 2005 .

[21]  J. Friedman Multivariate adaptive regression splines , 1990 .

[22]  J. A. White,et al.  Simplified method for predicting building energy consumption using average monthly temperatures , 1996, IECEC 96. Proceedings of the 31st Intersociety Energy Conversion Engineering Conference.

[23]  John E. Seem,et al.  Using intelligent data analysis to detect abnormal energy consumption in buildings , 2007 .

[24]  Robert Černý,et al.  A fast computational approach for the determination of thermal properties of hollow bricks in energy-related calculations , 2015 .

[25]  J Elith,et al.  A working guide to boosted regression trees. , 2008, The Journal of animal ecology.

[26]  Vic Barnett,et al.  Outliers in Statistical Data , 1980 .

[27]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[28]  Frédéric Magoulès,et al.  A review on the prediction of building energy consumption , 2012 .

[29]  Xinhua Xu,et al.  A grey‐box model of next‐day building thermal load prediction for energy‐efficient control , 2008 .

[30]  Cameron W. Potter,et al.  Building a smarter smart grid through better renewable energy information , 2009, 2009 IEEE/PES Power Systems Conference and Exposition.

[31]  Frédéric Magoulès,et al.  Development of an RDP neural network for building energy consumption fault detection and diagnosis , 2013 .