Application of the BORUTA Algorithm to Input Data Selection for a Model Based on Rough Set Theory (RST) to Prediction Energy Consumption for Building Heating

Energy prediction used for building heating has attracted particular attention because it is often required in the development of various strategies to improve the energy efficiency of buildings, especially those undergoing thermal improvements. The complexity, dynamics, uncertainty, and nonlinearity of existing building energy systems create a great need for modeling techniques. One of them is machine learning models, which are based on input data consisting of features that describe the objects under study. The data describing actual buildings used to build the model may be characterized by missing values, duplicate or inconsistent features, noise, and outliers. Therefore, an extremely important aspect of the prediction model development effort is the proper selection of features to simplify the prediction of energy consumption for heating. In this connection, the goal was to evaluate the usefulness of a model describing the final energy demand rate for building heating using groups of features describing actual residential buildings undergoing thermal retrofit. The model was created by combining two algorithms: the BORUTA feature selection algorithm, which prepares conditional variables corresponding to features for a prediction model based on rough set theory (RST). The research was conducted on a group of 109 multi-family buildings from the end of the last century (made in large-panel technology), thermomodernized at the beginning of the 21st century. Evaluation metrics such as MAPE, MBE, CV RMSE, and R2, which are adopted as statistical calibration standards by ASHRAE, were used to assess the quality of the developed prediction model. The analysis of the obtained results indicated that the model based on RST, based on the features selected by the BORUTA algorithm, gives a satisfactory prediction quality with a limited number of input variables, and thus allows to predict energy consumption (after thermal improvement) for this type of buildings with high accuracy.

[1]  Betul Bektas Ekici,et al.  Prediction of building energy needs in early stage of design by using ANFIS , 2011, Expert Syst. Appl..

[2]  Jin Wen,et al.  Review of building energy modeling for control and operation , 2014 .

[3]  Mohsen Mohammadi,et al.  Small-Scale Building Load Forecast based on Hybrid Forecast Engine , 2017, Neural Processing Letters.

[4]  Tanveer Ahmad,et al.  Short and medium-term forecasting of cooling and heating load demand in building environment with data-mining based approaches , 2018 .

[5]  Sachin Kumar,et al.  A novel method based on extreme learning machine to predict heating and cooling load through design and structural attributes , 2018, Energy and Buildings.

[6]  Luis Romeral,et al.  Activity-aware HVAC power demand forecasting , 2018, Energy and Buildings.

[7]  Jin Wen,et al.  A systematic feature selection procedure for short-term data-driven building energy forecasting model development , 2019, Energy and Buildings.

[8]  Madeleine Gibescu,et al.  Unsupervised energy prediction in a Smart Grid context using reinforcement cross-building transfer learning , 2016 .

[9]  Jian Chu,et al.  Forecasting building energy consumption using neural networks and hybrid neuro-fuzzy system: A compa , 2011 .

[10]  Stanisław Kokoszka,et al.  Application of Rough Set Theory (RST) to Forecast Energy Consumption in Buildings Undergoing Thermal Modernization , 2020, Energies.

[11]  Àngela Nebot,et al.  Hybrid methodologies for electricity load forecasting: Entropy-based feature selection with machine learning and soft computing techniques , 2015 .

[12]  Sylvain Robert,et al.  State of the art in building modelling and energy performances prediction: A review , 2013 .

[13]  Frédéric Magoulès,et al.  Feature Selection for Predicting Building Energy Consumption Based on Statistical Learning Method , 2012 .

[14]  G. Sudheer,et al.  Short term load forecasting using wavelet transform combined with Holt–Winters and weighted nearest neighbor models , 2015 .

[15]  Radiša Jovanović,et al.  Ensemble of various neural networks for prediction of heating energy consumption , 2015 .

[16]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[17]  Yacine Rezgui,et al.  Trees vs Neurons: Comparison between random forest and ANN for high-resolution prediction of building energy consumption , 2017 .

[18]  Koichi Yamada,et al.  Extended Tolerance Relation to Define a New Rough Set Model in Incomplete Information Systems , 2013, Adv. Fuzzy Syst..

[19]  Xiaofeng Guo,et al.  Modeling and forecasting building energy consumption: A review of data-driven techniques , 2019, Sustainable Cities and Society.

[20]  V. Ismet Ugursal,et al.  Comparison of neural network, conditional demand analysis, and engineering approaches for modeling end-use energy consumption in the residential sector , 2008 .

[21]  Jianjun Wang,et al.  An annual load forecasting model based on support vector regression with differential evolution algorithm , 2012 .

[22]  Pedro J. Mago,et al.  Building hourly thermal load prediction using an indexed ARX model , 2012 .

[23]  Andrew Kusiak,et al.  A data-driven approach for steam load prediction in buildings , 2010 .

[24]  Madeleine Gibescu,et al.  Deep learning for estimating building energy consumption , 2016 .

[25]  Nelson Fumo,et al.  A review on the basics of building energy estimation , 2014 .

[26]  Germán Ramos Ruiz,et al.  Validation of calibrated energy models: Common errors , 2017 .

[27]  Frédéric Magoulès,et al.  A review on the prediction of building energy consumption , 2012 .

[28]  Hwataik Han,et al.  Simplified dynamic neural network model to predict heating load of a building using Taguchi method , 2016 .

[29]  Joaquim Melendez,et al.  Short-term load forecasting in a non-residential building contrasting models and attributes , 2015 .

[30]  T. Mathia,et al.  Neural Methods Comparison for Prediction of Heating Energy Based on Few Hundreds Enhanced Buildings in Four Season’s Climate , 2020, Energies.

[31]  Witold R. Rudnicki,et al.  Feature Selection with the Boruta Package , 2010 .

[32]  Hongzhan Nie,et al.  Hybrid of ARIMA and SVMs for Short-Term Load Forecasting , 2012 .

[33]  Fu Xiao,et al.  A short-term building cooling load prediction method using deep learning algorithms , 2017 .

[34]  Zoltán Nagy,et al.  Using machine learning techniques for occupancy-prediction-based cooling control in office buildings , 2018 .

[35]  Ali Azadeh,et al.  An integrated fuzzy regression algorithm for energy consumption estimation with non-stationary data: A case study of Iran , 2010 .

[36]  Danielle Monfet,et al.  Development of an energy prediction tool for commercial buildings using case-based reasoning , 2014 .

[37]  Hyojoo Son,et al.  Forecasting Short-term Electricity Demand in Residential Sector Based on Support Vector Regression and Fuzzy-rough Feature Selection with Particle Swarm Optimization , 2015 .

[38]  Jin Wen,et al.  A review of machine learning in building load prediction , 2021 .

[39]  Gregor P. Henze,et al.  Statistical Analysis of Neural Networks as Applied to Building Energy Prediction , 2004 .

[40]  Shengwei Wang,et al.  Development of prediction models for next-day building energy consumption and peak power demand using data mining techniques , 2014 .

[41]  Dug Hun Hong,et al.  Short-term load forecasting for the holidays using fuzzy linear regression method , 2005 .

[42]  B. Dong,et al.  Applying support vector machines to predict building energy consumption in tropical region , 2005 .

[43]  Vladimir Ceperic,et al.  A Strategy for Short-Term Load Forecasting by Support Vector Regression Machines , 2013, IEEE Transactions on Power Systems.

[44]  T. Szul,et al.  Application of Rough Set Theory to Establish the Amount of Waste in Households in Rural Areas , 2017 .

[45]  Zhiwei Lian,et al.  Hourly cooling load prediction by a combined forecasting model based on Analytic Hierarchy Process , 2004 .

[46]  Sung-Kwan Joo,et al.  Holiday Load Forecasting Using Fuzzy Polynomial Regression With Weather Feature Selection and Adjustment , 2012, IEEE Transactions on Power Systems.

[47]  Junzo Watada,et al.  Building an integrated hybrid model for short-term and mid-term load forecasting with genetic optimization , 2012 .

[48]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[49]  A. Arabali,et al.  A hybrid short-term load forecasting with a new input selection framework , 2015 .

[50]  Jui-Sheng Chou,et al.  Modeling heating and cooling loads by artificial intelligence for energy-efficient building design , 2014 .

[51]  Frédéric Magoulès,et al.  Parallel Support Vector Machines Applied to the Prediction of Multiple Buildings Energy Consumption , 2010 .

[52]  Kevin M. Smith,et al.  Forecasting energy consumption of multi-family residential buildings using support vector regression: Investigating the impact of temporal and spatial monitoring granularity on performance accuracy , 2014 .

[53]  Hongye Su,et al.  Forecasting building energy consumption with hybrid genetic algorithm-hierarchical adaptive network-based fuzzy inference system , 2010 .

[54]  Krishnan Gowri,et al.  The ASHRAE Great Energy Predictor III competition: Overview and results , 2020, ArXiv.

[55]  Benjamin C. M. Fung,et al.  A decision tree method for building energy demand modeling , 2010 .

[56]  Matthew Brown,et al.  Kernel regression for real-time building energy analysis , 2012 .

[57]  S. Ranji Ranjithan,et al.  Multivariate regression as an energy assessment tool in early building design , 2012 .