A novel data-driven approach for residential electricity consumption prediction based on ensemble learning

Abstract With the development of smart grid as well as the electricity market, it is of increasing significance to predict the household electricity consumption. In this paper, a novel data-driven framework is proposed to predict the annual household electricity consumption using ensemble learning technique. The extreme gradient boosting forest and feedforward deep networks are served as base models. These base models are combined by ridge regression. What is more, the importances of input features are estimated. A subset of features is selected as the important features to feed into the model to increase its accuracy. A comparison of the proposed ensemble framework against classical regression models indicates that the former can reduce by 30 % of the prediction error. The results of this study show that ensemble learning method can be a convenient and accurate approach to predict household electricity consumption.

[1]  Pei-Chann Chang,et al.  Monthly electricity demand forecasting based on a weighted evolving fuzzy neural network approach , 2011 .

[2]  Shanjun Li,et al.  A New Look at Residential Electricity Demand Using Household Expenditure Data , 2011 .

[3]  Daniela M. Witten,et al.  An Introduction to Statistical Learning: with Applications in R , 2013 .

[4]  Zhi-Hua Zhou,et al.  Ensemble Methods: Foundations and Algorithms , 2012 .

[5]  Evgeny A. Antipov,et al.  Mass Appraisal of Residential Apartments: An Application of Random Forest for Valuation and a CART-Based Approach for Model Diagnostics , 2010, Expert Syst. Appl..

[6]  Ning An,et al.  Using multi-output feedforward neural network with empirical mode decomposition based signal filtering for electricity demand forecasting , 2013 .

[7]  Cha Zhang,et al.  Ensemble Machine Learning: Methods and Applications , 2012 .

[8]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[9]  Peter C. Reiss,et al.  Household Electricity Demand, Revisited , 2005 .

[10]  Theologos Dergiades,et al.  Estimating Residential Demand for Electricity in the United States, 1965-2006 , 2008 .

[11]  Hsuan-Tien Lin,et al.  Learning From Data , 2012 .

[12]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[13]  Ertuğrul Çam,et al.  Forecasting electricity consumption: A comparison of regression analysis, neural networks and least squares support vector machines , 2015 .

[14]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[15]  F. Teng,et al.  Estimation of urban residential electricity demand in China using household survey data , 2013 .

[16]  L. Hunt,et al.  Modelling and forecasting Turkish residential electricity demand , 2011 .

[17]  Anna Alberini,et al.  Residential Consumption of Gas and Electricity in the U.S.: The Role of Prices and Income , 2011 .

[18]  Yang Yu,et al.  Diversity Regularized Ensemble Pruning , 2012, ECML/PKDD.

[19]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[20]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[21]  Athanasios Tsanas,et al.  Accurate quantitative estimation of energy performance of residential buildings using statistical machine learning tools , 2012 .

[22]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.