Additive stacking for disaggregate electricity demand forecasting

Future grid management systems will coordinate distributed production and storage resources to manage, in a cost effective fashion, the increased load and variability brought by the electrification of transportation and by a higher share of weather dependent production. Electricity demand forecasts at a low level of aggregation will be key inputs for such systems. We focus on forecasting demand at the individual household level, which is more challenging than forecasting aggregate demand, due to the lower signal-to-noise ratio and to the heterogeneity of consumption patterns across households. We propose a new ensemble method for probabilistic forecasting, which borrows strength across the households while accommodating their individual idiosyncrasies. In particular, we develop a set of models or 'experts' which capture different demand dynamics and we fit each of them to the data from each household. Then we construct an aggregation of experts where the ensemble weights are estimated on the whole data set, the main innovation being that we let the weights vary with the covariates by adopting an additive model structure. In particular, the proposed aggregation method is an extension of regression stacking (Breiman, 1996) where the mixture weights are modelled using linear combinations of parametric, smooth or random effects. The methods for building and fitting additive stacking models are implemented by the gamFactory R package, available at this https URL.

[1]  Yannig Goude,et al.  Scalable visualisation methods for modern Generalized Additive Models , 2018 .

[2]  Joseph Sill,et al.  Feature-Weighted Linear Stacking , 2009, ArXiv.

[3]  R. Rigby,et al.  Generalized additive models for location, scale and shape , 2005 .

[4]  S. Wood Generalized Additive Models: An Introduction with R , 2006 .

[5]  Marco Henrique de Almeida Inácio,et al.  The NN-Stacking: Feature weighted linear stacking through neural networks , 2019, Neurocomputing.

[6]  T. Gneiting,et al.  Combining probability forecasts , 2010 .

[7]  R. Tibshirani,et al.  Combining Estimates in Regression and Classification , 1996 .

[8]  Marko Aunedi,et al.  Smart control for minimizing distribution network reinforcement cost due to electrification , 2013 .

[9]  S. Wood,et al.  Smoothing Parameter and Model Selection for General Smooth Models , 2015, 1511.03864.

[10]  T. Gneiting,et al.  Combining Predictive Distributions , 2011, 1106.1638.

[11]  Tim Cockerill,et al.  Time-of-use and time-of-export tariffs for home batteries: Effects on low voltage distribution networks , 2018, Journal of Energy Storage.

[12]  Aki Vehtari,et al.  Using Stacking to Average Bayesian Predictive Distributions (with Discussion) , 2017, Bayesian Analysis.

[13]  R. Tibshirani,et al.  Generalized Additive Models , 1986 .

[14]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[15]  Mike West,et al.  Dynamic Bayesian predictive synthesis in time series forecasting , 2016, Journal of Econometrics.

[16]  Peter Grünwald,et al.  Using Stacking to Average Bayesian Predictive Distributions (with Discussion) , 2018 .

[17]  Leo Breiman,et al.  Stacked regressions , 2004, Machine Learning.

[18]  Douglas Nychka,et al.  Bayesian Confidence Intervals for Smoothing Splines , 1988 .

[19]  Enrique Moral-Benito,et al.  Model Averaging in Economics: An Overview , 2015 .

[20]  S. Sheather Density Estimation , 2004 .

[21]  M. Stone Cross‐Validatory Choice and Assessment of Statistical Predictions , 1976 .

[22]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[23]  Daniel W. Apley,et al.  Visualizing the effects of predictor variables in black box supervised learning models , 2016, Journal of the Royal Statistical Society: Series B (Statistical Methodology).

[24]  R. Koenker,et al.  Regression Quantiles , 2007 .

[25]  Roberto Casarin,et al.  Bayesian Nonparametric Calibration and Combination of Predictive Distributions , 2015, 1502.07246.

[26]  L. Breiman Stacked Regressions , 1996, Machine Learning.

[27]  Gábor Lugosi,et al.  Prediction, learning, and games , 2006 .

[28]  Gilles Stoltz,et al.  Forecasting electricity consumption by aggregating specialized experts , 2012, Machine Learning.

[29]  Alan Y. Chiang,et al.  Generalized Additive Models: An Introduction With R , 2007, Technometrics.

[30]  Yannig Goude,et al.  Scalable Visualization Methods for Modern Generalized Additive Models , 2018, Journal of Computational and Graphical Statistics.