Smart baseline models for solar irradiation forecasting

Abstract This work presents a kind of smart baseline models for solar irradiation forecasting, as models are only fed with meteorological records and solar-computed values, easy-to-obtain inputs that facilitate their implementation worldwide. Global horizontal irradiation (GHI) is predicted for horizons of 1 h in a site of Southeast Spain. Two types of approaches are undertaken: fixed models, trained just once with a global database, and moving models, where the training database is updated based on the features of the testing sample. The approaches are implemented with two machine learning algorithms, support vector regression (SVR) and random forest (RFs), along with the classic linear regression and kNN. Besides, genetic algorithms (GAs) are used to automate the training process of fixed models, a task traditionally performed based on the experience or the researcher. Significant improvements were obtained over the basic persistence methods with both approaches. In the case of moving models, results proved that the best approach to update the calibration set was by computing the Euclidean distance in the principal components space. Results of both approaches were comparable in terms of MAE and forecast skill ( s ), though slightly superior predictions were obtained with the moving SVR, with a forecast skill ranging from 8% to 23% and a testing MAE ranging from 49 to 64  W / m 2 for the different states of cloudiness. Anyway, both approaches are valid baselines to compare new forecasting models fed with more difficult-to-obtain features, supplementing the classic but naive persistence models.

[1]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[2]  Badia Amrouche,et al.  Artificial neural network based daily local forecasting for global solar radiation , 2014 .

[3]  L. Wald,et al.  On the clear sky model of the ESRA — European Solar Radiation Atlas — with respect to the heliosat method , 2000 .

[4]  Dazhi Yang,et al.  Satellite image analysis and a hybrid ESSS/ANN model to forecast solar irradiance in the tropics , 2014 .

[5]  A. Massi Pavan,et al.  A hybrid model (SARIMA-SVM) for short-term power forecasting of a small-scale grid-connected photovoltaic plant , 2013 .

[6]  Jianming Ye On Measuring and Correcting the Effects of Data Mining and Model Selection , 1998 .

[7]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[8]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[9]  A. Zeileis,et al.  zoo: S3 Infrastructure for Regular and Irregular Time Series , 2005, math/0505527.

[10]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[11]  Guang Yang,et al.  Solar irradiance feature extraction and support vector machines based weather status pattern recognition model for short-term photovoltaic power forecasting , 2015 .

[12]  M. Diagne,et al.  Review of solar irradiance forecasting methods and a proposition for small-scale insular grids , 2013 .

[13]  Francisco Javier Martinez-de-Pison,et al.  Downscaling of global solar irradiation in complex areas in R , 2014 .

[14]  Carlos F.M. Coimbra,et al.  Nearest-neighbor methodology for prediction of intra-hour global horizontal and direct normal irradiances , 2015 .

[15]  Bangyin Liu,et al.  Online 24-h solar power forecasting based on weather type classification using artificial neural network , 2011 .

[16]  C. K. Chan,et al.  Prediction of hourly solar radiation using a novel hybrid model of ARMA and TDNN , 2011 .

[17]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[18]  A. Mellit,et al.  A 24-h forecast of solar irradiance using artificial neural network: Application for performance prediction of a grid-connected PV plant at Trieste, Italy , 2010 .

[19]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[20]  Giovanni Seni,et al.  Ensemble Methods in Data Mining: Improving Accuracy Through Combining Predictions , 2010, Ensemble Methods in Data Mining.

[21]  Carlos F.M. Coimbra,et al.  Real-time forecasting of solar irradiance ramps with smart image processing , 2015 .

[22]  S. H. Cao,et al.  Study of forecasting solar irradiance using neural networks with preprocessing sample data by wavelet analysis , 2006 .

[23]  Richard C. Zink,et al.  NParCov3: A SAS/IML Macro for Nonparametric Randomization-Based Analysis of Covariance , 2012 .

[24]  Max Kuhn,et al.  Applied Predictive Modeling , 2013 .

[25]  O. Perpiñán,et al.  PV power forecast using a nonparametric PV model , 2015 .

[26]  Manfred Georg Kratzenberg,et al.  Identification and Handling of Critical Irradiance Forecast Errors Using a Random Forest Scheme – A Case Study for Southern Brazil , 2015 .

[27]  Athanasios Sfetsos,et al.  Univariate and multivariate forecasting of hourly solar radiation with artificial intelligence techniques , 2000 .

[28]  Oscar Perpiñán Lamigueiro solaR: Solar Radiation and Photovoltaic Systems with R , 2012 .

[29]  Daniel Rowe,et al.  Short-term irradiance forecasting using skycams: Motivation and development , 2014 .

[30]  Yan Su,et al.  An ARMAX model for forecasting the power output of a grid connected photovoltaic system , 2014 .

[31]  D. Boldo,et al.  Very short term forecasting of the Global Horizontal Irradiance using a spatio-temporal autoregressive model , 2014 .

[32]  F. J. Martinez-de-Pison,et al.  Generation of daily global solar irradiation with support vector machines for regression , 2015 .

[33]  Pierre Ineichen,et al.  Conversion function between the Linke turbidity and the atmospheric water vapor and aerosol content , 2008 .

[34]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[35]  R. Inman,et al.  Solar forecasting methods for renewable energy integration , 2013 .

[36]  Kurt Hornik,et al.  Misc Functions of the Department of Statistics (e1071), TU Wien , 2014 .

[37]  Hsu-Yung Cheng,et al.  Bi-model short-term solar irradiance prediction using support vector regressors , 2014 .

[38]  R. Urraca,et al.  Estimation of solar global irradiation in remote areas , 2015 .

[39]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[40]  Viorel Badescu,et al.  Tailored vs black-box models for forecasting hourly average solar irradiance , 2015 .

[41]  Gilles Notton,et al.  Estimation of 5-min time-step data of tilted solar global irradiation using ANN (Artificial Neural Network) model , 2014 .

[42]  R. Urraca,et al.  Solar irradiation mapping with exogenous data from support vector regression machines estimations , 2015 .

[43]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[44]  Olivier Pannekoucke,et al.  A benchmark of statistical regression methods for short-term forecasting of photovoltaic electricity production, part I: Deterministic forecast of hourly production , 2014 .

[45]  Sancho Salcedo-Sanz,et al.  Daily global solar radiation prediction based on a hybrid Coral Reefs Optimization – Extreme Learning Machine approach , 2014 .

[46]  Carlos F.M. Coimbra,et al.  Real-time prediction intervals for intra-hour DNI forecasts , 2015 .

[47]  V. Vapnik Pattern recognition using generalized portrait method , 1963 .