Reliable solar irradiance prediction using ensemble learning-based models: A comparative study

Abstract Accurately predicting solar irradiance is important in designing and efficiently managing photovoltaic systems. This paper aims to provide a reliable short-term prediction of solar irradiance based on various meteorological factors using ensemble learning-based models that take into account the time-dependent nature of the solar irradiance data. The use of ensemble learning models is motivated by their desirable characteristics in combining several weak regressors to achieve an improved prediction quality relative to conventional single learners. Furthermore, they reduce the overall prediction error and have the ability to combine different models. In this paper, we first investigate the prediction performance of the well-known ensemble methods, Boosted Trees, Bagged Trees, Random Forest, and Generalized Random Forest in short-term prediction of solar irradiance. The performance of these ensemble methods has been compared to two commonly known prediction methods namely Gaussian process regression, and Support Vector Regression. Typical Meteorological Year data are used to verify the prediction performance of the considered models. Results showed that ensemble methods offer superior prediction performance compared to the individual regressors. Furthermore, the results showed that the ensemble models have a consistent and reliable prediction when applied to data from different locations. Lastly, variables contribution assessment showed that the lagged solar irradiance variables contribute significantly to the ensemble models, which help in designing more parsimonious models.

[1]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[2]  Chang-Yeol Yun,et al.  Probabilistic prediction of direct normal irradiance derived from global horizontal irradiance over the Korean Peninsula by using Monte-Carlo simulation , 2019, Solar Energy.

[3]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[4]  Ravinesh C. Deo,et al.  Wavelet-based 3-phase hybrid SVR model trained with satellite-derived predictors, particle swarm optimization and maximum overlap discrete wavelet transform for solar radiation prediction , 2019, Renewable and Sustainable Energy Reviews.

[5]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[6]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[7]  Senén Barro,et al.  Do we need hundreds of classifiers to solve real world classification problems? , 2014, J. Mach. Learn. Res..

[8]  S. Athey,et al.  Generalized random forests , 2016, The Annals of Statistics.

[9]  Hsu-Yung Cheng,et al.  Multi-model solar irradiance prediction based on automatic cloud classification , 2015 .

[10]  Kazuyuki Aihara,et al.  Improving time series prediction of solar irradiance after sunrise: Comparison among three methods for time series prediction , 2017 .

[11]  F. Harrou,et al.  Wind power prediction using bootstrap aggregating trees approach to enabling sustainable wind power integration in a smart grid , 2019 .

[12]  Hadrien Verbois,et al.  Solar irradiance forecasting in the tropics using numerical weather prediction and statistical learning , 2018 .

[13]  Peter Buhlmann,et al.  BOOSTING ALGORITHMS: REGULARIZATION, PREDICTION AND MODEL FITTING , 2007, 0804.2752.

[14]  Kai Goebel,et al.  Degradation Modeling and Remaining Useful Life Prediction of Aircraft Engines Using Ensemble Learning , 2018, Journal of Engineering for Gas Turbines and Power.

[15]  Mario Paolone,et al.  Model-free computation of ultra-short-term prediction intervals of solar irradiance , 2016 .

[16]  Andreas Ziegler,et al.  ranger: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R , 2015, 1508.04409.

[17]  Haoru Li,et al.  Machine learning models to quantify and map daily global solar radiation and photovoltaic power , 2020 .

[18]  S. E. Haupt,et al.  A model tree approach to forecasting solar irradiance variability , 2015 .

[19]  A. Regattieri,et al.  Artificial neural network optimisation for monthly average daily global solar radiation prediction , 2016 .

[20]  Philippe Lauret,et al.  Probabilistic forecasting of the solar irradiance with recursive ARMA and GARCH models , 2016 .

[21]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[22]  Mitsuru Kakimoto,et al.  Probabilistic Solar Irradiance Forecasting by Conditioning Joint Probability Method and Its Application to Electric Power Trading , 2019, IEEE Transactions on Sustainable Energy.

[23]  Lifeng Wu,et al.  Potential of kernel-based nonlinear extension of Arps decline model and gradient boosting with categorical features support for predicting daily global solar radiation in humid regions , 2019, Energy Conversion and Management.

[24]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[25]  J. Friedman Stochastic gradient boosting , 2002 .

[26]  Tohru Suwa,et al.  Sky image-based solar irradiance prediction methodologies using artificial neural networks , 2019, Renewable Energy.

[27]  Hsu-Yung Cheng,et al.  Predicting solar irradiance with all-sky image features via regression , 2013 .

[28]  Yugang Niu,et al.  Hourly day-ahead solar irradiance prediction using weather forecasts by LSTM , 2018 .

[29]  A. El Bouardi,et al.  On the prediction of the daily global solar radiation intensity on south-facing plane surfaces inclined at varying angles , 2016 .

[30]  Daisuke Nohara,et al.  Prediction of time series for several hours of surface solar irradiance using one-granule cloud property data from satellite observations , 2019, Solar Energy.

[31]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[32]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[33]  C. Sutton Classification and Regression Trees, Bagging, and Boosting , 2005 .

[34]  Dan Keun Sung,et al.  Hourly Solar Irradiance Prediction Based on Support Vector Machine and Its Error Analysis , 2017, IEEE Transactions on Power Systems.

[35]  Yu Feng,et al.  Evaluation of temperature-based machine learning and empirical models for predicting daily global solar radiation , 2019, Energy Conversion and Management.

[36]  Yoav Freund,et al.  Boosting the margin: A new explanation for the effectiveness of voting methods , 1997, ICML.

[37]  Shen Xu,et al.  Simplified vector-based model tailored for urban-scale prediction of solar irradiance , 2019, Solar Energy.

[38]  Leo Breiman,et al.  Prediction Games and Arcing Algorithms , 1999, Neural Computation.

[39]  Ningbo Cui,et al.  Development of data-driven models for prediction of daily global horizontal irradiance in Northwest China , 2019, Journal of Cleaner Production.

[40]  K. Kaba,et al.  Estimation of daily global solar radiation using deep learning model , 2018, Energy.

[41]  Kan-Jian Zhang,et al.  Wind power prediction with missing data using Gaussian process regression and multiple imputation , 2018, Appl. Soft Comput..

[42]  Ted Soubdhan,et al.  Nonparametric Bayesian-based recognition of solar irradiance conditions: Application to the generation of high temporal resolution synthetic solar irradiance data , 2019, Solar Energy.