Optimal sub-models selection algorithm for combination forecasting model

Abstract It has been widely demonstrated in forecasting that combining forecasts can improve the forecast performance compared to individual forecasts. However, how to select the optimal sub-models from all the available models is a difficult problem in combination forecasting model. Consider that the redundancy information among the selected sub-models will reduce the performance of the combination forecasting model, it is advocated to select a sub-model only if it contributes to the redundancy removing mutual information between the outputs of the selected sub-models and the actual outputs. As linear combination method is promising and popular in the field of combination forecast, a novel Max-Linear-Relevance and Min-Linear-Redundancy based selection algorithm is proposed in this paper. The proposed selection algorithm provides a theoretical approach for the optimal sub-models selection, and tries to compute the redundancy removing linear mutual information between the outputs of the selected sub-models and the actual outputs. Three monthly time series from DataMarket are used as illustrative examples to evaluate the forecasting. As a result of the implementation, it is seen that the proposed combination forecasting model produces better forecasts than those produced by other models.

[1]  Çagdas Hakan Aladag,et al.  Forecast Combination by Using Artificial Neural Networks , 2010, Neural Processing Letters.

[2]  Chih-Chou Chiu,et al.  Integration of nonlinear independent component analysis and support vector regression for stock price forecasting , 2013, Neurocomputing.

[3]  Chao Ren,et al.  Optimal parameters selection for BP neural network based on particle swarm optimization: A case study of wind speed forecasting , 2014, Knowl. Based Syst..

[4]  Ralph D. Snyder,et al.  Forecasting intraday time series with multiple seasonal cycles using parsimonious seasonal exponential smoothing , 2009 .

[5]  George E. P. Box,et al.  Time Series Analysis: Forecasting and Control , 1977 .

[6]  J. M. Bates,et al.  The Combination of Forecasts , 1969 .

[7]  Claude E. Shannon,et al.  The Mathematical Theory of Communication , 1950 .

[8]  Ruey-Chyn Tsaur,et al.  A new piecewise fuzzy exponential smoothing model based on some change-points , 2011, Expert Syst. Appl..

[9]  C. Holt Author's retrospective on ‘Forecasting seasonals and trends by exponentially weighted moving averages’ , 2004 .

[10]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[11]  Marina Theodosiou,et al.  Disaggregation & aggregation of time series components: A hybrid forecasting approach using generalized regression neural networks and the theta method , 2011, Neurocomputing.

[12]  Chih-Chou Chiu,et al.  Financial time series forecasting using independent component analysis and support vector regression , 2009, Decis. Support Syst..

[13]  Haiyan Lu,et al.  Combined modeling for electric load forecasting with adaptive particle swarm optimization , 2010 .

[14]  Hongnian Yu,et al.  A combination selection algorithm on forecasting , 2014, Eur. J. Oper. Res..

[15]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[16]  R. K. Agrawal,et al.  Performance evaluation of weights selection schemes for linear combination of multiple forecasts , 2012, Artificial Intelligence Review.

[17]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[18]  Chee Keong Kwoh,et al.  A Feature Subset Selection Method Based On High-Dimensional Mutual Information , 2011, Entropy.

[19]  Dipti Srinivasan,et al.  Evolving artificial neural networks for short term load forecasting , 1998, Neurocomputing.

[20]  Jacek M. Zurada,et al.  Normalized Mutual Information Feature Selection , 2009, IEEE Transactions on Neural Networks.

[21]  J. Shao,et al.  APPLICATION OF AN ARTIFICIAL NEURAL NETWORK TO IMPROVE SHORT-TERM ROAD ICE FORECASTS , 1998 .

[22]  F. Diebold,et al.  Forecast Evaluation and Combination , 1996 .

[23]  Wei-Chiang Hong,et al.  Traffic flow forecasting by seasonal SVR with chaotic simulated annealing algorithm , 2011, Neurocomputing.

[24]  Young-Koo Lee,et al.  An Improved Maximum Relevance and Minimum Redundancy Feature Selection Algorithm Based on Normalized Mutual Information , 2010, 2010 10th IEEE/IPSJ International Symposium on Applications and the Internet.

[25]  Guoqiang Peter Zhang,et al.  Time series forecasting using a hybrid ARIMA and neural network model , 2003, Neurocomputing.

[26]  JinXing Che Support vector regression based on optimal training subset and adaptive particle swarm optimization algorithm , 2013, Appl. Soft Comput..

[27]  Minvydas Ragulskis,et al.  Short-term time series algebraic forecasting with internal smoothing , 2014, Neurocomputing.

[28]  Philip Hans Franses,et al.  Model selection for forecast combination , 2008 .

[29]  Turgut Kisinbay The use of encompassing tests for forecast combinations , 2010 .

[30]  Chong-Ho Choi,et al.  Input feature selection for classification problems , 2002, IEEE Trans. Neural Networks.

[31]  Haiyan Lu,et al.  A case study on a hybrid wind speed forecasting method using BP neural network , 2011, Knowl. Based Syst..

[32]  Chao Wu,et al.  Forecasting stock indices using radial basis function neural networks optimized by artificial fish swarm algorithm , 2011, Knowl. Based Syst..

[33]  Lee-Ing Tong,et al.  Forecasting time series using a methodology based on autoregressive integrated moving average and genetic programming , 2011, Knowl. Based Syst..

[34]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Araceli Sanchis,et al.  Time series forecasting using a weighted cross-validation evolutionary artificial neural network ensemble , 2013, Neurocomputing.