Long-term time series prediction using wrappers for variable selection and clustering for data partition

In an attempt to implement long-term time series prediction based on the recursive application of a one-step-ahead multilayer neural network predictor, we have considered the eleven short time series provided by the organizers of the Special Session NN3 Neural Network Forecasting Competition, and have proposed a joint application of a variable selection technique and a clustering procedure. The purpose was to define unbiased partition subsets and predictors with high generalization capability, based on a wrapper methodology. The proposed approach overcomes the performance of the predictor that considers all the lags in the regression vector. After obtaining the eleven long-term predictors, we conclude the paper presenting the eighteen multi-step predictions for each time series, as requested in the competition.

[1]  F. Fleuret Fast Binary Feature Selection with Conditional Mutual Information , 2004, J. Mach. Learn. Res..

[2]  C.A.M. Lima,et al.  Mixture of heterogeneous experts applied to time series: a comparative study , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[3]  P. Rapp,et al.  Statistical validation of mutual information calculations: comparison of alternative numerical algorithms. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  Lyle H. Ungar,et al.  Neural network forecasting of short, noisy time series , 1992 .

[5]  Robert A. Lordo,et al.  Learning from Data: Concepts, Theory, and Methods , 2001, Technometrics.

[6]  Fernando José Von Zuben,et al.  Data partition and variable selection for time series prediction using wrappers , 2006, IJCNN.

[7]  Robert L. Winkler,et al.  The accuracy of extrapolation (time series) methods: Results of a forecasting competition , 1982 .

[8]  Ramesh Sharda,et al.  Connectionist approach to time series prediction: an empirical test , 1992, J. Intell. Manuf..

[9]  Patrick Gallinari,et al.  FEATURE SELECTION WITH NEURAL NETWORKS , 1999 .

[10]  Brian D. Ripley,et al.  Statistical aspects of neural networks , 1993 .

[11]  Kagan Tumer,et al.  Theoretical Foundations Of Linear And Order Statistics Combiners For Neural Pattern Classifiers , 1995 .

[12]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[13]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1971 .

[14]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[15]  John C. Brown,et al.  Delayed time series predictions with neural networks , 1998, Neurocomputing.

[16]  Andreas S. Weigend,et al.  Nonparametric selection of input variables for connectionist learning , 1996 .

[17]  Lars Kai Hansen,et al.  Neural Network Ensembles , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[19]  Bruce W. Schmeiser,et al.  Optimal linear combinations of neural networks: an overview , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).

[20]  Magali E. Azema-Barac,et al.  Currency exchange rate forecasting by error backpropagation , 1992, Proceedings of the Twenty-Fifth Hawaii International Conference on System Sciences.

[21]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[22]  George E. P. Box,et al.  Time Series Analysis: Forecasting and Control , 1977 .

[23]  Paul A. Fishwick,et al.  Time series forecasting using neural networks vs. Box- Jenkins methodology , 1991, Simul..

[24]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[25]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[26]  William Remus,et al.  Time series forecasting using neural networks: should the data be deseasonalized first? , 1999 .

[27]  Gang Wang,et al.  Feature selection with conditional mutual information maximin in text categorization , 2004, CIKM '04.

[28]  Spyros Makridakis,et al.  Forecasting accuracy and system complexity , 1995 .

[29]  Berkman Sahiner,et al.  Dual system approach to computer-aided detection of breast masses on mammograms. , 2006, Medical physics.