A hybrid SOFM-SVR with a filter-based feature selection for stock market forecasting

Stock market price index prediction is regarded as a challenging task of the financial time series prediction process. Support vector regression (SVR) has successfully solved prediction problems in many domains, including the stock market. This paper hybridizes SVR with the self-organizing feature map (SOFM) technique and a filter-based feature selection to reduce the cost of training time and to improve prediction accuracies. The hybrid system conducts the following processes: filter-based feature selection to choose important input attributes; SOFM algorithm to cluster the training samples; and SVR to predict the stock market price index. The proposed model was demonstrated using a real future dataset - Taiwan index futures (FITX) to predict the next day's price index. The experiment results show that the proposed SOFM-SVR is an improvement over the traditional single SVR in average prediction accuracy and training time.

[1]  Kyoung-jae Kim,et al.  Financial time series forecasting using support vector machines , 2003, Neurocomputing.

[2]  Bernhard Schölkopf,et al.  Statistical Learning and Kernel Methods , 2001, Data Fusion and Perception.

[3]  Michael T. Manry,et al.  Feature Selection Using a Piecewise Linear Network , 2006, IEEE Transactions on Neural Networks.

[4]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[5]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[6]  Lijuan Cao,et al.  Support vector machines experts for time series forecasting , 2003, Neurocomputing.

[7]  Teuvo Kohonen,et al.  Self-organization and associative memory: 3rd edition , 1989 .

[8]  Manuel Martín-Merino,et al.  A New SOM Algorithm for Electricity Load Forecasting , 2006, ICONIP.

[9]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[10]  Jie Wang,et al.  A High Precision Prediction Method by Using Combination of ELMAN and SOM Neural Networks , 2004, ISNN.

[11]  George M. Giaglis,et al.  A hybrid approach for improving predictive accuracy of collaborative filtering algorithms , 2007, User Modeling and User-Adapted Interaction.

[12]  Jovina Roman,et al.  Backpropagation and recurrent neural networks in financial analysis of multiple stock market returns , 1996, Proceedings of HICSS-29: 29th Hawaii International Conference on System Sciences.

[13]  Kyung-shik Shin,et al.  A hybrid approach based on neural networks and genetic algorithms for detecting temporal patterns in stock markets , 2007, Appl. Soft Comput..

[14]  Lutgarde M. C. Buydens,et al.  Using support vector machines for time series prediction , 2003 .

[15]  Xing Jin,et al.  Ensemble SVR for prediction of time series , 2005, 2005 International Conference on Machine Learning and Cybernetics.

[16]  Ingoo Han,et al.  Hybrid genetic algorithms and support vector machines for bankruptcy prediction , 2006, Expert Syst. Appl..

[17]  Anthony Brabazon,et al.  A hybrid genetic model for the prediction of corporate failure , 2004, Comput. Manag. Sci..

[18]  Shitong Wang,et al.  A Fully Automated Pattern Classification Method of Combining Self-Organizing Map with Generalization Regression Neural Network , 2006, ICONIP.

[19]  Ping-Feng Pai,et al.  A hybrid ARIMA and support vector machines model in stock price forecasting , 2005 .

[20]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[21]  Dustin Boswell,et al.  Introduction to Support Vector Machines , 2002 .

[22]  Ingoo Han,et al.  Genetic algorithms approach to feature discretization in artificial neural networks for the prediction of stock price index , 2000 .

[23]  Soushan Wu,et al.  Credit rating analysis with support vector machines and neural networks: a market comparative study , 2004, Decis. Support Syst..

[24]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.

[25]  Ray Tsaih,et al.  Forecasting S&P 500 stock index futures with a hybrid AI system , 1998, Decis. Support Syst..

[26]  Shouyang Wang,et al.  Forecasting stock market movement direction with support vector machine , 2005, Comput. Oper. Res..

[27]  Wu Wei,et al.  A hybrid SOM-SVM method for analyzing zebra fish gene expression , 2004, ICPR 2004.

[28]  Alexander J. Smola,et al.  Support Vector Method for Function Approximation, Regression Estimation and Signal Processing , 1996, NIPS.

[29]  Francis Eng Hock Tay,et al.  Improved financial time series forecasting by combining Support Vector Machines with self-organizing feature map , 2001, Intell. Data Anal..

[30]  S. Achelis Technical analysis a to z , 1994 .

[31]  W. J. Conover,et al.  Practical Nonparametric Statistics , 1972 .

[32]  Amir F. Atiya,et al.  Introduction to financial forecasting , 1996, Applied Intelligence.

[33]  Wu Meng,et al.  Application of Support Vector Machines in Financial Time Series Forecasting , 2007 .

[34]  Mu-Chen Chen,et al.  Credit scoring with a data mining approach based on support vector machines , 2007, Expert Syst. Appl..

[35]  Jen-Ying Shih,et al.  A study of Taiwan's issuer credit rating systems using support vector machines , 2006, Expert Syst. Appl..

[36]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory , 1988 .

[37]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.

[38]  Myungsook Klassen,et al.  Investigation of Some Technical Indexes in Stock Forecasting Using Neural Networks , 2007, WEC.

[39]  Hsinchun Chen,et al.  Document clustering for electronic meetings: an experimental comparison of two techniques , 1999, Decis. Support Syst..

[40]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[41]  F. Tay,et al.  Application of support vector machines in financial time series forecasting , 2001 .

[42]  Lorenzo Mussone,et al.  A Study of Hybrid Neural Network Approaches and the Effects of Missing Data on Traffic Forecasting , 2001, Neural Computing & Applications.

[43]  Tong-Seng Quah,et al.  Improving returns on stock investment through neural network selection , 1999 .

[44]  Tamás D. Gedeon,et al.  Rainfall prediction model using soft computing technique , 2003, Soft Comput..

[45]  Bernhard Schölkopf,et al.  Extracting Support Data for a Given Task , 1995, KDD.

[46]  Hiroshi Motoda,et al.  Feature Selection for Knowledge Discovery and Data Mining , 1998, The Springer International Series in Engineering and Computer Science.

[47]  Young-Chan Lee,et al.  Application of support vector machines to corporate credit rating prediction , 2007, Expert Syst. Appl..

[48]  Kyoung-jae Kim,et al.  Stock market prediction using artificial neural networks with optimal feature transformation , 2004, Neural Computing & Applications.

[49]  Wu Wei,et al.  A hybrid SOM-SVM method for analyzing zebra fish gene expression , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[50]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[51]  R. A. Groeneveld,et al.  Practical Nonparametric Statistics (2nd ed). , 1981 .

[52]  Francis Eng Hock Tay,et al.  Support vector machine with adaptive parameters in financial time series forecasting , 2003, IEEE Trans. Neural Networks.

[53]  Fi-John Chang,et al.  Adaptive neuro-fuzzy inference system for prediction of water level in reservoir , 2006 .

[54]  Steven Salzberg,et al.  On Comparing Classifiers: Pitfalls to Avoid and a Recommended Approach , 1997, Data Mining and Knowledge Discovery.

[55]  Olli Simula,et al.  Combining linear equalization and self-organizing adaptation in dynamic discrete-signal detection , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[56]  Pei-Chann Chang,et al.  Combining SOM and fuzzy rule base for flow time prediction in semiconductor manufacturing factory , 2006, Appl. Soft Comput..

[57]  Kazuo Asakawa,et al.  Stock market prediction system with modular neural networks , 1990, 1990 IJCNN International Joint Conference on Neural Networks.