Integration of nonlinear independent component analysis and support vector regression for stock price forecasting

Forecasting stock prices is a major activity of financial firms and private investors. In developing a stock price forecasting model, the first step is usually feature extraction. Nonlinear independent component analysis (NLICA), a novel feature extraction technique that assumes the observed mixtures are non-linear combinations of latent source signals, is used to find independent sources when observed data are mixtures of unknown sources, and prior knowledge of the mixing mechanisms is not available. In this paper, a stock price forecasting model which first uses NLICA as preprocessing to extract features from forecasting variables is developed. Then the features, called independent components (ICs), serve as the inputs of support vector regression (SVR) to build the forecasting model. The advantage of the proposed methodology is that the information hidden in the original data can be discovered by feature extraction. Therefore, NLICA can provide more valuable information for financial forecasting. Two datasets of major Asian stock markets-China and Japan, Shanghai Stock Exchange Composite (SSEC) and Nikkei 225 stock indexes, are used as illustrative examples. For comparison, the integration of traditional principal component analysis (PCA) with SVR (called PCA-SVR), linear ICA with SVR (called LICA-SVR) and single SVR approaches were applied to evaluate the prediction accuracy of the proposed approach. Empirical results show that the proposed method (NLICA-SVR) not only improves the prediction accuracy of the SVR approach but also outperforms the PCA-SVR, LICA-SVR and single SVR methods.

[1]  Chi-Jie Lu,et al.  Combining independent component analysis and growing hierarchical self-organizing maps with support vector regression in product demand forecasting , 2010 .

[2]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[3]  Q. Henry Wu,et al.  Electric Load Forecasting Based on Locally Weighted Support Vector Regression , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[4]  Andrew D. Back,et al.  A First Application of Independent Component Analysis to Extracting Structure from Stock Returns , 1997, Int. J. Neural Syst..

[5]  Guoqing Wang,et al.  Estimation of source spectra profiles and simultaneous determination of polycomponent in mixtures from ultraviolet spectra data using kernel independent component analysis and support vector regression. , 2007, Analytica chimica acta.

[6]  Francis Eng Hock Tay,et al.  Financial Forecasting Using Support Vector Machines , 2001, Neural Computing & Applications.

[7]  Aapo Hyvärinen,et al.  Nonlinear independent component analysis: Existence and uniqueness results , 1999, Neural Networks.

[8]  L. B. Almeida,et al.  Linear and nonlinear ICA based on mutual information , 2000, Proceedings of the IEEE 2000 Adaptive Systems for Signal Processing, Communications, and Control Symposium (Cat. No.00EX373).

[9]  Jürgen Schmidhuber,et al.  Feature Extraction Through LOCOCODE , 1999, Neural Computation.

[10]  Hujun Yin,et al.  Image denoising using self-organizing map-based nonlinear independent component analysis , 2002, Neural Networks.

[11]  E. Oja,et al.  Independent Component Analysis , 2013 .

[12]  Juha Karhunen,et al.  Advances in blind source separation (BSS) and independent component analysis (ICA) for nonlinear mixtures , 2004, Int. J. Neural Syst..

[13]  Guoqiang Peter Zhang,et al.  An investigation of neural networks for linear time-series forecasting , 2001, Comput. Oper. Res..

[14]  E. Oja,et al.  Independent component analysis for financial time series , 2000, Proceedings of the IEEE 2000 Adaptive Systems for Signal Processing, Communications, and Control Symposium (Cat. No.00EX373).

[15]  J. Amini OPTIMUM LEARNING RATE IN BACK-PROPAGATION N EURAL NETWORK FOR CLASSIFICATION OF SATELLITE IMAGES (IRS-ID) , 2008 .

[16]  Kimon P. Valavanis,et al.  Surveying stock market forecasting techniques - Part II: Soft computing methods , 2009, Expert Syst. Appl..

[17]  William Leigh,et al.  Forecasting the New York stock exchange composite index with past price and interest rate on condition of volume spike , 2005, Expert Syst. Appl..

[18]  Mary E. Thomson,et al.  Performance evaluation of judgemental directional exchange rate predictions B , 2005 .

[19]  Cheng-Hua Wang,et al.  Support vector regression with genetic algorithms in forecasting tourism demand , 2007 .

[20]  Ming-Ming Lai,et al.  An Examination of the Random Walk Model and Technical Trading Rules in the Malaysian Stock Market , 2002 .

[21]  Chih-Chou Chiu,et al.  Forecasting stock price using Nonlinear independent component analysis and support vector regression , 2009, 2009 IEEE International Conference on Industrial Engineering and Engineering Management.

[22]  C.G. Puntonet,et al.  Time series prediction using ICA algorithms , 2003, Second IEEE International Workshop on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications, 2003. Proceedings.

[23]  Feifeng Zheng,et al.  Hybrid evolutionary algorithms in a SVR traffic flow forecasting model , 2011, Appl. Math. Comput..

[24]  P. Y. Mok,et al.  An ICA design of intraday stock prediction models with automatic variable selection , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[25]  ChangKyoo Yoo,et al.  Statistical process monitoring with independent component analysis , 2004 .

[26]  Francis Eng Hock Tay,et al.  Support vector machine with adaptive parameters in financial time series forecasting , 2003, IEEE Trans. Neural Networks.

[27]  Feifeng Zheng,et al.  Forecasting urban traffic flow by SVR with continuous ACO , 2011 .

[28]  Chih-Chou Chiu,et al.  Neural network forecasting of an opening cash price index , 2002, Int. J. Syst. Sci..

[29]  Vladimir Vapnik,et al.  An overview of statistical learning theory , 1999, IEEE Trans. Neural Networks.

[30]  Tian-Shyug Lee,et al.  Investigating the information content of non-cash-trading index futures using neural networks , 2002, Expert Syst. Appl..

[31]  Lai-Wan Chan,et al.  Nonlinear independent component analysis with minimal nonlinear distortion , 2007, ICML '07.

[32]  Chih-Chou Chiu,et al.  Financial time series forecasting using independent component analysis and support vector regression , 2009, Decis. Support Syst..

[33]  Kyoung-jae Kim,et al.  Financial time series forecasting using support vector machines , 2003, Neurocomputing.

[34]  Lijuan Cao,et al.  Support vector machines experts for time series forecasting , 2003, Neurocomputing.

[35]  Ying Chen,et al.  Improving option price forecasts with neural networks and support vector regressions , 2009, Neurocomputing.

[36]  Yunqian Ma,et al.  Practical selection of SVM parameters and noise estimation for SVM regression , 2004, Neural Networks.

[37]  Juha Karhunen,et al.  Nonlinear Independent Component Analysis Using Ensemble Learning: Experiments and Discussion , 2000 .

[38]  F. Diebold,et al.  Comparing Predictive Accuracy , 1994, Business Cycles.

[39]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[40]  L. J. Cao,et al.  Feature extraction in support vector machine: a comparison of PCA, XPCA and ICA , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..

[41]  Chih-Chou Chiu,et al.  Integrating Nonlinear Independent Component Analysis and Neural Network in Stock Price Prediction , 2009, IEA/AIE.

[42]  Luís B. Almeida,et al.  MISEP -- Linear and Nonlinear ICA Based on Mutual Information , 2003, J. Mach. Learn. Res..

[43]  Wei-Chiang Hong,et al.  Traffic flow forecasting by seasonal SVR with chaotic simulated annealing algorithm , 2011, Neurocomputing.

[44]  Chih-Chou Chiu,et al.  Predicting Stock Index Using an Integrated Model of NLICA, SVR and PSO , 2011, ISNN.

[45]  Michael Y. Hu,et al.  Forecasting with artificial neural networks: The state of the art , 1997 .

[46]  Marian Stewart Bartlett,et al.  Face recognition by independent component analysis , 2002, IEEE Trans. Neural Networks.

[47]  Alfredo Vellido,et al.  Neural networks in business: a survey of applications (1992–1998) , 1999 .

[48]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[49]  F. Tay,et al.  Application of support vector machines in financial time series forecasting , 2001 .