Integrating piecewise linear representation and weighted support vector machine for stock trading signal prediction

Piecewise linear representation (PLR) and back-propagation artificial neural network (BPN) have been integrated for the stock trading signal prediction recently (PLR-BPN). However, there are some disadvantages in avoiding over-fitting, trapping in local minimum and choosing the threshold of the trading decision. Since support vector machine (SVM) has a good way to avoid over-fitting and trapping in local minimum, we integrate PLR and weighted SVM (WSVM) to forecast the stock trading signals (PLR-WSVM). The new characteristics of PLR-WSVM are as follows: (1) the turning points obtained from PLR are set by different weights according to the change rate of the closing price between the current turning point and the next one, in which the weight reflects the relative importance of each turning point; (2) the prediction of stock trading signal is formulated as a weighted four-class classification problem, in which it does not need to determine the threshold of trading decision; (3) WSVM is used to model the relationship between the trading signal and the input variables, which improves the generalization performance of prediction model; (4) the history dataset is divided into some overlapping training-testing sets rather than training-validation-testing, which not only makes use of data fully but also reduces the time variability of data; and (5) some new technical indicators representing investors' sentiment are added to the input variables, which improves the prediction performance. The comparative experiments among PLR-WSVM, PLR-BPN and buy-and-hold strategy (BHS) on 20 shares from Shanghai Stock Exchange in China show that the prediction accuracy and profitability of PLR-WSVM are all the best, which indicates PLR-WSVM is effective and can be used in the stock trading signal prediction.

[1]  Chih-Jen Lin,et al.  Working Set Selection Using Second Order Information for Training Support Vector Machines , 2005, J. Mach. Learn. Res..

[2]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[3]  Eamonn J. Keogh,et al.  An online algorithm for segmenting time series , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[4]  Amir F. Atiya,et al.  Introduction to financial forecasting , 1996, Applied Intelligence.

[5]  S. Sathiya Keerthi,et al.  Improvements to Platt's SMO Algorithm for SVM Classifier Design , 2001, Neural Computation.

[6]  Kyuseok Shim,et al.  Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases , 1995, VLDB.

[7]  Jason Weston,et al.  Support vector machines for multi-class pattern recognition , 1999, ESANN.

[8]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[9]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[10]  Haixun Wang,et al.  Landmarks: a new model for similarity-based pattern querying in time series databases , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[11]  Pei-Chann Chang,et al.  Integrating a Piecewise Linear Representation Method and a Neural Network Model for Stock Trading Points Prediction , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[12]  Pei-Chann Chang,et al.  A dynamic threshold decision system for stock trading signal detection , 2011, Appl. Soft Comput..

[13]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[14]  Donghui Zhang,et al.  Online event-driven subsequence matching over financial data streams , 2004, SIGMOD '04.

[15]  J. Platt Sequential Minimal Optimization : A Fast Algorithm for Training Support Vector Machines , 1998 .

[16]  Eamonn J. Keogh,et al.  Segmenting Time Series: A Survey and Novel Approach , 2002 .

[17]  Xu Wenhua,et al.  Training neural network with genetic algorithms for forecasting the stock price index , 1997, 1997 IEEE International Conference on Intelligent Processing Systems (Cat. No.97TH8335).

[18]  E. Fama,et al.  Filter Rules and Stock-Market Trading , 1966 .

[19]  Dongsong Zhang,et al.  Discovering golden nuggets: data mining in financial application , 2004, IEEE Trans. Syst. Man Cybern. Part C.

[20]  Heikki Mannila,et al.  Rule Discovery from Time Series , 1998, KDD.

[21]  Perry J. Kaufman,et al.  Trading Systems and Methods , 1997 .

[22]  Sheng-De Wang,et al.  Fuzzy support vector machines , 2002, IEEE Trans. Neural Networks.

[23]  Alexander J. Smola,et al.  Learning with Kernels: support vector machines, regularization, optimization, and beyond , 2001, Adaptive computation and machine learning series.

[24]  M. Ready,et al.  Profits from Technical Trading Rules , 1998 .

[25]  Byung Ro Moon,et al.  A Hybrid Neurogenetic Approach for Stock Forecasting , 2007, IEEE Transactions on Neural Networks.

[26]  Francis Eng Hock Tay,et al.  Support vector machine with adaptive parameters in financial time series forecasting , 2003, IEEE Trans. Neural Networks.

[27]  Hong Peng,et al.  Improving the Computational Efficiency of Recursive Cluster Elimination for Gene Selection , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[28]  Franklin Allen,et al.  Using genetic algorithms to find technical trading rules , 1999 .

[29]  Chunhua Zhang,et al.  The new interpretation of support vector machines on statistical learning theory , 2010 .

[30]  Ada Wai-Chee Fu,et al.  Efficient time series matching by wavelets , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).