OBST-based segmentation approach to financial time series

Financial time series data are large in size and dynamic and non-linear in nature. Segmentation is often performed as a pre-processing step for locating technical patterns in financial time series. In this paper, we propose a segmentation method based on Turning Points (TPs). The proposed method selects TPs from the financial time series in question based on their degree of importance. A TP's degree of importance is calculated on the basis of its contribution to the preservation of the trends and shape of the time series. Algorithms are also devised to store the selected TPs in an Optimal Binary Search Tree (OBST) and to reconstruct the reduced sample time series. Comparison with existing approaches show that the time series reconstructed by the proposed method is able to maintain the shape of the original time series very well and preserve more trends. Our approach also ensures that the average retrieval cost is kept at a minimum.

[1]  R. R. Prechter,et al.  Elliott Wave Principle: Key to Market Behavior , 1978 .

[2]  Wesley W. Chu,et al.  Efficient searches for similar subsequences of different lengths in sequence databases , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[3]  Theodosios Pavlidis,et al.  Segmentation of Plane Curves , 1974, IEEE Transactions on Computers.

[4]  Eamonn J. Keogh Fast similarity search in the presence of longitudinal scaling in time series databases , 1997, Proceedings Ninth IEEE International Conference on Tools with Artificial Intelligence.

[5]  Bernhard Sick,et al.  Online Segmentation of Time Series Based on Polynomial Least-Squares Approximations , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Paul S. Heckbert,et al.  Survey of Polygonal Surface Simplification Algorithms , 1997 .

[7]  Haixun Wang,et al.  Landmarks: a new model for similarity-based pattern querying in time series databases , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[8]  Neil D. Lawrence,et al.  Gaussian Process Latent Variable Models for Visualisation of High Dimensional Data , 2003, NIPS.

[9]  Huaiqing Wang,et al.  Pattern Recognition in Stock Data Based on a New Segmentation Algorithm , 2007, KSEM.

[10]  Man Hon Wong,et al.  Fast time-series searching with scaling and shifting , 1999, PODS '99.

[11]  Tak-Chung Fu,et al.  Evolutionary time series segmentation for stock data mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[12]  Kimon P. Valavanis,et al.  Surveying stock market forecasting techniques - Part II: Soft computing methods , 2009, Expert Syst. Appl..

[13]  David D. Jensen,et al.  Mining of Concurrent Text and Time Series , 2008 .

[14]  Eamonn Keogh Mining Time Series Data , 2005 .

[15]  Ken'ichi Kamijo,et al.  Stock price pattern recognition-a recurrent neural network approach , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[16]  Maja J. Mataric,et al.  A spatio-temporal extension to Isomap nonlinear dimension reduction , 2004, ICML.

[17]  David J. Fleet,et al.  Gaussian Process Dynamical Models , 2005, NIPS.

[18]  Tak-Chung Fu,et al.  Representing financial time series based on data point importance , 2008, Eng. Appl. Artif. Intell..

[19]  Donghui Zhang,et al.  Online event-driven subsequence matching over financial data streams , 2004, SIGMOD '04.

[20]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[21]  Pei-Chann Chang,et al.  Integrating a Piecewise Linear Representation Method and a Neural Network Model for Stock Trading Points Prediction , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[22]  Yung-Keun Kwon,et al.  A hybrid system integrating a piecewise linear representation and a neural network for stock prediction , 2011, Proceedings of 2011 6th International Forum on Strategic Technology.

[23]  Ambuj K. Singh,et al.  Variable length queries for time series data , 2001, Proceedings 17th International Conference on Data Engineering.

[24]  Eamonn J. Keogh,et al.  An online algorithm for segmenting time series , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[25]  William Hamilton,et al.  The Stock Market Barometer; a Study of its Forecast Value Based on Charles H. Dow's Theory of the Price Movement. With an Analysis of the Market and its History Since 1897 , 1989 .

[26]  Ronald L. Rivest,et al.  Introduction to Algorithms, third edition , 2009 .

[27]  Eamonn J. Keogh,et al.  Relevance feedback retrieval of time series data , 1999, SIGIR '99.

[28]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[29]  Tak-Chung Fu,et al.  Stock time series pattern matching: Template-based vs. rule-based approaches , 2007, Eng. Appl. Artif. Intell..

[30]  Tak-Chung Fu,et al.  A review on time series data mining , 2011, Eng. Appl. Artif. Intell..

[31]  Zhiguo Gong,et al.  Financial time series segmentation based on Turning Points , 2011, Proceedings 2011 International Conference on System Science and Engineering.

[32]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[33]  Eamonn J. Keogh,et al.  A Probabilistic Approach to Fast Pattern Matching in Time Series Databases , 1997, KDD.

[34]  Huaiqing Wang,et al.  A New Segmentation Algorithm to Stock Time Series Based on PIP Approach , 2007, 2007 International Conference on Wireless Communications, Networking and Mobile Computing.

[35]  Jean-Christophe Nebel,et al.  Temporal Extension of Laplacian Eigenmaps for Unsupervised Dimensionality Reduction of Time Series , 2010, 2010 20th International Conference on Pattern Recognition.

[36]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[37]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[38]  Ambuj K. Singh,et al.  Dimensionality Reduction for Similarity Searching in Dynamic Databases , 1999, Comput. Vis. Image Underst..

[39]  Joaquin Quiñonero Candela,et al.  Local distance preservation in the GP-LVM through back constraints , 2006, ICML.

[40]  Rui Zhang,et al.  A real time hybrid pattern matching scheme for stock time series , 2010, ADC.

[41]  Zehong Yang,et al.  Intelligent stock trading system by turning point confirming and probabilistic reasoning , 2008, Expert Syst. Appl..

[42]  Tak-chung Fu,et al.  Flexible time series pattern matching based on perceptually important points , 2001 .

[43]  Ada Wai-Chee Fu,et al.  Efficient time series matching by wavelets , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[44]  Eamonn J. Keogh,et al.  A Simple Dimensionality Reduction Technique for Fast Similarity Search in Large Time Series Databases , 2000, PAKDD.

[45]  Eamonn J. Keogh,et al.  Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases , 2001, Knowledge and Information Systems.