Finding Motifs of Financial Data Streams in Real Time

Finding motifs of financial data streams in real time is a very interesting and valuable work. We hope to find the motif existing in financial data streams on local trend subsequence. A stock market trader might use such a tool to spot arbitrage opportunities or escape the underlying venture. The paper introduces a novel distance measurement, that is SDD (Slope Duration Distance), for local subsequences. At the same time, we propose an efficient algorithm of motif discovery over a great deal of financial data streams, that is PMDGS (P-Motif Discovery based on Grid Structure), which make use of PLA (Piecewise Linear Approximation) technology and grid structure. Extensive experiments on synthetic data and real world financial trading data show that our model provides several orders of magnitude performance improvement relative to traditional naive linear scan techniques.

[1]  Eamonn J. Keogh,et al.  Exact indexing of dynamic time warping , 2002, Knowledge and Information Systems.

[2]  Eamonn J. Keogh,et al.  Detecting time series motifs under uniform scaling , 2007, KDD '07.

[3]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[4]  David B. Lomet,et al.  Foundations of Data Organization and Algorithms , 1993, Lecture Notes in Computer Science.

[5]  Eamonn J. Keogh,et al.  Mining motifs in massive time series databases , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[6]  Eamonn J. Keogh,et al.  Finding Motifs in a Database of Shapes , 2007, SDM.

[7]  Eamonn J. Keogh,et al.  Probabilistic discovery of time series motifs , 2003, KDD '03.

[8]  Donald J. Berndt,et al.  Finding Patterns in Time Series: A Dynamic Programming Approach , 1996, Advances in Knowledge Discovery and Data Mining.

[9]  Yunhao Liu,et al.  Indexable PLA for Efficient Similarity Search , 2007, VLDB.