Prefix-querying with an L1 distance metric for time-series subsequence matching under time warping

This paper discusses the way of processing time-series subsequence matching under time warping. Time warping enables sequences to be found with similar patterns even when they are of different lengths. Theprefix-querying method is the first index-based approach that efficiently performs time-series subsequence matching under time warping without false dismissals. This method employs theL distance metric as a base distance function so as to allow users to issue queries conveniently. In this paper, we extend the prefix-querying method for absorbingL 1, which is the most widely used as a base distance function in time-series subsequence matching under time warping, instead ofL. We formally prove that the prefix-querying method with theL 1 distance metric does not incur any false dismissals in the subsequence matching. To show its superiority, we conduct performance evaluation via a variety of experiments. The results reveal that our method achieves significant performance improvement over the previous methods, up to 10.7 times, with a data set containing real-world Korean stock data sequences, and up to 180 times with data sets containing a very large volume of synthetic data sequences.

[1]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[2]  Ada Wai-Chee Fu,et al.  Efficient time series matching by wavelets , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[3]  Christos Faloutsos,et al.  Fast subsequence matching in time-series databases , 1994, SIGMOD '94.

[4]  Graham A. Stephen String Searching Algorithms , 1994, Lecture Notes Series on Computing.

[5]  P. A. Blight The Analysis of Time Series: An Introduction , 1991 .

[6]  Wesley W. Chu,et al.  An index-based approach for similarity search supporting time warping in large sequence databases , 2001, Proceedings 17th International Conference on Data Engineering.

[7]  Dimitrios Gunopulos,et al.  Finding Similar Time Series , 1997, PKDD.

[8]  Alberto O. Mendelzon,et al.  Similarity-based queries for time series data , 1997, SIGMOD '97.

[9]  Wesley W. Chu,et al.  Efficient searches for similar subsequences of different lengths in sequence databases , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[10]  Sang-Wook Kim,et al.  Index interpolation: an approach to subsequence matching supporting normalization transform in time-series databases , 2000, CIKM '00.

[11]  Kyuseok Shim,et al.  Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases , 1995, VLDB.

[12]  Journal of Information Science , 1984 .

[13]  Eamonn Keogh Exact Indexing of Dynamic Time Warping , 2002, VLDB.

[14]  Sriram Padmanabhan,et al.  Prefix-querying: an approach for effective subsequence matching under time warping in sequence databases , 2001, CIKM '01.

[15]  Dina Q. Goldin,et al.  On Similarity Queries for Time-Series Data: Constraint Specification and Implementation , 1995, CP.

[16]  Christos Faloutsos,et al.  Fast Time Sequence Indexing for Arbitrary Lp Norms , 2000, VLDB.

[17]  Donald J. Berndt,et al.  Finding Patterns in Time Series: A Dynamic Programming Approach , 1996, Advances in Knowledge Discovery and Data Mining.

[18]  Sang-Wook Kim,et al.  Optimization of subsequence matching under time warping in time-series databases , 2005, SAC '05.

[19]  Man Hon Wong,et al.  Fast time-series searching with scaling and shifting , 1999, PODS '99.

[20]  Wesley W. Chu,et al.  Efficient processing of similarity search under time warping in sequence databases: an index-based approach , 2004, Inf. Syst..

[21]  Philip S. Yu,et al.  Data Mining: An Overview from a Database Perspective , 1996, IEEE Trans. Knowl. Data Eng..

[22]  W. K. Loh Index Interpolation: A Subsequence Matching Algroithm Supporting Moving Average Transforms of Arbitrary Order in Time-Series Databases , 2001 .

[23]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[24]  Christos Faloutsos,et al.  Efficient retrieval of similar time sequences under time warping , 1998, Proceedings 14th International Conference on Data Engineering.