Effective Pattern Similarity Match for Multidimensional Sequence Data Sets

In this paper we present an effective pattern similarity match algorithm for multidimensional sequence data sets such as video streams and various analog or digital signals. To approximate a sequence of data points we introduce a trend vectorthat captures the moving trend of the sequence. Using the trend vector, our method is designed to filter out irrelevant sequences from a database and to find similar sequences with respect to a query. Experimental results show that it provides a lower reconstruction error and a higher precision rate compared to existing methods.

[1]  Theodosios Pavlidis,et al.  Waveform Segmentation Through Functional Approximation , 1973, IEEE Transactions on Computers.

[2]  Eamonn J. Keogh,et al.  Locally adaptive dimensionality reduction for indexing large time series databases , 2001, SIGMOD '01.

[3]  Christos Faloutsos,et al.  A signature technique for similarity-based queries , 1997, Proceedings. Compression and Complexity of SEQUENCES 1997 (Cat. No.97TB100171).

[4]  Deok-Hwan Kim,et al.  Similarity search for multidimensional data sequences , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[5]  Changzhou Wang,et al.  Supporting content-based searches on time series via approximation , 2000, Proceedings. 12th International Conference on Scientific and Statistica Database Management.

[6]  Eamonn J. Keogh,et al.  A Probabilistic Approach to Fast Pattern Matching in Time Series Databases , 1997, KDD.

[7]  Christos Faloutsos,et al.  Fast Time Sequence Indexing for Arbitrary Lp Norms , 2000, VLDB.

[8]  Eamonn J. Keogh,et al.  Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases , 2001, Knowledge and Information Systems.

[9]  Chin-Wan Chung,et al.  Hyper-rectangle based segmentation and clustering of large video data sets , 2002, Inf. Sci..