Perceptual Indexing of Multivariate Time Series

We consider the problem of deriving compressed perceptual representation of multivariate time series and using it for efficient indexing and similarity search. Our algorithm is based on the identification of perceptual skeletons in multidimensional space and the use of these "simplifications" in similarity measurements. We illustrate the performance of the algorithm in a financial modeling application. Our results indicate that the skeleton representation outperforms the traditional approaches and is robust enough to be used even with the simplest distance metrics.

[1]  Alberto O. Mendelzon,et al.  Similarity-based queries for time series data , 1997, SIGMOD '97.

[2]  Haixun Wang,et al.  Landmarks: a new model for similarity-based pattern querying in time series databases , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[3]  Robert H. Shumway,et al.  Discrimination and Clustering for Multivariate Time Series , 1998 .

[4]  Fl Chung,et al.  Financial time series indexing based on low resolution clustering , 2004 .

[5]  David H. Douglas,et al.  ALGORITHMS FOR THE REDUCTION OF THE NUMBER OF POINTS REQUIRED TO REPRESENT A DIGITIZED LINE OR ITS CARICATURE , 1973 .

[6]  Eugene Fink,et al.  Search for Patterns in Compressed Time Series , 2002, Int. J. Image Graph..

[7]  Clu-istos Foutsos,et al.  Fast subsequence matching in time-series databases , 1994, SIGMOD '94.

[8]  Jaideep Srivastava,et al.  Event detection from time series data , 1999, KDD '99.

[9]  Hernando Ombao,et al.  Discrimination and Classification of Nonstationary Time Series Using the SLEX Model , 2004 .

[10]  Kyuseok Shim,et al.  Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases , 1995, VLDB.

[11]  Christos Faloutsos,et al.  Efficient retrieval of similar time sequences under time warping , 1998, Proceedings 14th International Conference on Data Engineering.

[12]  Fei Wu,et al.  Knowledge discovery in time-series databases , 2001 .

[13]  Konstantinos Kalpakis,et al.  Distance measures for effective clustering of ARIMA time-series , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[14]  Ada Wai-Chee Fu,et al.  Efficient time series matching by wavelets , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[15]  Piotr Indyk,et al.  Mining the stock market (extended abstract): which measure is best? , 2000, KDD '00.

[16]  Dragomir Anguelov,et al.  Mining The Stock Market : Which Measure Is Best ? , 2000 .

[17]  Robert H. Shumway,et al.  Discrimination and Clustering for Multivariate Time Series , 1998 .

[18]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.