论文信息 - Performance bottleneck in time-series subsequence matching

Performance bottleneck in time-series subsequence matching

This paper addresses a performance bottleneck in time-series subsequence matching. First, we analyze the disk access and CPU processing times required during the index searching and post-processing steps of subsequence matching through preliminary experiments. Based on their results, we show that the post-processing step is a main performance bottleneck in subsequence matching. In order to resolve the performance bottleneck, we propose a simple yet quite effective method that processes the post-processing step. By rearranging the order of candidate subsequences to be compared with a query sequence, our method completely eliminates the redundancies of disk accesses and CPU processing occurring in the post-processing step. We show that our method is optimal and also does not incur any false dismissal. Also, we justify the effectiveness of our method by extensive experiments.

Sang-Wook Kim | Byeong-Soo Jeong

[1] Hans-Peter Kriegel,et al. The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[2] Sang-Wook Kim,et al. Index interpolation: an approach to subsequence matching supporting normalization transform in time-series databases , 2000, CIKM '00.

[3] Christos Faloutsos,et al. Efficient Similarity Search In Sequence Databases , 1993, FODO.

[4] Eamonn J. Keogh,et al. Exact indexing of dynamic time warping , 2002, Knowledge and Information Systems.

[5] Davood Rafiei,et al. On similarity-based queries for time series data , 1997, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[6] Man Hon Wong,et al. Fast time-series searching with scaling and shifting , 1999, PODS '99.

[7] Hans-Jörg Schek,et al. A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces , 1998, VLDB.

[8] Clu-istos Foutsos,et al. Fast subsequence matching in time-series databases , 1994, SIGMOD '94.

[9] Yang-Sae Moon,et al. Duality-based subsequence matching in time-series databases , 2001, Proceedings 17th International Conference on Data Engineering.

[10] Wesley W. Chu,et al. An index-based approach for similarity search supporting time warping in large sequence databases , 2001, Proceedings 17th International Conference on Data Engineering.

[11] Kyuseok Shim,et al. Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases , 1995, VLDB.

[12] Christos Faloutsos,et al. Fast Time Sequence Indexing for Arbitrary Lp Norms , 2000, VLDB.

[13] Sriram Padmanabhan,et al. Prefix-querying: an approach for effective subsequence matching under time warping in sequence databases , 2001, CIKM '01.