Change Detection in Time Series Data Using Wavelet Footprints

In this paper, we propose a novel approach to address the problem of change detection in time series data. Our approach is based on wavelet footprints proposed originally by the signal processing community for signal compression. We, however, exploit the properties of footprints to capture discontinuities in a signal. We show that transforming data using footprints generates nonzero coefficients only at the change points. Exploiting this property, we propose a change detection query processing scheme which employs footprint-transformed data to identify change points, their amplitudes, and degrees of change efficiently and accurately. Our analytical and empirical results show that our approach outperforms the best known change detection approach in terms of both performance and accuracy. Furthermore, unlike the state of the art approaches, our query response time is independent of the number of change points and the user-defined change threshold.

[1]  Kenji Yamanishi,et al.  A unifying framework for detecting outliers and change points from non-stationary time series data , 2002, KDD.

[2]  Cyrus Shahabi,et al.  ProPolyne: A Fast Wavelet-Based Algorithm for Progressive Evaluation of Polynomial Range-Sum Queries , 2002, EDBT.

[3]  R. C. Earlougher Advances in Well Test Analysis , 1977 .

[4]  Cyrus Shahabi,et al.  ProDA: a suite of web-services for progressive data analysis , 2005, SIGMOD '05.

[5]  Dimitris Sacharidis,et al.  SHIFT-SPLIT: I/O efficient maintenance of wavelet-transformed multidimensional data , 2005, SIGMOD '05.

[6]  Padhraic Smyth,et al.  Segmental semi-markov models and applications to sequence analysis , 2002 .

[7]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[8]  Kyuseok Shim,et al.  Approximate query processing using wavelets , 2001, The VLDB Journal.

[9]  Eamonn J. Keogh,et al.  An online algorithm for segmenting time series , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[10]  Stéphane Mallat,et al.  Singularity detection and processing with wavelets , 1992, IEEE Trans. Inf. Theory.

[11]  Jaideep Srivastava,et al.  Event detection from time series data , 1999, KDD '99.

[12]  Paul R. Cohen,et al.  Segmenting time series with a hybrid neural networks - hidden Markov model , 2002, AAAI/IAAI.

[13]  Alan V. Oppenheim,et al.  Digital Signal Processing , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[14]  Henri Cohen,et al.  A course in computational algebraic number theory , 1993, Graduate texts in mathematics.

[15]  Konstantinos Kalpakis,et al.  Adaptive Methods for Activity Monitoring of Streaming Data , 2002, ICMLA.

[16]  Eamonn J. Keogh,et al.  Clustering of streaming time series is meaningless , 2003, DMKD '03.

[17]  Matthias Jarke,et al.  Advances in Database Technology — EDBT 2002 , 2002, Lecture Notes in Computer Science.

[18]  Eamonn J. Keogh,et al.  Locally adaptive dimensionality reduction for indexing large time series databases , 2001, SIGMOD '01.

[19]  Martin Vetterli,et al.  Wavelet transform footprints: catching singularities for compression and denoising , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[20]  Martin Vetterli,et al.  Deconvolution with wavelet footprints for ill-posed inverse problems , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[21]  Martin Vetterli,et al.  Wavelet footprints: theory, algorithms, and applications , 2003, IEEE Trans. Signal Process..

[22]  Jan Chomicki,et al.  Hippo: A System for Computing Consistent Answers to a Class of SQL Queries , 2004, EDBT.

[23]  Dimitrios Gunopulos,et al.  Iterative Incremental Clustering of Time Series , 2004, EDBT.

[24]  Jeffrey Scott Vitter,et al.  Wavelet-based histograms for selectivity estimation , 1998, SIGMOD '98.

[25]  Yixin Chen,et al.  Multi-Dimensional Regression Analysis of Time-Series Data Streams , 2002, VLDB.

[26]  Eamonn J. Keogh,et al.  A Probabilistic Approach to Fast Pattern Matching in Time Series Databases , 1997, KDD.