Optimal multi-scale patterns in time series streams

We introduce a method to discover optimal local patterns, which concisely describe the main trends in a time series. Our approach examines the time series at multiple time scales (i.e., window sizes) and efficiently discovers the key patterns in each. We also introduce a criterion to select the best window sizes, which most concisely capture the key oscillatory as well as aperiodic trends. Our key insight lies in learning an optimal orthonormal transform from the data itself, as opposed to using a predetermined basis or approximating function (such as piecewise constant, short-window Fourier or wavelets), which essentially restricts us to a particular family of trends. We go one step further, lifting even that limitation. Furthermore, our method lends itself to fast, incremental estimation in a streaming setting. Experimental evaluation shows that our method can capture meaningful patterns in a variety of settings. Our streaming approach requires order of magnitude less time and space, while still producing concise and informative patterns.

[1]  Clu-istos Foutsos,et al.  Fast subsequence matching in time-series databases , 1994, SIGMOD '94.

[2]  Christos Faloutsos,et al.  Adaptive, unsupervised stream mining , 2004, The VLDB Journal.

[3]  Dimitrios Gunopulos,et al.  Iterative Incremental Clustering of Time Series , 2004, EDBT.

[4]  M. Portnoff Short-time Fourier analysis of sampled speech , 1981 .

[5]  Yixin Chen,et al.  Multi-Dimensional Regression Analysis of Time-Series Data Streams , 2002, VLDB.

[6]  Philip S. Yu,et al.  Integrating DCT and DWT for approximating cube streams , 2005, CIKM '05.

[7]  Dimitrios Gunopulos,et al.  Correlating synchronous and asynchronous data streams , 2003, KDD '03.

[8]  Dimitrios Gunopulos,et al.  Identifying similarities, periodicities and bursts for online search queries , 2004, SIGMOD '04.

[9]  Michael Ghil,et al.  ADVANCED SPECTRAL METHODS FOR CLIMATIC TIME SERIES , 2002 .

[10]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[11]  Eamonn J. Keogh,et al.  Probabilistic discovery of time series motifs , 2003, KDD '03.

[12]  Eamonn J. Keogh,et al.  Mining motifs in massive time series databases , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[13]  Dimitrios Gunopulos,et al.  A data compression technique for sensor networks with dynamic bandwidth allocation , 2005, 12th International Symposium on Temporal Representation and Reasoning (TIME'05).

[14]  Jimeng Sun,et al.  Streaming Pattern Discovery in Multiple Time-Series , 2005, VLDB.

[15]  D. Sornette,et al.  Data-adaptive wavelets and multi-scale singular-spectrum analysis , 1998, chao-dyn/9810034.

[16]  Thomas W. Parks,et al.  Adaptive principal components and image denoising , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[17]  Nick Roussopoulos,et al.  Compressing historical information in sensor networks , 2004, SIGMOD '04.

[18]  Walid G. Aref,et al.  Using Convolution to Mine Obscure Periodic Patterns in One Pass , 2004, EDBT.

[19]  Todd R. Ogden,et al.  Wavelet Methods for Time Series Analysis , 2002 .

[20]  Aristides Gionis,et al.  Segmentation and dimensionality reduction , 2006, SDM.

[21]  Keisuke Inoue,et al.  Knowledge Discovery from Heterogeneous Dynamic Systems using Change-Point Correlations , 2005, SDM.

[22]  Matthew Brand,et al.  Fast Online SVD Revisions for Lightweight Recommender Systems , 2003, SDM.

[23]  Audra E. Kosh,et al.  Linear Algebra and its Applications , 1992 .

[24]  Qiang Wang,et al.  A multiresolution symbolic representation of time series , 2005, 21st International Conference on Data Engineering (ICDE'05).

[25]  Wim Sweldens,et al.  Building your own wavelets at home , 2000 .

[26]  Michael E. Tipping,et al.  Probabilistic Principal Component Analysis , 1999 .

[27]  Eamonn J. Keogh,et al.  Locally adaptive dimensionality reduction for indexing large time series databases , 2001, SIGMOD '01.

[28]  Walid G. Aref,et al.  WARP: time warping for periodicity detection , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[29]  Heikki Mannila,et al.  Rule Discovery from Time Series , 1998, KDD.

[30]  Dimitrios Gunopulos,et al.  Online amnesic approximation of streaming time series , 2004, Proceedings. 20th International Conference on Data Engineering.

[31]  E. Oja,et al.  Sparse code shrinkage for image denoising , 1998, 1998 IEEE International Joint Conference on Neural Networks Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36227).

[32]  Piotr Indyk,et al.  Identifying Representative Trends in Massive Time Series Data Sets Using Sketches , 2000, VLDB.

[33]  Benjamin Belzer,et al.  Wavelet filter evaluation for image compression , 1995, IEEE Trans. Image Process..

[34]  Bin Yang,et al.  Projection approximation subspace tracking , 1995, IEEE Trans. Signal Process..