Time series discord discovery using WAT algorithm and iSAX representation

Among several existing algorithms proposed to solve the problem of time series discord discovery, HOT SAX and WAT are two widely used algorithms. Especially, WAT can make use of the multi-resolution property in Haar wavelet transform. In this work, we employ state-of-the-art iSAX representation rather than SAX representation in WAT algorithm. To apply iSAX in WAT algorithm, we have to devise two new auxiliary functions and also modify iSAX index structure to adapt Haar transform that is used in WAT algorithm. We empirically evaluate our algorithm with a set of experiments. Experimental results show that WATiSAX algorithm is more effective than original WAT algorithm.

[1]  Huynh Tran Quoc Buu,et al.  Time Series Discord Discovery Based on iSAX Symbolic Representation , 2011, 2011 Third International Conference on Knowledge and Systems Engineering.

[2]  Eamonn J. Keogh,et al.  HOT SAX: efficiently finding the most unusual time series subsequence , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[3]  Junshui Ma,et al.  Online novelty detection on temporal sequences , 2003, KDD '03.

[4]  Eamonn J. Keogh,et al.  A symbolic representation of time series, with implications for streaming algorithms , 2003, DMKD '03.

[5]  Eamonn J. Keogh,et al.  iSAX: indexing and mining terabyte sized time series , 2008, KDD.

[6]  Eamonn J. Keogh,et al.  UCR Time Series Data Mining Archive , 1983 .

[7]  Eamonn J. Keogh,et al.  Finding surprising patterns in a time series database in linear time and space , 2002, KDD.

[8]  Ada Wai-Chee Fu,et al.  Efficient time series matching by wavelets , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[9]  Eamonn J. Keogh,et al.  Finding Time Series Discords Based on Haar Transform , 2006, ADMA.

[10]  Dipankar Dasgupta,et al.  Novelty detection in time series data using ideas from immunology , 1996 .

[11]  Jessica Lin,et al.  Finding the Unusual Medical Time Series: Algorithms and Applications , 2005 .

[12]  Philip Chan,et al.  Learning States and Rules for Time Series Anomaly Detection , 2004, FLAIRS.

[13]  Jian Pei,et al.  WAT: Finding Top-K Discords in Time Series Database , 2007, SDM.