Time series representation for anomaly detection

Anomaly detection in time series has attracted a lot of attention in the last decade, and is still a hot topic in time series mining. However, time series are high dimensional and feature correlational, directly detecting anomaly patterns in its raw format is very expensive, in addition, different time series may have different lengths of anomaly patterns, and usually, the lengths of anomaly patterns is unknown. This paper presents a new conception key point and an algorithm of seeking key points, the algorithm uses key points to rerepresent time series and still preserves its fundamental characteristics. Variable length method was used to segment re-represented time series into patterns and calculate anomaly scores of patterns. Anomaly patterns are identified by their anomaly scores automatically. The effectiveness of representational algorithm and anomaly detecting algorithm are demonstrated with both synthetic and standard datasets, and the experimental results confirm that our methods can identify anomaly patterns with different lengths and improve the speed of detecting algorithm greatly.

[1]  Eamonn J. Keogh,et al.  Disk aware discord discovery: finding unusual time series in terabyte sized datasets , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[2]  Li Wei,et al.  Experiencing SAX: a novel symbolic representation of time series , 2007, Data Mining and Knowledge Discovery.

[3]  Eamonn J. Keogh,et al.  Exact indexing of dynamic time warping , 2002, Knowledge and Information Systems.

[4]  M. Leng,et al.  Variable Length Methods for Detecting Anomaly Patterns in Time Series , 2008, International Symposium on Computational Intelligence and Design.

[5]  Ada Wai-Chee Fu,et al.  Efficient time series matching by wavelets , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[6]  Fabian Mörchen,et al.  Optimizing time series discretization for knowledge discovery , 2005, KDD '05.

[7]  Junshui Ma,et al.  Online novelty detection on temporal sequences , 2003, KDD '03.

[8]  Qiang Wang,et al.  A symbolic representation of time series , 2005, Proceedings of the Eighth International Symposium on Signal Processing and Its Applications, 2005..

[9]  Tak-chung Fu,et al.  A specialized binary tree for financial time series representation , 2004 .

[10]  Jae-Gil Lee,et al.  Trajectory Outlier Detection: A Partition-and-Detect Framework , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[11]  Philip K. Chan,et al.  Modeling multiple time series for anomaly detection , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[12]  Eamonn J. Keogh,et al.  HOT SAX: efficiently finding the most unusual time series subsequence , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[13]  Anthony K. H. Tung,et al.  SpADe: On Shape-based Pattern Detection in Streaming Time Series , 2007, 2007 IEEE 23rd International Conference on Data Engineering.