Local-to-Global Unsupervised Anomaly Detection from Temporal Data

Anomaly detection for temporal data has received much attention in many real-world applications. Most existing unsupervised methods for this task follow a sequential two-step approach: clustering first, then detection. Because the clustering step is not robust to anomalous series in the data, the resulting clusters are distorted, which in turn degrades the detection step. To overcome this problem, we propose an embedded technique that performs clustering and detection simultaneously. We reformulate the task of anomaly detection as a local instance-weighting clustering problem. Anomalous series are detected both locally, within each cluster, and globally, across the data as a whole. Extensive experiments on benchmark datasets validate our approach and compare it with other state-of-the-art detection methods.
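To make the idea of joint clustering and detection concrete, the following is a minimal sketch and not the method proposed in the paper. It rests on several assumptions that do not come from the abstract: series have equal length and are compared with squared Euclidean distance (a stand-in for a temporal distance), instance weights take an exponential form `exp(-d / gamma)`, and a series is flagged when its weight falls below a fraction `tau` of the average weight. The names `instance_weighted_kmeans`, `init_idx`, `gamma`, and `tau` are hypothetical.

```python
import numpy as np

def instance_weighted_kmeans(X, init_idx, gamma=4.0, n_iter=10, tau=0.5):
    """Illustrative instance-weighted k-means (assumed form, not the paper's).

    X        : (n, T) array, one equal-length series per row.
    init_idx : row indices used as initial centers (one per cluster).
    gamma    : assumed weight bandwidth, in squared-distance units.
    n_iter   : number of iterations, must be at least 1.
    tau      : assumed flagging threshold on weight / mean weight.
    Returns (labels, local_flags, global_flags).
    """
    X = np.asarray(X, dtype=float)
    n = X.shape[0]
    centers = X[list(init_idx)].copy()
    k = centers.shape[0]
    for _ in range(n_iter):
        # Squared Euclidean distance from every series to every center.
        d = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d.argmin(axis=1)
        dist = d[np.arange(n), labels]
        local_ratio = np.ones(n)
        for j in range(k):
            members = labels == j
            if not members.any():
                continue  # leave an empty cluster's center unchanged
            # Series far from the center get small weights; subtracting the
            # minimum keeps the largest weight at 1 and avoids underflow.
            w = np.exp(-(dist[members] - dist[members].min()) / gamma)
            local_ratio[members] = w / w.mean()
            # Weighted center update: low-weight series barely move it.
            centers[j] = (w[:, None] * X[members]).sum(axis=0) / w.sum()
    # Global view: the same weighting, normalised over the whole data set.
    g = np.exp(-(dist - dist.min()) / gamma)
    global_ratio = g / g.mean()
    return labels, local_ratio < tau, global_ratio < tau
```

Because the weights enter the center update, a suspicious series pulls its cluster center less than a typical one, which is the robustness argument made above. In this sketch the local and global flags differ only in the set over which weights are normalised: within one cluster, or over all series.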
