Online anomaly detection on the webscope S5 dataset: A comparative study

An unresolved challenge for all kind of temporal data is the reliable anomaly detection, especially when adaptability is required in the case of non-stationary time series or when the nature of future anomalies is unknown or only vaguely defined. Most of the current anomaly detection algorithms follow the general idea to classify an anomaly as a significant deviation from the prediction. In this paper we present a comparative study where several online anomaly detection algorithms are compared on the large Yahoo Webscope S5 anomaly benchmark. We show that a relatively Simple Online Regression Anomaly Detector (SORAD) is quite successful compared to other anomaly detectors. We discuss the importance of several adaptive and online elements of the algorithm and their influence on the overall anomaly detection accuracy.

[1]  B. Welford Note on a Method for Calculating Corrected Sums of Squares and Products , 1962 .

[2]  Victoria J. Hodge,et al.  A Survey of Outlier Detection Methodologies , 2004, Artificial Intelligence Review.

[3]  Reda Alhajj,et al.  A comprehensive survey of numeric and symbolic outlier mining techniques , 2006, Intell. Data Anal..

[4]  Jung-Min Park,et al.  An overview of anomaly detection techniques: Existing solutions and latest technological trends , 2007, Comput. Networks.

[5]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[6]  Dileep George,et al.  Towards a Mathematical Theory of Cortical Micro-circuits , 2009, PLoS Comput. Biol..

[7]  Miguel Lázaro-Gredilla,et al.  Estimation of the forgetting factor in kernel recursive least squares , 2012, 2012 IEEE International Workshop on Machine Learning for Signal Processing.

[8]  Bernhard Sendhoff,et al.  Learning-Guided Exploration in Airfoil Optimization , 2013, IDEAL.

[9]  Bernhard Sendhoff,et al.  Novelty and interestingness measures for design-space exploration , 2013, GECCO '13.

[10]  Gianluca Bontempi,et al.  Statistical foundations of machine learning , 2013 .

[11]  Arun Kejariwal,et al.  A Novel Technique for Long-Term Anomaly Detection in the Cloud , 2014, HotCloud.

[12]  Subutai Ahmad,et al.  Evaluating Real-Time Anomaly Detection Algorithms -- The Numenta Anomaly Benchmark , 2015, 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA).

[13]  Rob J. Hyndman,et al.  Large-Scale Unusual Time Series Detection , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[14]  Seungjin Choi,et al.  Echo-state conditional variational autoencoder for anomaly detection , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[15]  Erik Elmroth,et al.  Real-time detection of performance anomalies for cloud services , 2016, 2016 IEEE/ACM 24th International Symposium on Quality of Service (IWQoS).