Anomaly Detection in Univariate Time-series: A Survey on the State-of-the-Art

Anomaly detection for time-series data has long been an important research field. Seminal work on anomaly detection focused on statistical approaches. In recent years, an increasing number of machine learning algorithms have been developed to detect anomalies in time series. Subsequently, researchers tried to improve these techniques using (deep) neural networks. Given the growing number of anomaly detection methods, the body of research still lacks a broad comparative evaluation of statistical, machine learning, and deep learning methods. This paper studies 20 univariate anomaly detection methods from all three categories. The evaluation is conducted on publicly available datasets that serve as benchmarks for time-series anomaly detection. By analyzing the accuracy of each method as well as the computation time of the algorithms, we provide thorough insight into the performance of these anomaly detection approaches, along with a general indication of which method is suited to which type of data.
