DeepAnT: A Deep Learning Approach for Unsupervised Anomaly Detection in Time Series
Abstract:Traditional distance and density-based anomaly detection techniques are unable to detect periodic and seasonality related point anomalies which occur commonly in streaming data, leaving a big gap in time series anomaly detection in the current era of the IoT. To address this problem, we present a novel deep learning-based anomaly detection approach (DeepAnT) for time series data, which is equally applicable to the non-streaming cases. DeepAnT is capable of detecting a wide range of anomalies, i.e., point anomalies, contextual anomalies, and discords in time series data. In contrast to the anomaly detection methods where anomalies are learned, DeepAnT uses unlabeled data to capture and learn the data distribution that is used to forecast the normal behavior of a time series. DeepAnT consists of two modules: time series predictor and anomaly detector. The time series predictor module uses deep convolutional neural network (CNN) to predict the next time stamp on the defined horizon. This module takes a window of time series (used as a context) and attempts to predict the next time stamp. The predicted value is then passed to the anomaly detector module, which is responsible for tagging the corresponding time stamp as normal or abnormal. DeepAnT can be trained even without removing the anomalies from the given data set. Generally, in deep learning-based approaches, a lot of data are required to train a model. Whereas in DeepAnT, a model can be trained on relatively small data set while achieving good generalization capabilities due to the effective parameter sharing of the CNN. As the anomaly detection in DeepAnT is unsupervised, it does not rely on anomaly labels at the time of model generation. Therefore, this approach can be directly applied to real-life scenarios where it is practically impossible to label a big stream of data coming from heterogeneous sensors comprising of both normal as well as anomalous points. We have performed a detailed evaluation of 15 algorithms on 10 anomaly detection benchmarks, which contain a total of 433 real and synthetic time series. Experiments show that DeepAnT outperforms the state-of-the-art anomaly detection methods in most of the cases, while performing on par with others.
暂无分享,去 创建一个
[1] Jürgen Schmidhuber,et al. Applying LSTM to Time Series Predictable through Time-Window Approaches , 2000, ICANN.
[2] Lovekesh Vig,et al. Long Short Term Memory Networks for Anomaly Detection in Time Series , 2015, ESANN.
[3] Markus Goldstein,et al. Anomaly Detection in Large Datasets , 2014 .
[4] Anthony K. H. Tung,et al. Ranking Outliers Using Symmetric Neighborhood Relationship , 2006, PAKDD.
[5] Charu C. Aggarwal,et al. An Introduction to Outlier Analysis , 2013 .
[6] Qin Yu,et al. An Improved ARIMA-Based Traffic Anomaly Detection Algorithm for Wireless Sensor Networks , 2016, Int. J. Distributed Sens. Networks.
[7] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[8] Hans-Peter Kriegel,et al. LOF: identifying density-based local outliers , 2000, SIGMOD '00.
[9] Nidhi Singh,et al. Demystifying Numenta anomaly benchmark , 2017, 2017 International Joint Conference on Neural Networks (IJCNN).
[10] Subutai Ahmad,et al. Unsupervised real-time anomaly detection for streaming data , 2017, Neurocomputing.
[11] Forrest N. Iandola,et al. SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size , 2016, ArXiv.
[12] Saeed Amizadeh,et al. Generic and Scalable Framework for Automated Time-series Anomaly Detection , 2015, KDD.
[13] Zengyou He,et al. Discovering cluster-based local outliers , 2003, Pattern Recognit. Lett..
[14] Roland Kwitt. Robust Methods for Unsupervised PCA-based Anomaly Detection , 2006 .
[15] Eric Séverin,et al. Predicting corporate bankruptcy using a self-organizing map: An empirical study to improve the forecasting horizon of a financial failure model , 2011, Decis. Support Syst..
[16] Charles Elkan,et al. Learning to Diagnose with LSTM Recurrent Neural Networks , 2015, ICLR.
[17] Andreas Dengel,et al. Histogram-based Outlier Score (HBOS): A fast Unsupervised Anomaly Detection Algorithm , 2012 .
[18] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[19] Jian Tang,et al. Enhancing Effectiveness of Outlier Detections for Low Density Patterns , 2002, PAKDD.
[20] Robert P. W. Duin,et al. Support Vector Data Description , 2004, Machine Learning.
[21] Graham J. Williams,et al. On-Line Unsupervised Outlier Detection Using Finite Mixtures with Discounting Learning Algorithms , 2000, KDD '00.
[22] Yi Zheng,et al. Time Series Classification Using Multi-Channels Deep Convolutional Neural Networks , 2014, WAIM.
[23] M. Shyu,et al. A Novel Anomaly Detection Scheme Based on Principal Component Classifier , 2003 .
[24] Longbing Cao,et al. SVDD-based outlier detection on uncertain data , 2012, Knowledge and Information Systems.
[25] Keun Ho Ryu,et al. Unsupervised Novelty Detection Using Deep Autoencoders with Density Based Clustering , 2018, Applied Sciences.
[26] Eamonn J. Keogh,et al. Disk aware discord discovery: finding unusual time series in terabyte sized datasets , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).
[27] Biswajit Basu,et al. Real-Time Traffic Flow Forecasting Using Spectral Analysis , 2012, IEEE Transactions on Intelligent Transportation Systems.
[28] Quoc V. Le,et al. Neural Architecture Search with Reinforcement Learning , 2016, ICLR.
[29] Markus Schneider,et al. Expected similarity estimation for large-scale batch and streaming anomaly detection , 2016, Machine Learning.
[30] F. E. Grubbs. Procedures for Detecting Outlying Observations in Samples , 1969 .
[31] David L. Woodruff,et al. Identification of Outliers in Multivariate Data , 1996 .
[32] Sridhar Ramaswamy,et al. Efficient algorithms for mining outliers from large data sets , 2000, SIGMOD '00.
[33] Slim Abdennadher,et al. Enhancing one-class support vector machines for unsupervised anomaly detection , 2013, ODD '13.
[34] Su Fong Chien,et al. ARIMA Based Network Anomaly Detection , 2010, 2010 Second International Conference on Communication Software and Networks.
[35] Eamonn J. Keogh,et al. HOT SAX: efficiently finding the most unusual time series subsequence , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).
[36] Phyks. Introducing practical and robust anomaly detection in a time series | Twitter Blogs , 2015 .
[37] Michael E. Fitzpatrick,et al. Detecting anomalies in time series data via a deep learning algorithm combining wavelets, neural networks and Hilbert transform , 2017, Expert Syst. Appl..
[38] Andreas Dengel,et al. Detection of Anomalies in Large Scale Accounting Data using Deep Autoencoder Networks , 2017, ArXiv.
[39] Ryan P. Adams,et al. Bayesian Online Changepoint Detection , 2007, 0710.3742.
[40] Moa Samuelsson. Detecting Anomalies In Time Series Data , 2016 .
[41] Ahmad Lotfi,et al. Anomaly Detection in Activities of Daily Living Using One-Class Support Vector Machine , 2018, UKCI.
[42] B. Rosner. Percentage Points for a Generalized ESD Many-Outlier Procedure , 1983 .
[43] Nicolas Goix,et al. How to Evaluate the Quality of Unsupervised Anomaly Detection Algorithms? , 2016, ArXiv.
[44] Bernhard Schölkopf,et al. Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.
[45] Lovekesh Vig,et al. Anomaly detection in ECG time signals via deep long short-term memory networks , 2015, 2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA).
[46] Subutai Ahmad,et al. Evaluating Real-Time Anomaly Detection Algorithms -- The Numenta Anomaly Benchmark , 2015, 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA).
[47] Vanish Talwar,et al. Statistical techniques for online anomaly detection in data centers , 2011, 12th IFIP/IEEE International Symposium on Integrated Network Management (IM 2011) and Workshops.
[48] Zhi-Hua Zhou,et al. Isolation Forest , 2008, 2008 Eighth IEEE International Conference on Data Mining.
[49] Zhiwei Ji,et al. Detecting Anomalies in Time Series Data via a Meta-Feature Based Approach , 2018, IEEE Access.
[50] Chandan Srivastava,et al. Support Vector Data Description , 2011 .
[51] Irma J. Terpenning,et al. STL : A Seasonal-Trend Decomposition Procedure Based on Loess , 1990 .