相关论文

DeepAnT: A Deep Learning Approach for Unsupervised Anomaly Detection in Time Series

Abstract:Traditional distance and density-based anomaly detection techniques are unable to detect periodic and seasonality related point anomalies which occur commonly in streaming data, leaving a big gap in time series anomaly detection in the current era of the IoT. To address this problem, we present a novel deep learning-based anomaly detection approach (DeepAnT) for time series data, which is equally applicable to the non-streaming cases. DeepAnT is capable of detecting a wide range of anomalies, i.e., point anomalies, contextual anomalies, and discords in time series data. In contrast to the anomaly detection methods where anomalies are learned, DeepAnT uses unlabeled data to capture and learn the data distribution that is used to forecast the normal behavior of a time series. DeepAnT consists of two modules: time series predictor and anomaly detector. The time series predictor module uses deep convolutional neural network (CNN) to predict the next time stamp on the defined horizon. This module takes a window of time series (used as a context) and attempts to predict the next time stamp. The predicted value is then passed to the anomaly detector module, which is responsible for tagging the corresponding time stamp as normal or abnormal. DeepAnT can be trained even without removing the anomalies from the given data set. Generally, in deep learning-based approaches, a lot of data are required to train a model. Whereas in DeepAnT, a model can be trained on relatively small data set while achieving good generalization capabilities due to the effective parameter sharing of the CNN. As the anomaly detection in DeepAnT is unsupervised, it does not rely on anomaly labels at the time of model generation. Therefore, this approach can be directly applied to real-life scenarios where it is practically impossible to label a big stream of data coming from heterogeneous sensors comprising of both normal as well as anomalous points. We have performed a detailed evaluation of 15 algorithms on 10 anomaly detection benchmarks, which contain a total of 433 real and synthetic time series. Experiments show that DeepAnT outperforms the state-of-the-art anomaly detection methods in most of the cases, while performing on par with others.

参考文献

[1]  Jürgen Schmidhuber,et al.  Applying LSTM to Time Series Predictable through Time-Window Approaches , 2000, ICANN.

[2]  Lovekesh Vig,et al.  Long Short Term Memory Networks for Anomaly Detection in Time Series , 2015, ESANN.

[3]  Markus Goldstein,et al.  Anomaly Detection in Large Datasets , 2014 .

[4]  Anthony K. H. Tung,et al.  Ranking Outliers Using Symmetric Neighborhood Relationship , 2006, PAKDD.

[5]  Charu C. Aggarwal,et al.  An Introduction to Outlier Analysis , 2013 .

[6]  Qin Yu,et al.  An Improved ARIMA-Based Traffic Anomaly Detection Algorithm for Wireless Sensor Networks , 2016, Int. J. Distributed Sens. Networks.

[7]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[8]  Hans-Peter Kriegel,et al.  LOF: identifying density-based local outliers , 2000, SIGMOD '00.

[9]  Nidhi Singh,et al.  Demystifying Numenta anomaly benchmark , 2017, 2017 International Joint Conference on Neural Networks (IJCNN).

[10]  Subutai Ahmad,et al.  Unsupervised real-time anomaly detection for streaming data , 2017, Neurocomputing.

[11]  Forrest N. Iandola,et al.  SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size , 2016, ArXiv.

[12]  Saeed Amizadeh,et al.  Generic and Scalable Framework for Automated Time-series Anomaly Detection , 2015, KDD.

[13]  Zengyou He,et al.  Discovering cluster-based local outliers , 2003, Pattern Recognit. Lett..

[14]  Roland Kwitt Robust Methods for Unsupervised PCA-based Anomaly Detection , 2006 .

[15]  Eric Séverin,et al.  Predicting corporate bankruptcy using a self-organizing map: An empirical study to improve the forecasting horizon of a financial failure model , 2011, Decis. Support Syst..

[16]  Charles Elkan,et al.  Learning to Diagnose with LSTM Recurrent Neural Networks , 2015, ICLR.

[17]  Andreas Dengel,et al.  Histogram-based Outlier Score (HBOS): A fast Unsupervised Anomaly Detection Algorithm , 2012 .

[18]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[19]  Jian Tang,et al.  Enhancing Effectiveness of Outlier Detections for Low Density Patterns , 2002, PAKDD.

[20]  Robert P. W. Duin,et al.  Support Vector Data Description , 2004, Machine Learning.

[21]  Graham J. Williams,et al.  On-Line Unsupervised Outlier Detection Using Finite Mixtures with Discounting Learning Algorithms , 2000, KDD '00.

[22]  Yi Zheng,et al.  Time Series Classification Using Multi-Channels Deep Convolutional Neural Networks , 2014, WAIM.

[23]  M. Shyu,et al.  A Novel Anomaly Detection Scheme Based on Principal Component Classifier , 2003 .

[24]  Longbing Cao,et al.  SVDD-based outlier detection on uncertain data , 2012, Knowledge and Information Systems.

[25]  Keun Ho Ryu,et al.  Unsupervised Novelty Detection Using Deep Autoencoders with Density Based Clustering , 2018, Applied Sciences.

[26]  Eamonn J. Keogh,et al.  Disk aware discord discovery: finding unusual time series in terabyte sized datasets , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[27]  Biswajit Basu,et al.  Real-Time Traffic Flow Forecasting Using Spectral Analysis , 2012, IEEE Transactions on Intelligent Transportation Systems.

[28]  Quoc V. Le,et al.  Neural Architecture Search with Reinforcement Learning , 2016, ICLR.

[29]  Markus Schneider,et al.  Expected similarity estimation for large-scale batch and streaming anomaly detection , 2016, Machine Learning.

[30]  F. E. Grubbs Procedures for Detecting Outlying Observations in Samples , 1969 .

[31]  David L. Woodruff,et al.  Identification of Outliers in Multivariate Data , 1996 .

[32]  Sridhar Ramaswamy,et al.  Efficient algorithms for mining outliers from large data sets , 2000, SIGMOD '00.

[33]  Slim Abdennadher,et al.  Enhancing one-class support vector machines for unsupervised anomaly detection , 2013, ODD '13.

[34]  Su Fong Chien,et al.  ARIMA Based Network Anomaly Detection , 2010, 2010 Second International Conference on Communication Software and Networks.

[35]  Eamonn J. Keogh,et al.  HOT SAX: efficiently finding the most unusual time series subsequence , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[36]  Phyks Introducing practical and robust anomaly detection in a time series | Twitter Blogs , 2015 .

[37]  Michael E. Fitzpatrick,et al.  Detecting anomalies in time series data via a deep learning algorithm combining wavelets, neural networks and Hilbert transform , 2017, Expert Syst. Appl..

[38]  Andreas Dengel,et al.  Detection of Anomalies in Large Scale Accounting Data using Deep Autoencoder Networks , 2017, ArXiv.

[39]  Ryan P. Adams,et al.  Bayesian Online Changepoint Detection , 2007, 0710.3742.

[40]  Moa Samuelsson Detecting Anomalies In Time Series Data , 2016 .

[41]  Ahmad Lotfi,et al.  Anomaly Detection in Activities of Daily Living Using One-Class Support Vector Machine , 2018, UKCI.

[42]  B. Rosner Percentage Points for a Generalized ESD Many-Outlier Procedure , 1983 .

[43]  Nicolas Goix,et al.  How to Evaluate the Quality of Unsupervised Anomaly Detection Algorithms? , 2016, ArXiv.

[44]  Bernhard Schölkopf,et al.  Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.

[45]  Lovekesh Vig,et al.  Anomaly detection in ECG time signals via deep long short-term memory networks , 2015, 2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA).

[46]  Subutai Ahmad,et al.  Evaluating Real-Time Anomaly Detection Algorithms -- The Numenta Anomaly Benchmark , 2015, 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA).

[47]  Vanish Talwar,et al.  Statistical techniques for online anomaly detection in data centers , 2011, 12th IFIP/IEEE International Symposium on Integrated Network Management (IM 2011) and Workshops.

[48]  Zhi-Hua Zhou,et al.  Isolation Forest , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[49]  Zhiwei Ji,et al.  Detecting Anomalies in Time Series Data via a Meta-Feature Based Approach , 2018, IEEE Access.

[50]  Chandan Srivastava,et al.  Support Vector Data Description , 2011 .

[51]  Irma J. Terpenning,et al.  STL : A Seasonal-Trend Decomposition Procedure Based on Loess , 1990 .

引用
A Comparative Analysis of Traditional and Deep Learning-Based Anomaly Detection Methods for Streaming Data
2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)
2019
Deep anomaly detection for industrial systems: a case study
Annual Conference of the PHM Society
2020
Unsupervised Anomaly Detection in Multivariate Spatio-Temporal Data Using Deep Learning: Early Detection of COVID-19 Outbreak in Italy
IEEE Access
2020
Denoising Adversarial Autoencoder for Obfuscated Traffic Detection and Recovery
MLN
2019
Confident Classification Using a Hybrid Between Deterministic and Probabilistic Convolutional Neural Networks
IEEE Access
2020
Loners Stand Out. Identification of Anomalous Subsequences Based on Group Performance
ADMA
2020
Stock Index Prices Prediction via Temporal Pattern Attention and Long-Short-Term Memory
Adv. Multim.
2020
Development of an anomaly detection method with a novel hidden semi-Markov model incorporating unlearned states
2020 IEEE/SICE International Symposium on System Integration (SII)
2020
Time Series Encodings with Temporal Convolutional Networks
BIOMA
2020
FuseAD: Unsupervised Anomaly Detection in Streaming Sensors Data by Fusing Statistical and Deep Learning Models
Sensors
2019
Learning Kullback-Leibler Divergence-Based Gaussian Model for Multivariate Time Series Classification
IEEE Access
2019
Monitoring Routine Changes From Daily Smartphone Usage
2020 IEEE International Conference on Human-Machine Systems (ICHMS)
2020
Effective Attack Detection in Internet of Medical Things Smart Environment Using a Deep Belief Neural Network
IEEE Access
2020
Deep Anomaly Detection for Time-Series Data in Industrial IoT: A Communication-Efficient On-Device Federated Learning Approach
IEEE Internet of Things Journal
2020
A Review on Outlier/Anomaly Detection in Time Series Data
ACM Comput. Surv.
2020
An effective disease prediction system using incremental feature selection and temporal convolutional neural network
J. Ambient Intell. Humaniz. Comput.
2020
Unsupervised Anomaly Detection Using Intelligent and Heterogeneous Autonomous Systems: Highlights From the 2020 IEEE Signal Processing Cup Student Competition [SP Competitions]
IEEE Signal Process. Mag.
2020
TSXplain: Demystification of DNN Decisions for Time-Series using Natural Language and Statistical Features
ICANN
2019
Bioinspired Optimization Methods and Their Applications: 9th International Conference, BIOMA 2020, Brussels, Belgium, November 19–20, 2020, Proceedings
BIOMA
2020
The Integration of Azure Sphere and Azure Cloud Services for Internet of Things
2019