MAD-GAN: Multivariate Anomaly Detection for Time Series Data with Generative Adversarial Networks
Abstract:The prevalence of networked sensors and actuators in many real-world systems such as smart buildings, factories, power plants, and data centers generate substantial amounts of multivariate time series data for these systems. The rich sensor data can be continuously monitored for intrusion events through anomaly detection. However, conventional threshold-based anomaly detection methods are inadequate due to the dynamic complexities of these systems, while supervised machine learning methods are unable to exploit the large amounts of data due to the lack of labeled data. On the other hand, current unsupervised machine learning approaches have not fully exploited the spatial-temporal correlation and other dependencies amongst the multiple variables (sensors/actuators) in the system for detecting anomalies. In this work, we propose an unsupervised multivariate anomaly detection method based on Generative Adversarial Networks (GANs). Instead of treating each data stream independently, our proposed MAD-GAN framework considers the entire variable set concurrently to capture the latent interactions amongst the variables. We also fully exploit both the generator and discriminator produced by the GAN, using a novel anomaly score called DR-score to detect anomalies by discrimination and reconstruction. We have tested our proposed MAD-GAN using two recent datasets collected from real-world CPS: the Secure Water Treatment (SWaT) and the Water Distribution (WADI) datasets. Our experimental results showed that the proposed MAD-GAN is effective in reporting anomalies caused by various cyber-intrusions compared in these complex real-world systems.
暂无分享,去 创建一个
[1] Jiawei Han,et al. Data Mining: Concepts and Techniques , 2000 .
[2] Hans-Peter Kriegel,et al. LOF: identifying density-based local outliers , 2000, SIGMOD 2000.
[3] Clara Pizzuti,et al. Fast Outlier Detection in High Dimensional Spaces , 2002, PKDD.
[4] I. Jolliffe. Principal Component Analysis , 2002 .
[5] Vipin Kumar,et al. Feature bagging for outlier detection , 2005, KDD '05.
[6] Bernhard Schölkopf,et al. A Kernel Method for the Two-Sample-Problem , 2006, NIPS.
[7] Hans-Peter Kriegel,et al. Angle-based outlier detection in high-dimensional data , 2008, KDD.
[8] Vipin Kumar,et al. Comparative Evaluation of Anomaly Detection Techniques for Sequence Data , 2008, 2008 Eighth IEEE International Conference on Data Mining.
[9] Heng Tao Shen,et al. Principal Component Analysis , 2009, Encyclopedia of Biometrics.
[10] Zhiwei Gao,et al. From Model, Signal to Knowledge: A Data-Driven Perspective of Fault Detection and Diagnosis , 2013, IEEE Transactions on Industrial Informatics.
[11] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[12] Peter B. Luh,et al. Building Energy Doctors: An SPC and Kalman Filter-Based Method for System-Level Fault Detection in HVAC Systems , 2014, IEEE Transactions on Automation Science and Engineering.
[13] Jin Wen,et al. A model-based fault detection and diagnostic methodology based on PCA method and wavelet transform , 2014 .
[14] Tim Oates,et al. Adversarial feature selection , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).
[15] Hazem Nounou,et al. PLS-based EWMA fault detection strategy for process monitoring , 2015 .
[16] Richard S. Zemel,et al. Generative Moment Matching Networks , 2015, ICML.
[17] Sridhar Adepu,et al. A Dataset to Support Research in the Design of Secure Water Treatment Systems , 2016, CRITIS.
[18] Patrick P. K. Chan,et al. Adversarial Feature Selection Against Evasion Attacks , 2016, IEEE Transactions on Cybernetics.
[19] Wojciech Zaremba,et al. Improved Techniques for Training GANs , 2016, NIPS.
[20] Nils Ole Tippenhauer,et al. SWaT: a water treatment testbed for research and training on ICS security , 2016, 2016 International Workshop on Cyber-physical Systems for Smart Water Networks (CySWater).
[21] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.
[22] Minh N. Do,et al. Semantic Image Inpainting with Perceptual and Contextual Losses , 2016, ArXiv.
[23] Costas J. Spanos,et al. Data-driven event detection with partial knowledge: A Hidden Structure Semi-Supervised learning method , 2016, 2016 American Control Conference (ACC).
[24] Guoqiang Hu,et al. A data-driven strategy for detection and diagnosis of building chiller faults using linear discriminant analysis , 2016 .
[25] Olof Mogren,et al. C-RNN-GAN: Continuous recurrent neural networks with adversarial training , 2016, ArXiv.
[26] Sridhar Adepu,et al. Anomaly Detection in Cyber Physical Systems Using Recurrent Neural Networks , 2017, 2017 IEEE 18th International Symposium on High Assurance Systems Engineering (HASE).
[27] Aditya P. Mathur,et al. WADI: a water distribution testbed for research in the design of secure cyber physical systems , 2017, CySWATER@CPSWeek.
[28] Qianlong Wang,et al. A Latent Space Understandable Generative Adversarial Network: SelfExGAN , 2017, 2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA).
[29] Jinoh Kim,et al. A survey of deep learning-based network anomaly detection , 2017, Cluster Computing.
[30] Yiming Yang,et al. MMD GAN: Towards Deeper Understanding of Moment Matching Network , 2017, NIPS.
[31] Gunnar Rätsch,et al. Real-valued (Medical) Time Series Generation with Recurrent Conditional GANs , 2017, ArXiv.
[32] Georg Langs,et al. Unsupervised Anomaly Detection with Generative Adversarial Networks to Guide Marker Discovery , 2017, IPMI.
[33] Tao Xu,et al. SegAN: Adversarial Network with Multi-scale L1 Loss for Medical Image Segmentation , 2017, Neuroinformatics.
[34] Charu C. Aggarwal,et al. NetWalk: A Flexible Deep Embedding Approach for Anomaly Detection in Dynamic Networks , 2018, KDD.
[35] Asaf Shabtai,et al. Using LSTM Encoder-Decoder Algorithm for Detecting Anomalous ADS-B Messages , 2017, Comput. Secur..
[36] Chuan Sheng Foo,et al. Efficient GAN-Based Anomaly Detection , 2018, ArXiv.
[37] W. Wang,et al. NetWalk , 2018, Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining.
[38] Bo Zong,et al. Deep Autoencoding Gaussian Mixture Model for Unsupervised Anomaly Detection , 2018, ICLR.
[39] Yang Ning,et al. Efficient augmentation and relaxation learning for individualized treatment rules using observational data , 2019, J. Mach. Learn. Res..
[40] Yue Zhao,et al. PyOD: A Python Toolbox for Scalable Outlier Detection , 2019, J. Mach. Learn. Res..
[41] David M. W. Powers,et al. Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation , 2011, ArXiv.