论文信息 - Anomaly Detection Based on Unsupervised Disentangled Representation Learning in Combination with Manifold Learning

Anomaly Detection Based on Unsupervised Disentangled Representation Learning in Combination with Manifold Learning

Identifying anomalous samples from highly complex and unstructured data is a crucial but challenging task in a variety of intelligent systems. In this paper, we present a novel deep anomaly detection framework named AnoDM (standing for Anomaly detection based on unsupervised Disentangled representation learning and Manifold learning). The disentanglement learning is currently implemented by β-VAE for automatically discovering interpretable factorized latent representations in a completely unsupervised manner. The manifold learning is realized by t-SNE for projecting the latent representations to a 2D map. We define a new anomaly score function by combining β-VAE’s reconstruction error in the raw feature space and local density estimation in the t-SNE space. AnoDM was evaluated on both image and time-series data and achieved better results than models that use just one of the two measures and other deep learning methods.

[1] Y. LeCun,et al. Learning methods for generic object recognition with invariance to pose and lighting , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[2] Robert P. W. Duin,et al. Support vector domain description , 1999, Pattern Recognit. Lett..

[3] Leland McInnes,et al. UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction , 2018, ArXiv.

[4] Sungzoon Cho,et al. Variational Autoencoder based Anomaly Detection using Reconstruction Probability , 2015 .

[5] Georg Langs,et al. Unsupervised Anomaly Detection with Generative Adversarial Networks to Guide Marker Discovery , 2017, IPMI.

[6] Yee Whye Teh,et al. A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[7] Lawrence D. Jackel,et al. Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[8] Martin Wattenberg,et al. How to Use t-SNE Effectively , 2016 .

[9] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..

[10] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009 .

[11] Marius Kloft,et al. Toward Supervised Anomaly Detection , 2014, J. Artif. Intell. Res..

[12] Yee Whye Teh,et al. Disentangling Disentanglement in Variational Autoencoders , 2018, ICML.

[13] Marius Kloft,et al. Image Anomaly Detection with Generative Adversarial Networks , 2018, ECML/PKDD.

[14] Yee Whye Teh,et al. Do Deep Generative Models Know What They Don't Know? , 2018, ICLR.

[15] Chris Dyer,et al. On the State of the Art of Evaluation in Neural Language Models , 2017, ICLR.

[16] Wojciech Zaremba,et al. An Empirical Exploration of Recurrent Network Architectures , 2015, ICML.

[17] Pascal Vincent,et al. Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18] Alioune Ngom,et al. A review on machine learning principles for multi-view biological data integration , 2016, Briefings Bioinform..

[19] Yann Dauphin,et al. Language Modeling with Gated Convolutional Networks , 2016, ICML.

[20] Charles C. Kemp,et al. A Multimodal Anomaly Detector for Robot-Assisted Feeding Using an LSTM-Based Variational Autoencoder , 2017, IEEE Robotics and Automation Letters.

[21] Vladlen Koltun,et al. An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling , 2018, ArXiv.

[22] Yifeng Li,et al. Exponential Family Restricted Boltzmann Machines and Annealed Importance Sampling , 2018, 2018 International Joint Conference on Neural Networks (IJCNN).

[23] See-Kiong Ng,et al. Anomaly Detection with Generative Adversarial Networks for Multivariate Time Series , 2018, ArXiv.

[24] Yoshua Bengio,et al. On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.

[25] Jürgen Schmidhuber,et al. LSTM: A Search Space Odyssey , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[26] Guillaume Desjardins,et al. Understanding disentangling in β-VAE , 2018, ArXiv.

[27] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.

[28] Christopher Leckie,et al. High-dimensional and large-scale anomaly detection using a linear one-class SVM with deep learning , 2016, Pattern Recognit..

[29] Hedvig Kjellström,et al. Advances in Variational Inference , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30] Christopher Burgess,et al. beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework , 2016, ICLR 2016.

[31] Geoffrey E. Hinton,et al. Visualizing Data using t-SNE , 2008 .

[32] Yuan Yu,et al. TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[33] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[34] Luca Antiga,et al. Automatic differentiation in PyTorch , 2017 .

[35] Roland Vollgraf,et al. Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms , 2017, ArXiv.

[36] Chong Wang,et al. Stochastic variational inference , 2012, J. Mach. Learn. Res..

[37] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[38] Yann Dauphin,et al. Convolutional Sequence to Sequence Learning , 2017, ICML.

[39] Yann Dauphin,et al. A Convolutional Encoder Model for Neural Machine Translation , 2016, ACL.

[40] Iluju Kiringa,et al. Exploring Deep Anomaly Detection Methods Based on Capsule Net , 2019, Canadian AI.