Paper Information - Deep Reinforcement Learning for Unknown Anomaly Detection

Deep Reinforcement Learning for Unknown Anomaly Detection

We address a critical yet largely unsolved anomaly detection problem, in which we aim to learn detection models from a small set of partially labeled anomalies and a large-scale unlabeled dataset. This is a common scenario in many important applications. Existing related methods either proceed unsupervised with the unlabeled data, or exclusively fit the limited anomaly examples that often do not span the entire set of anomalies. We propose here instead a deep reinforcement-learning-based approach that actively seeks novel classes of anomaly that lie beyond the scope of the labeled training data. This approach learns to balance exploiting its existing data model against exploring for new classes of anomaly. It is thus able to exploit the labeled anomaly data to improve detection accuracy, without limiting the set of anomalies sought to those given anomaly examples. This is of significant practical benefit, as anomalies are inevitably unpredictable in form and often expensive to miss. Extensive experiments on 48 real-world datasets show that our approach significantly outperforms five state-of-the-art competing methods.
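The exploit/explore balance described in the abstract can be illustrated with a small toy sketch. This is not the paper's algorithm: the function names, the additive reward, the novelty score in [0, 1], and the epsilon-greedy rule are all assumptions chosen only to make the trade-off concrete.

```python
import random


def combined_reward(is_labeled_anomaly, flagged, novelty):
    """Hypothetical reward: an exploitation term plus an exploration term.

    is_labeled_anomaly: True if the observation is one of the few labeled anomalies.
    flagged: True if the agent marked the observation as anomalous.
    novelty: assumed score in [0, 1] from any unsupervised detector.
    """
    # Exploitation: reward flagging an anomaly we already know about.
    extrinsic = 1.0 if (flagged and is_labeled_anomaly) else 0.0
    # Exploration: reward attention to unlabeled observations that look unusual,
    # so the agent is not restricted to the labeled anomaly classes.
    intrinsic = 0.0 if is_labeled_anomaly else novelty
    return extrinsic + intrinsic


def epsilon_greedy(q_values, epsilon, rng):
    """Pick a random action with probability epsilon, otherwise the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


if __name__ == "__main__":
    rng = random.Random(0)
    # Action 0 = "normal", action 1 = "anomalous" (an assumed encoding).
    action = epsilon_greedy([0.1, 0.9], epsilon=0.1, rng=rng)
    print(action, combined_reward(False, action == 1, novelty=0.3))
```

In this sketch a larger `epsilon` or a larger weight on the novelty term pushes the agent toward unfamiliar observations, while a smaller one keeps it close to the labeled examples.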
