Ensemble Algorithms for Unsupervised Anomaly Detection

Many anomaly detection algorithms have been proposed in recent years, including density-based and rank-based algorithms. In this paper, we propose ensemble methods to improve the performance of these individual algorithms. We evaluate approaches that use score and rank aggregation for these algorithms. We also consider sequential methods in which one detection method is followed by the other. We use several datasets to evaluate the performance of the proposed ensemble methods. Our results show that sequential methods significantly improve the ability to detect anomalous data points.

[1]  Kagan Tumer,et al.  Classifier ensembles: Select real-world applications , 2008, Inf. Fusion.

[2]  Raman K. Mehra,et al.  Ensemble methods for anomaly detection and distributed intrusion detection in Mobile Ad-Hoc Networks , 2008, Inf. Fusion.

[3]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[4]  Carla E. Brodley,et al.  Anomaly Detection Using an Ensemble of Feature Models , 2010, 2010 IEEE International Conference on Data Mining.

[5]  Lawrence O. Hall,et al.  Anomaly Detection Using Ensembles , 2011, MCS.

[6]  Jian Tang,et al.  Enhancing Effectiveness of Outlier Detections for Low Density Patterns , 2002, PAKDD.

[7]  Arthur Zimek,et al.  Ensembles for unsupervised outlier detection: challenges and research questions a position paper , 2014, SKDD.

[8]  Jian Tang,et al.  Capabilities of outlier detection schemes in large datasets, framework and methodologies , 2006, Knowledge and Information Systems.

[9]  Kishan G. Mehrotra,et al.  Rank-based outlier detection , 2013 .

[10]  Anthony K. H. Tung,et al.  Ranking Outliers Using Symmetric Neighborhood Relationship , 2006, PAKDD.

[11]  Arthur Zimek,et al.  Subsampling for efficient and effective unsupervised outlier detection ensembles , 2013, KDD.

[12]  Tomás Pevný,et al.  Novelty detection in blind steganalysis , 2008, MM&Sec '08.

[13]  Kishan G. Mehrotra,et al.  Algorithms for Detecting Outliers via Clustering and Ranks , 2012, IEA/AIE.

[14]  Wenke Lee,et al.  Classification of packed executables for accurate computer virus detection , 2008, Pattern Recognit. Lett..

[15]  Hans-Peter Kriegel,et al.  LOF: identifying density-based local outliers , 2000, SIGMOD 2000.

[16]  Lior Rokach,et al.  Ensemble of Feature Chains for Anomaly Detection , 2013, MCS.