Anomaly Detection using Decision Tree based Classifiers

as we know that with the help of Data mining techniques we can find out knowledge in terms of various characteristics and patterns. In this regard this paper presents finding out of anomalies/ outliers using various decision tree based classifiers viz. Best-first Decision Tree, Functional Tree, Logistic Model Tree, J48 and Random Forest decision tree. Three real world datasets has been used in this study. Theoretical analysis and experimental results shown that the Random Forest decision tree has outperformed other decision tree based classifiers of this study in terms of correct classification rate and kappa statistic. Keywordsanomaly, outlier, decision tree, classification

[1]  Haijia Shi Best-first Decision Tree Learning , 2007 .

[2]  Junbin Gao,et al.  Image Outlier Detection and Feature Extraction via L1-Norm-Based 2D Probabilistic PCA , 2015, IEEE Transactions on Image Processing.

[3]  Fabrizio Angiulli,et al.  Exploiting domain knowledge to detect outliers , 2013, Data Mining and Knowledge Discovery.

[4]  Tian Zhang,et al.  BIRCH: A New Data Clustering Algorithm and Its Applications , 1997, Data Mining and Knowledge Discovery.

[5]  João Gama,et al.  Functional Trees , 2001, Machine Learning.

[6]  Marco Vannucci,et al.  Outlier Detection Methods for Industrial Applications , 2008, ICRA 2008.

[7]  Eibe Frank,et al.  Logistic Model Trees , 2003, Machine Learning.

[8]  Jing Li,et al.  A Hierarchical Framework Using Approximated Local Outlier Factor for Efficient Anomaly Detection , 2013, ANT/SEIT.

[9]  Richi Nayak,et al.  A semi-supervised graph-based algorithm for detecting outliers in online-social-networks , 2013, SAC '13.

[10]  Mohamed A. Ismail,et al.  Fuzzy outlier analysis a combined clustering - outlier detection approach , 2007, 2007 IEEE International Conference on Systems, Man and Cybernetics.

[11]  Mohamed Bouguessa Clustering categorical data in projected spaces , 2013, Data Mining and Knowledge Discovery.

[12]  Reda Alhajj,et al.  Graph-based approach for outlier detection in sequential data and its application on stock market and weather data , 2014, Knowl. Based Syst..

[13]  Ashkan Sami,et al.  Entropy-based outlier detection using semi-supervised approach with few positive examples , 2014, Pattern Recognit. Lett..

[14]  Alfredo Ferro,et al.  Enhancing density-based clustering: Parameter reduction and outlier detection , 2013, Inf. Syst..

[15]  Durga Toshniwal,et al.  A Framework for Outlier Detection in Evolving Data Streams by Weighting Attributes in Clustering , 2012 .

[16]  Ji Zhang,et al.  Outlier detection from large distributed databases , 2013, World Wide Web.

[17]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[18]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[19]  Osman Hegazy,et al.  Outliers detection and classification in wireless sensor networks , 2013 .

[20]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[21]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[22]  Nishchal K. Verma,et al.  Clustering based outlier detection in fuzzy SVM , 2014, 2014 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).