Outlier detection via a soft computing hybrid

Outlier detection has been attracting the data analysts in almost every domain for a long time now because their detection is very challenging. Outliers or novel cases need to be detected before any analysis is performed on data set. Depending upon the domain, outlier detection saves a lot of time, money or both. In this paper, we developed a novel outlier detection model using ensembling technique, in the paradigm of soft computing, which includes four algorithms, namely k-Reverse Nearest Neighbor (kRNN), Auto Associative Neural Network (AANN), Counter Propagation Auto Association Neural Network (CPAANN), and Generalized Regression Auto Association Neural network (GRAANN) as constituents. The ensemble takes the union of all the outliers found by the four techniques.

[1]  Marjana Novič,et al.  Counter-propagation neural networks in Matlab , 2008 .

[2]  Vadlamani Ravi,et al.  A new online data imputation method based on general regression auto associative neural network , 2014, Neurocomputing.

[3]  F. E. Grubbs Sample Criteria for Testing Outlying Observations , 1950 .

[4]  Yumin Chen,et al.  Neighborhood outlier detection , 2010, Expert Syst. Appl..

[5]  Nathalie Japkowicz,et al.  A Novelty Detection Approach to Classification , 1995, IJCAI.

[6]  R. Hecht-Nielsen Counterpropagation networks. , 1987, Applied optics.

[7]  José R. Dorronsoro,et al.  Neural fraud detection in credit card operations , 1997, IEEE Trans. Neural Networks.

[8]  Dean Abbott,et al.  An evaluation of high-end data mining tools for fraud detection , 1998, SMC'98 Conference Proceedings. 1998 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.98CH36218).

[9]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[10]  J. Hollmen,et al.  Residual generation and visualization for understanding novel process conditions , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[11]  Bernd Freisleben,et al.  CARDWATCH: a neural network based database mining system for credit card fraud detection , 1997, Proceedings of the IEEE/IAFE 1997 Computational Intelligence for Financial Engineering (CIFEr).

[12]  Hongxing He,et al.  Outlier Detection Using Replicator Neural Networks , 2002, DaWaK.

[13]  Kamalakar Karlapalem,et al.  A Simple Yet Effective Data Clustering Algorithm , 2006, Sixth International Conference on Data Mining (ICDM'06).

[14]  Shuchita Upadhyaya,et al.  Outlier Detection: Applications And Techniques , 2012 .

[15]  Donald F. Specht,et al.  A general regression neural network , 1991, IEEE Trans. Neural Networks.

[16]  M. J. Desforges,et al.  Applications of probability density estimation to the detection of abnormal conditions in engineering , 1998 .

[17]  Ahmed H. Tewfik,et al.  Robust clustering of acoustic emission signals using the Kohonen network , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[18]  Victoria J. Hodge,et al.  A Survey of Outlier Detection Methodologies , 2004, Artificial Intelligence Review.

[19]  Risto Miikkulainen,et al.  Intrusion Detection with Neural Networks , 1997, NIPS.