A Novel Approach to Compute Confusion Matrix for Classification of n-Class Attributes with Feature Selection

The confusion matrix is a useful tool for measuring how well classifiers classify multi-class objects. Computing classification accuracy from a confusion matrix is straightforward for 2-class attributes but cumbersome for multi-class attributes. In this work, we propose a novel approach that transforms an n × n confusion matrix for n-class attributes into an equivalent 2 × 2 weighted average confusion matrix (WACM). The suitability of WACM is shown for a classification problem on a web service data set. We compute the accuracy of four classifiers, namely Naive Bayes (NB), Genetic Programming (GP), Instance Based Lazy Learner (IB1), and Decision Tree (J48), with and without feature selection. WACM is then applied to the confusion matrix obtained after feature selection, which further improves the classification accuracy.
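The abstract does not spell out how the n × n matrix is reduced, so the following is only an illustrative sketch under stated assumptions, not the authors' exact procedure. It assumes that each class is first collapsed into a one-vs-rest 2 × 2 matrix and that these matrices are then averaged with weights equal to each class's share of the samples. The function names `one_vs_rest`, `weighted_average_cm`, and `accuracy`, and the example counts, are hypothetical.

```python
def one_vs_rest(cm, k):
    """Collapse an n x n confusion matrix into a 2 x 2 matrix for class k.

    cm[i][j] is the number of samples of true class i predicted as class j.
    Returns [[TP, FN], [FP, TN]] with class k treated as the positive class.
    """
    n = len(cm)
    total = sum(sum(row) for row in cm)
    tp = cm[k][k]
    fn = sum(cm[k]) - tp                        # true k, predicted as another class
    fp = sum(cm[i][k] for i in range(n)) - tp   # another class, predicted as k
    tn = total - tp - fn - fp
    return [[tp, fn], [fp, tn]]


def weighted_average_cm(cm):
    """Average the per-class 2 x 2 matrices, weighted by class support.

    Assumption: the weight of class k is its share of all samples.
    """
    total = sum(sum(row) for row in cm)
    wacm = [[0.0, 0.0], [0.0, 0.0]]
    for k in range(len(cm)):
        weight = sum(cm[k]) / total
        binary = one_vs_rest(cm, k)
        for r in range(2):
            for c in range(2):
                wacm[r][c] += weight * binary[r][c]
    return wacm


def accuracy(m2):
    """Accuracy of a 2 x 2 matrix laid out as [[TP, FN], [FP, TN]]."""
    (tp, fn), (fp, tn) = m2
    return (tp + tn) / (tp + fn + fp + tn)


# Hypothetical 3-class confusion matrix (rows: true class, columns: predicted).
example_cm = [
    [5, 1, 0],
    [2, 3, 1],
    [0, 2, 6],
]
example_wacm = weighted_average_cm(example_cm)
print(example_wacm, accuracy(example_wacm))
```

Because the weights sum to one and every one-vs-rest matrix contains all samples, the entries of the resulting 2 × 2 matrix still sum to the total sample count, so the usual 2-class accuracy formula applies directly.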
