A Comparative Study for Various Methods of Classification

This paper discusses data mining techniques to process a medical dataset and identify the relevance of liver disorder and drinking alcohol drink by classification of blood test data. We have used four different classification methods including decision tree, Bayesian algorithms (Naive Bayes and Bayesian Networks), Neural Network classification and Rough Sets methods. To evaluate the methods, we have used the Waikato Environment for Knowledge Analysis (WEKA) open source tool. WEKA is a collection of machine learning algorithms that can be used for different processing tasks such as classification, and clustering. Bayesian algorithms and Neural Network classification methods are implemented with WEKA. However, as WEKA does not support methods based on Rough Sets we have used Rosetta. We have provided an evaluation based on applying these classification methods to our dataset and measuring the accuracy of test results. The evaluation results show that using Neural Networks obtains the best result among the other methods.

[1]  GuoQiang An Effective Algorithm for Improving the Performance of Naive Bayes for Text Classification , 2010 .

[2]  Guo Qiang An Effective Algorithm for Improving the Performance of Naive Bayes for Text Classification , 2010, 2010 Second International Conference on Computer Research and Development.

[3]  Diana Avellaneda Avellaneda,et al.  Natural Texture Classification: A Neural Network Models Benchmark , 2009, 2009 Mexican International Conference on Computer Science.

[4]  Jiye Liang,et al.  The Information Entropy, Rough Entropy And Knowledge Granulation In Rough Set Theory , 2004, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[5]  Haifeng Guo,et al.  Data Mining Techniques for Complex Formation Evaluation in Petroleum Exploration and Production: A Comparison of Feature Selection and Classification Methods , 2008, 2008 IEEE Pacific-Asia Workshop on Computational Intelligence and Industrial Application.

[6]  James F. Peters,et al.  Towards a Software Change Classification System: A Rough Set Approach , 2003, Software Quality Journal.

[7]  Ryohei Nakatsu,et al.  Emotion Recognition in Speech Using Neural Networks , 2000, Neural Computing & Applications.

[8]  Ehab Al-Shaer,et al.  Analysis of Firewall Policy Rules Using Data Mining Techniques , 2006, 2006 IEEE/IFIP Network Operations and Management Symposium NOMS 2006.

[9]  Ayesha H. Butalia,et al.  Applications of Rough Sets in the Field of Data Mining , 2008, 2008 First International Conference on Emerging Trends in Engineering and Technology.

[10]  T. Justin,et al.  Comparison of different classification methods for emotion recognition , 2010, The 33rd International Convention MIPRO.