Comparison of Different Classification Techniques Using WEKA for Hematological Data

ABSTRAC : Medical professionals need a reliable prediction methodology to diagnose hematological data comments. There are large quantities of information about patients and their medical conditions. Generally, data mining (sometimes called data or knowledge discovery) is the process of analyzing data from different perspectives and summarizing it into useful information. Data mining software is one of a number of analytical tools for analyzing data. It allows users to analyze data from many different dimensions or angles, categorize it, and summarize the relationships identified. Weka is a data mining tools. It contains many machine leaning algorithms. It provides the facility to classify our data through various algorithms. Classification is an important data mining technique with broad applications. It classifies data of various kinds. Classification is used in every field of our life. Classification is used to classify each item in a set of data into one of predefined set of classes or groups. In this paper we are studying the various Classification algorithms. The thesis main aims to show the comparison of different classification algorithms using Waikato Environment for Knowledge Analysis or in short, WEKA and find out which algorithm is most suitable for user working on hematological data. To use propose model, new Doctor or patients can predict hematological data Comment also developed a mobile App that can easily diagnosis hematological data comments. The best algorithm based on the hematological data is J48 classifier with an accuracy of 97.16% and the total time taken to build the model is at 0.03 seconds. Naive Bayes classifier has the lowest average error at 29.71% compared to others.

[1]  Duangmanee Sanmun,et al.  The Observation Report of Red Blood Cell Morphology in Thailand Teenager by Using Data Mining Technique , 2014, Advances in hematology.

[2]  Kaushik H. Raviya,et al.  Performance Evaluation of Different Data Mining Classification Algorithm Using WEKA , 2012 .

[3]  Satish Kumar David,et al.  Comparative Analysis of Data Mining Tools and Classification Techniques using WEKA in Medical Bioinformatics , 2013 .

[4]  K. Olival,et al.  Hematology and serum chemistry reference values of stray dogs in Bangladesh , 2011, Open veterinary journal.

[5]  K. Rajesh,et al.  Application of Data Mining Methods and Techniques for Diabetes Diagnosis , 2012 .

[6]  Narendra Sharma,et al.  Comparison the various clustering algorithms of weka tools , 2012 .

[7]  Eyad H. Elshami,et al.  AUTOMATED DIAGNOSIS OF THALASSEMIA BASED ON DATAMINING CLASSIFIERS , 2012 .

[8]  Mahendra Tiwari,et al.  Performance analysis of Data Mining algorithms in Weka , 2012 .

[9]  Mohd Fauzi Othman,et al.  Comparison of different classification techniques using WEKA for breast cancer , 2007 .

[10]  Wenjing Zhang,et al.  Medical Diagnosis Data Mining Based on Improved Apriori Algorithm , 2014, J. Networks.

[11]  V. Vaithiyanathan,et al.  COMPARISON OF DIFFERENT CLASSIFICATION TECHNIQUES USING DIFFERENT DATASETS , 2013 .

[12]  S. Vijayarani,et al.  Comparative Analysis of Bayes and Lazy Classification Algorithms , 2013 .

[13]  Suresh B. Mudunuri,et al.  Performance Analysis and Evaluation of Different Data Mining Algorithms used for Cancer Classification , 2013 .

[14]  Nikhil N. Salvithal,et al.  Evaluating Performance of Data Mining Classification Algorithm in Weka , 2013 .