Improved J48 Classification Algorithm for the Prediction of Diabetes

research work deals with efficient data mining procedure for predicting the diabetes from medical records of patients. Diabetes is a very common disease these days in all populations and in all age groups. Diabetes contributes to heart disease, increases the risks of developing kidney disease, nerve damage, blood vessel damage and blindness. So mining the diabetes data in efficient manner is a critical issue. The Pima Indians Diabetes Data Set is used in this paper; which collects the information of patients with and without having diabetes. The modified J48 classifier is used to increase the accuracy rate of the data mining procedure. The data mining tool WEKA has been used as an API of MATLAB for generating the J-48 classifiers. Experimental results showed a significant improvement over the existing J-48 algorithm. KeywordsDecision Tree, MATLAB, Data Mining, Diabetes, WEKA.

[1]  G. I. Salama,et al.  Experimental comparison of classifiers for breast cancer diagnosis , 2012, 2012 Seventh International Conference on Computer Engineering & Systems (ICCES).

[2]  R. Robu,et al.  Medical data mining with extended WEKA , 2012, 2012 IEEE 16th International Conference on Intelligent Engineering Systems (INES).

[3]  Diego Andina,et al.  Data mining for the diagnosis of type 2 diabetes , 2012, World Automation Congress 2012.

[4]  Zeljko Garaca,et al.  Mining demographic data with decision trees , 2010, The 33rd International Convention MIPRO.

[5]  Olaiya Folorunsho Comparative Study of Different Data Mining Techniques Performance in knowledge Discovery from Medical Database , 2013 .

[6]  Asmaa S. Hussein,et al.  Efficient Chronic Disease Diagnosis prediction and recommendation system , 2012, 2012 IEEE-EMBS Conference on Biomedical Engineering and Sciences.

[7]  Hamid Eslami Nosratabadi,et al.  Evaluating the success level of data mining projects based on CRISP-DM methodology by a Fuzzy expert system , 2011, 2011 3rd International Conference on Electronics Computer Technology.

[8]  Masahiro Takada,et al.  Comparison of robustness against missing values of alternative decision tree and multiple logistic regression for predicting clinical data in primary breast cancer , 2013, 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[9]  Asma A. Al Jarullah Decision tree discovery for the diagnosis of type II diabetes , 2011, 2011 International Conference on Innovations in Information Technology.

[10]  Yang Guo,et al.  Using Bayes Network for Prediction of Type-2 diabetes , 2012, 2012 International Conference for Internet Technology and Secured Transactions.

[11]  R. G. Ramani,et al.  Automatic prediction of Diabetic Retinopathy and Glaucoma through retinal image analysis and data mining techniques , 2012, 2012 International Conference on Machine Vision and Image Processing (MVIP).

[12]  Chien-Chung Chan,et al.  Predicting disease by using data mining based on healthcare information system , 2012, 2012 IEEE International Conference on Granular Computing.