An Efficient Rule-Based Classification of Diabetes Using ID3, C4.5, & CART Ensembles

Conventional techniques for clinical decision support systems are based on a single classifier or simple combination of these classifiers used for disease diagnosis and prediction. Recently much attention has been paid on improving the performance of disease prediction by using ensemble-based methods. In this paper, we use multiple ensemble classification techniques for diabetes datasets. Three types of decision trees ID3, C4.5 and CART are used as the base classifiers. The ensemble techniques used are Majority Voting, Adaboost, Bayesian Boosting, Stacking and Bagging. Two benchmark diabetes datasets are used from UCI and Bio Stat repositories respectively. Experimental results and evaluation show that Bagging ensemble technique shows better performance as compared to single as well as other ensemble techniques.

[1]  Nilesh B. Prajapati,et al.  Study of Diabetes Prediction using Feature Selection and Classification , 2014 .

[2]  Rob Stocker,et al.  Using Decision Tree for Diagnosing Heart Disease Patients , 2011, AusDM.

[3]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.

[4]  Silvana Quaglini,et al.  Data mining techniques for analyzing stroke care processes , 2010, MedInfo.

[5]  S Vijiyarani,et al.  DISEASE PREDICTION IN DATA MINING TECHNIQUE – A SURVEY , 2013 .

[6]  Johan Gustav Bellika,et al.  Towards a mobile solution for predicting illness in Type 1 Diabetes Mellitus: Development of a prediction model for detecting risk of illness in Type 1 Diabetes prior to symptom onset , 2011, 2011 2nd International Conference on Wireless Communication, Vehicular Technology, Information Theory and Aerospace & Electronic Systems Technology (Wireless VITAE).

[7]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[8]  Aida Mustapha,et al.  A Hybrid Model of Hierarchical Clustering and Decision Tree for Rule-based Classification of Diabetic Patients , 2013 .

[9]  S Setayeshi,et al.  Diabetes Diagnosis by Using Computational Intelligence Algorithms , 2012 .

[10]  Rahmat Zolfaghari Islamic Diagnosis of Diabetes in Female Population of Pima Indian Heritage with Ensemble of BP Neural Network and SVM , 2012 .

[11]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[12]  Rolf Johansson,et al.  Ensemble Glucose Prediction in Insulin-Dependent Diabetes , 2014 .

[13]  Max Bramer,et al.  Principles of Data Mining , 2013, Undergraduate Topics in Computer Science.

[14]  Subramanian Appavu,et al.  An amalgam KNN to predict diabetes mellitus , 2013, 2013 IEEE International Conference ON Emerging Trends in Computing, Communication and Nanotechnology (ICECCN).

[15]  Nilesh B. Prajapati,et al.  Diabetes prediction using feature selection and classification , 2014 .

[16]  Abdulkadir Sengür,et al.  Effective diagnosis of heart disease through neural networks ensembles , 2009, Expert Syst. Appl..

[17]  Ruben D. Canlas Data Mining in Healthcare : Current Applications and Issues By , 2010 .

[18]  Christos Schizas,et al.  Region based Support Vector Machine algorithm for medical diagnosis on Pima Indian Diabetes dataset , 2012, 2012 IEEE 12th International Conference on Bioinformatics & Bioengineering (BIBE).

[19]  Sapna,et al.  DATA MINING – FUZZY NEURAL GENETIC ALGORITHM IN PREDICTING DIABETES , 2008 .

[20]  Lior Rokach,et al.  Ensemble-based classifiers , 2010, Artificial Intelligence Review.

[21]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[22]  R. Polikar,et al.  Ensemble based systems in decision making , 2006, IEEE Circuits and Systems Magazine.

[23]  K. Rajesh,et al.  Application of Data Mining Methods and Techniques for Diabetes Diagnosis , 2012 .

[24]  Asoke K. Nandi,et al.  Feature generation using genetic programming with comparative partner selection for diabetes classification , 2013, Expert Syst. Appl..

[25]  Sungyoung Lee,et al.  Prediction of Diabetes Mellitus Based on Boosting Ensemble Modeling , 2014, UCAmI.

[26]  B. Srinivasan,et al.  Predicting Diabetes by cosequencing the various Data Mining Classification Techniques , 2014 .

[27]  Janki Naik,et al.  Tumor Detection and Classification using Decision Tree in Brain MRI , 2013 .

[28]  Thomas Porter,et al.  Identifying Diabetic Patients: A Data Mining Approach , 2009, AMCIS.