Decision Tree Discovery for the Diagnosis of Type II Diabetes

description of patterns. In this study, decision tree method was used to predict patients with developing diabetes. The dataset used is the Pima Indians Diabetes Data Set, which collects the information of patients with and without developing diabetes. The study goes through two phases. The first phase is data preprocessing including attribute identification and selection, handling missing values, and numerical discretization. The second phase is a diabetes prediction model construction using the decision tree method. Weka software was used throughout all the phases of this study.

[1]  Jiang Ming-Yan,et al.  Diabetes expert system , 1997, 1997 IEEE International Conference on Intelligent Processing Systems (Cat. No.97TH8335).

[2]  Mohsen Beheshti,et al.  Diabetes Data Analysis and Prediction Model Discovery Using RapidMiner , 2008, 2008 Second International Conference on Future Generation Communication and Networking.

[3]  A. Santhakumaran,et al.  Impact of Preprocessing for Diagnosis of Diabetes Mellitus Using Artificial Neural Networks , 2010, 2010 Second International Conference on Machine Learning and Computing.

[4]  Marjan Khajehei,et al.  Data Mining and Medical Research Studies , 2010, 2010 Second International Conference on Computational Intelligence, Modelling and Simulation.

[5]  A. Santhakumaran,et al.  A Novel Classification Method for Diagnosis of Diabetes Mellitus Using Artificial Neural Networks , 2010, 2010 International Conference on Data Storage and Data Engineering.

[6]  S J Pöppl,et al.  Predicting Type 2 diabetes using an electronic nose-based artificial neural network analysis. , 2002, Diabetes, nutrition & metabolism.

[7]  Dimitris Kanellopoulos,et al.  Data Preprocessing for Supervised Leaning , 2007 .

[8]  Dorian Pyle,et al.  Data Preparation for Data Mining , 1999 .

[9]  Durga Toshniwal,et al.  Association Rule for Classification of Type-2 Diabetic Patients , 2010, 2010 Second International Conference on Machine Learning and Computing.

[10]  Ian Witten,et al.  Data Mining , 2000 .