Diagnosis of diabetes mellitus using artificial neural network and classification and regression tree optimized with genetic algorithm

Diabetes mellitus is one of the most important public health problems affecting millions of people worldwide. An early and accurate diagnosis of diabetes mellitus has critical importance for the medical treatments of patients. In this study, first, artificial neural network (ANN) and classification and regression tree (CART)‐based approaches are proposed for the diagnosis of diabetes. Hybrid ANN‐GA and CART‐GA approaches are then developed using a genetic algorithm (GA) to improve the classification accuracy of these approaches. Finally, the performances of the developed approaches are evaluated with a Pima Indian diabetes data set. Experimental results show that the developed hybrid CART‐GA approach outperforms the ANN, CART, and ANN‐GA approaches in terms of classification accuracy, and this approach provides an efficient methodology for diagnosis of diabetes mellitus.

[1]  Der-Chiang Li,et al.  A fuzzy-based data transformation for feature extraction to increase classification performance with small medical data sets , 2011, Artif. Intell. Medicine.

[2]  Novruz Allahverdi,et al.  Design of a hybrid system for the diabetes and heart diseases , 2008, Expert Syst. Appl..

[3]  M. Carnethon,et al.  Serum insulin, obesity, and the incidence of type 2 diabetes in black and white adults: the atherosclerosis risk in communities study: 1987-1998. , 2002, Diabetes care.

[4]  Dayou Liu,et al.  Evolving support vector machines using fruit fly optimization for medical data classification , 2016, Knowl. Based Syst..

[5]  Jian Zhang,et al.  Efficient Volume Exploration Using the Gaussian Mixture Model , 2011, IEEE Transactions on Visualization and Computer Graphics.

[6]  Chee Peng Lim,et al.  A hybrid intelligent system for medical data classification , 2014, Expert Syst. Appl..

[7]  Piotr Duda,et al.  The CART decision tree for mining data streams , 2014, Inf. Sci..

[8]  Elif Derya Übeyli Comparison of different classification algorithms in clinical decision‐making , 2007, Expert Syst. J. Knowl. Eng..

[9]  T. Warren Liao,et al.  Medical data mining by fuzzy modeling with selected features , 2008, Artif. Intell. Medicine.

[10]  Damodar Reddy Edla,et al.  RST-BatMiner: A fuzzy rule miner integrating rough set feature selection and Bat optimization for detection of diabetes disease , 2017, Appl. Soft Comput..

[11]  John H. Lilly,et al.  Evolutionary design of a fuzzy classifier from data , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[12]  K. G. Srinivasa,et al.  A self-adaptive migration model genetic algorithm for data mining applications , 2007, Inf. Sci..

[13]  Gustavo E. A. P. A. Batista,et al.  A study of the behavior of several methods for balancing machine learning training data , 2004, SKDD.

[14]  Pasi Luukka,et al.  Similarity classifier with generalized mean applied to medical data , 2006, Comput. Biol. Medicine.

[15]  M.M.B.R. Vellasco,et al.  Inverted hierarchical neuro-fuzzy BSP system: a novel neuro-fuzzy model for pattern classification and rule extraction in databases , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[16]  Ravinder Agarwal,et al.  Machine learning techniques for medical diagnosis of diabetes using iris images , 2018, Comput. Methods Programs Biomed..

[17]  M. Ringnér,et al.  Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks , 2001, Nature Medicine.

[18]  Gail A. Carpenter,et al.  ARTMAP-IC and medical diagnosis: Instance counting and inconsistent cases , 1998, Neural Networks.

[19]  Pasi Luukka,et al.  Feature selection using fuzzy entropy measures with similarity classifier , 2011, Expert Syst. Appl..

[20]  Kemal Polat,et al.  A cascade learning system for classification of diabetes disease: Generalized Discriminant Analysis and Least Square Support Vector Machine , 2008, Expert Syst. Appl..

[21]  Aboul Ella Hassanien,et al.  Hybrid rough-bijective soft set classification system , 2016, Neural Computing and Applications.

[22]  F.H.F. Leung,et al.  Tuning of the structure and parameters of neural network using an improved genetic algorithm , 2001, IECON'01. 27th Annual Conference of the IEEE Industrial Electronics Society (Cat. No.37243).

[23]  M. Remzi Önder,et al.  Diyabet ve Hipertansiyon , 2000 .

[24]  P. Zimmet,et al.  Physical activity and prevalence of diabetes in Melanesian and Indian men in Fiji , 1984, Diabetologia.

[25]  Robert J. Cox,et al.  A Method for Optimal Division of Data Sets for Use in Neural Networks , 2005, KES.

[26]  Nor Ashidi Mat Isa,et al.  Clustered-Hybrid Multilayer Perceptron network for pattern recognition application , 2011, Appl. Soft Comput..