Automatic Detection of Diabetes Diagnosis using Feature Weighted Support Vector Machines based on Mutual Information and Modified Cuckoo Search

Diabetes is a major health problem in both developing and developed countries, and its incidence is rising dramatically. In this study, we investigate a novel automatic approach to diagnosing diabetes based on Feature Weighted Support Vector Machines (FW-SVMs) and Modified Cuckoo Search (MCS). The proposed model consists of three stages. First, Principal Component Analysis (PCA) is applied to select an optimal subset of features from the full feature set. Second, Mutual Information (MI) is employed to construct the FW-SVM by weighting the features according to their degree of importance. Finally, because parameter selection plays a vital role in the classification accuracy of SVMs, MCS is applied to select the best parameter values. The proposed MI-MCS-FWSVM method obtains 93.58% accuracy on the UCI dataset. The experimental results demonstrate that our method outperforms previous methods, not only giving more accurate results but also significantly speeding up the classification procedure.
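The second stage can be illustrated with a minimal sketch. The abstract does not say how mutual information is estimated or how the weights enter the SVM, so the following rests on assumptions: features are taken to be already discretised, MI is estimated from simple co-occurrence counts, the weights are normalised to sum to 1, and they are applied by scaling each feature inside an RBF kernel. The function names (`mutual_information`, `feature_weights`, `weighted_rbf`) and the parameter `gamma` are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def mutual_information(feature, labels):
    """MI (in nats) between one discrete feature column and the class labels.

    Assumption: a plain count-based estimate on discretised values.
    """
    n = len(labels)
    joint = Counter(zip(feature, labels))
    px = Counter(feature)
    py = Counter(labels)
    mi = 0.0
    for (x, y), count in joint.items():
        p_xy = count / n
        # p_xy / (p_x * p_y), written with counts to avoid extra divisions
        mi += p_xy * math.log(p_xy * n * n / (px[x] * py[y]))
    return mi


def feature_weights(columns, labels):
    """Normalise the per-feature MI scores so that the weights sum to 1."""
    scores = [mutual_information(col, labels) for col in columns]
    total = sum(scores)
    if total == 0:
        # No feature is informative: fall back to uniform weights.
        return [1.0 / len(scores)] * len(scores)
    return [s / total for s in scores]


def weighted_rbf(x, z, weights, gamma):
    """RBF kernel on feature-scaled inputs (assumed form of the weighting):
    exp(-gamma * sum_k (w_k * (x_k - z_k))**2).
    """
    d2 = sum((w * (a - b)) ** 2 for w, a, b in zip(weights, x, z))
    return math.exp(-gamma * d2)


# Toy data: the first feature determines the label, the second is independent of it.
labels = [0, 0, 1, 1]
columns = [[0, 0, 1, 1], [0, 1, 0, 1]]
weights = feature_weights(columns, labels)
```

In this toy case the informative feature receives essentially all of the weight, so differences in the second feature no longer affect the kernel value. A kernel of this form could be passed as a precomputed Gram matrix to an SVM solver, with `gamma` and the SVM penalty parameter left to the search procedure in the third stage.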
