Performance of SVM Classifiers in Predicting Diabetes

This paper investigates the ability of several models of Support Vector Machines (SVMs) with alternate kernel functions to predict the probability of occurrence of Diabetes (DT) in a mixed patient population. To do this a SVM was trained with 13 inputs (symptoms) from the medical dataset obtained from a university hospital. Different kernel functions, such as Linear, Quadratic, Polyorder (order three), Multi Layer Perceptron (MLP) and Radial Basis Function kernel (RBF) were coded and tested to build the medical diagnosis system (MDS). A detailed database, comprising of healthy and diabetic patients from a university hospital was used for training the SVM for prediction of diabetes. All kernel functions for SVM models showed reasonably good accuracy in prediction of disease (s), with linear kernel structure showing best prediction in 3 out of 4 datasets and Polyorder in one database. Thus the best choice appears to be situation specific.

[1]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[2]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[3]  Rahul Samant,et al.  A study on Feature Selection Methods in Medical Decision Support Systems , 2013 .

[4]  Guotai Jiang,et al.  The research on evaluation of diabetes metabolic function based on Support Vector Machine , 2010, 2010 3rd International Conference on Biomedical Engineering and Informatics.

[5]  Rahul Samant,et al.  Effects of Missing Data Imputation on Classifier Accuracy , 2013 .

[6]  Issam El Naqa,et al.  Machine learning methods for predicting tumor response in lung cancer , 2012 .

[7]  Dimitris Vassis,et al.  e-Doctor: A Web based Support Vector Machine for Automatic Medical Diagnosis , 2013 .

[8]  Lei Tang,et al.  Support vector machine model for diagnosis of lymph node metastasis in gastric cancer with multidetector computed tomography: a preliminary study , 2011, BMC Cancer.

[9]  Yongsheng Ding,et al.  SVM and statistical technique method applying in Primary Open Angle Glaucoma diagnosis , 2010, 2010 8th World Congress on Intelligent Control and Automation.

[10]  M. Uusitupa,et al.  Lifestyles matter in the prevention of type 2 diabetes. , 2002, Diabetes care.

[11]  Park,et al.  Open Access Research Article Identification of Type 2 Diabetes-associated Combination of Snps Using Support Vector Machine , 2022 .

[12]  Rüdiger W. Brause,et al.  Medical Analysis and Diagnosis by Neural Networks , 2001, ISMDA.