Diabetes Diagnostic Prediction Using Support Vector Machines

Abstract: The most important factors in diagnosing diabetes mellitus (DM) are age, body mass index (BMI), and blood glucose concentration. Diagnosis of DM by a physician is complicated because several factors are involved in the disease, and the diagnosis is subject to human error. A blood test alone does not provide enough information for a correct diagnosis. A support vector machine (SVM) was implemented to predict the diagnosis of DM from these factors. The output variable has three classes: without diabetes, with a predisposition to diabetes, and with diabetes. The resulting SVM achieved an accuracy of 99.2% on Colombian patients and 65.6% on a data set of patients of a different ethnic background.
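The setup described in the abstract (three inputs, three output classes) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the kernel, library, or training procedure, so the sketch assumes a linear SVM trained with a Pegasos-style stochastic subgradient method and combined one-vs-one for the three classes. The data are synthetic and purely illustrative, the helper names (`make_synthetic`, `train_linear_svm`, and so on) are hypothetical, and the printed accuracy has no relation to the figures reported above.

```python
import random
from itertools import combinations

# Class labels taken from the abstract.
CLASSES = {0: "without diabetes", 1: "predisposition to diabetes", 2: "with diabetes"}

def make_synthetic(n, rng):
    """Generate SYNTHETIC rows of (age, BMI, glucose); not the study's data."""
    rows, labels = [], []
    for _ in range(n):
        label = rng.randrange(3)
        age = rng.gauss(35 + 10 * label, 8)        # years (assumed distribution)
        bmi = rng.gauss(23 + 3 * label, 2.5)       # kg/m^2 (assumed distribution)
        glucose = rng.gauss(85 + 30 * label, 7)    # mg/dL (assumed distribution)
        rows.append([age, bmi, glucose])
        labels.append(label)
    return rows, labels

def standardize(train, test):
    """Scale features with training-set statistics and append a bias term."""
    cols = list(zip(*train))
    means = [sum(c) / len(c) for c in cols]
    stds = [(sum((v - m) ** 2 for v in c) / len(c)) ** 0.5 or 1.0
            for c, m in zip(cols, means)]
    def apply(rows):
        # The constant 1.0 lets the bias be learned as an extra weight.
        return [[(v - m) / s for v, m, s in zip(r, means, stds)] + [1.0]
                for r in rows]
    return apply(train), apply(test)

def train_linear_svm(X, y, rng, lam=0.01, epochs=50):
    """Binary linear SVM (labels +1/-1) via Pegasos-style subgradient steps."""
    w = [0.0] * len(X[0])
    order = list(range(len(X)))
    t = 0
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            t += 1
            eta = 1.0 / (lam * t)                  # decaying step size
            margin = y[i] * sum(wj * xj for wj, xj in zip(w, X[i]))
            w = [(1.0 - eta * lam) * wj for wj in w]   # regularization shrink
            if margin < 1.0:                       # hinge loss is active
                w = [wj + eta * y[i] * xj for wj, xj in zip(w, X[i])]
    return w

def train_one_vs_one(X, y, rng):
    """Train one binary SVM per pair of classes."""
    models = {}
    for a, b in combinations(sorted(set(y)), 2):
        pairs = [(x, 1 if label == a else -1)
                 for x, label in zip(X, y) if label in (a, b)]
        Xp, yp = zip(*pairs)
        models[(a, b)] = train_linear_svm(Xp, yp, rng)
    return models

def predict(models, x):
    """Majority vote over the pairwise classifiers; ties go to the lowest label."""
    votes = {}
    for (a, b), w in models.items():
        winner = a if sum(wj * xj for wj, xj in zip(w, x)) >= 0 else b
        votes[winner] = votes.get(winner, 0) + 1
    return max(sorted(votes), key=votes.get)

rng = random.Random(0)
X_train_raw, y_train = make_synthetic(300, rng)
X_test_raw, y_test = make_synthetic(100, rng)
X_train, X_test = standardize(X_train_raw, X_test_raw)
models = train_one_vs_one(X_train, y_train, rng)
predictions = [predict(models, x) for x in X_test]
accuracy = sum(p == t for p, t in zip(predictions, y_test)) / len(y_test)
print(f"accuracy on synthetic hold-out data: {accuracy:.3f}")
```

Evaluating on a separately drawn hold-out set mirrors, in a very loose way, the abstract's point that accuracy depends on the population being tested: a model fitted to one distribution can perform much worse on data drawn from another.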
