Screening for pre-diabetes using support vector machine model

The global prevalence of diabetes is rapidly increasing. Studies support screening and interventions for pre-diabetes, which results in serious complications and diabetes. This study aimed at developing an intelligence-based screening model for pre-diabetes that could assist with decreasing the prevalence of diabetes through early identification and subsequent interventions. Data from the Korean National Health and Nutrition Examination Survey (KNHANES) were used, excluding subjects with diabetes. The KNHANES 2010 data (n = 4,685) were used for training and internal validation, while data from KNHANES 2011 (n = 4,566) were used for external validation. We developed a model to screen for pre-diabetes using support vector machine (SVM), and performed a systematic evaluation of the SVM model using internal and external validation. We compared the performance of the SVM model with that of a screening score model based on logistic regression analysis for pre-diabetes that had been developed previously. Backward elimination logistic regression resulted in associations between pre-diabetes and age, sex, waist circumference, body mass index, alcohol intake, family history of diabetes, and hypertension. The areas under the curves (AUCs) for the SVM model in the internal and external datasets were 0.761 and 0.731, respectively, while the AUCs for the screening score model were 0.734 and 0.712, respectively. The SVM model developed in this study performed better than the screening score model that had been developed previously and may be more effective for pre-diabetes screening.

[1]  K. Matthews,et al.  Improving the performance of physiologic hot flash measures with support vector machines. , 2009, Psychophysiology.

[2]  S. Yusuf,et al.  The relationship between glucose and incident cardiovascular events. A metaregression analysis of published data from 20 studies of 95,783 individuals followed for 12.4 years. , 1999, Diabetes care.

[3]  C. Herder,et al.  The Association of Genetic Markers for Type 2 Diabetes with Prediabetic Status - Cross-Sectional Data of a Diabetes Prevention Trial , 2013, PloS one.

[4]  Jaakko Tuomilehto,et al.  The diabetes risk score: a practical tool to predict type 2 diabetes risk. , 2003, Diabetes care.

[5]  J. Shaw,et al.  IDF diabetes atlas: global estimates of the prevalence of diabetes for 2011 and 2030. , 2011, Diabetes research and clinical practice.

[6]  Kyung-Ah Kim,et al.  Mortality prediction of rats in acute hemorrhagic shock using machine learning techniques , 2013, Medical & Biological Engineering & Computing.

[7]  The Diabetes Prevention Program (DPP): description of lifestyle intervention. , 2002, Diabetes care.

[8]  J. Shaw,et al.  Global estimates of the prevalence of diabetes for 2010 and 2030. , 2010, Diabetes research and clinical practice.

[9]  Jongoh Kim,et al.  Prevalence and Management of Diabetes in Korean Adults , 2009, Diabetes Care.

[10]  K. Jung-Choi,et al.  The impact of governmental antismoking policy on socioeconomic disparities in cigarette smoking in South Korea. , 2009, Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco.

[11]  Charles X. Ling,et al.  Using AUC and accuracy in evaluating learning algorithms , 2005, IEEE Transactions on Knowledge and Data Engineering.

[12]  B. Reiser,et al.  Estimation of the Youden Index and its Associated Cutoff Point , 2005, Biometrical journal. Biometrische Zeitschrift.

[13]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[14]  D. Schillinger,et al.  Racial/ethnic variation in prevalence estimates for United States prediabetes under alternative 2010 American Diabetes Association criteria: 1988-2008. , 2012, Ethnicity & disease.

[15]  Wolfgang Rathmann,et al.  Prediabetes: a high-risk state for diabetes development , 2012, The Lancet.

[16]  G. Rossi,et al.  Diagnosis and Classification of Diabetes Mellitus The information that follows is based largely on the reports of the Expert Committee on the Diagnosis and Classification of Diabetes (Diabetes Care 20:1183–1197, 1997, and Diabetes Care 26:3160–3167, 2003). , 2008, Diabetes Care.

[17]  H. Bang,et al.  A Simple Screening Score for Diabetes for the Korean Population , 2012, Diabetes Care.

[18]  J. Choi,et al.  Osteoporosis Risk Prediction for Bone Mineral Density Assessment of Postmenopausal Women Using Machine Learning , 2013, Yonsei medical journal.