Decision trees as a simple-to-use and reliable tool to identify individuals with impaired glucose metabolism or type 2 diabetes mellitus.

OBJECTIVE The prevalence of unknown impaired fasting glucose (IFG), impaired glucose tolerance (IGT), or type 2 diabetes mellitus (T2DM) is high. Numerous studies demonstrated that IFG, IGT, or T2DM are associated with increased cardiovascular risk, therefore an improved identification strategy would be desirable. The objective of this study was to create a simple and reliable tool to identify individuals with impaired glucose metabolism (IGM). DESIGN AND METHODS A cohort of 1737 individuals (1055 controls, 682 with previously unknown IGM) was screened by 75 g oral glucose tolerance test (OGTT). Supervised machine learning was used to automatically generate decision trees to identify individuals with IGM. To evaluate the accuracy of identification, a tenfold cross-validation was performed. Resulting trees were subsequently re-evaluated in a second, independent cohort of 1998 individuals (1253 controls, 745 unknown IGM). RESULTS A clinical decision tree included age and systolic blood pressure (sensitivity 89.3%, specificity 37.4%, and positive predictive value (PPV) 48.0%), while a tree based on clinical and laboratory data included fasting glucose and systolic blood pressure (sensitivity 89.7%, specificity 54.6%, and PPV 56.2%). The inclusion of additional parameters did not improve test quality. The external validation approach confirmed the presented decision trees. CONCLUSION We proposed a simple tool to identify individuals with existing IGM. From a practical perspective, fasting blood glucose and blood pressure measurements should be regularly measured in all individuals presenting in outpatient clinics. An OGTT appears to be useful only if the subjects are older than 48 years or show abnormalities in fasting glucose or blood pressure.

[1]  Bendix Carstensen,et al.  A Danish diabetes risk score for targeted screening: the Inter99 study. , 2004, Diabetes care.

[2]  M. Harris,et al.  Early detection of undiagnosed diabetes mellitus: a US perspective , 2000, Diabetes/metabolism research and reviews.

[3]  J. Selbig,et al.  Predicting impaired glucose metabolism in women with polycystic ovary syndrome by decision tree modelling , 2006, Diabetologia.

[4]  Ken Williams,et al.  Identification of individuals with insulin resistance using routine clinical measurements. , 2005, Diabetes.

[5]  T. Valle,et al.  Prevention of type 2 diabetes mellitus by changes in lifestyle among subjects with impaired glucose tolerance. , 2001, The New England journal of medicine.

[6]  M. Fantone,et al.  Report of the Expert Committee on the Diagnosis and Classification of Diabetes Mellitus , 1997, Diabetes Care.

[7]  L. T. Middleton,et al.  Risk prediction of prevalent diabetes in a Swiss population using a weighted genetic score—the CoLaus Study , 2009, Diabetologia.

[8]  J. Lindström,et al.  Sustained reduction in the incidence of type 2 diabetes by lifestyle intervention: follow-up of the Finnish Diabetes Prevention Study , 2006, The Lancet.

[9]  S. Gough,et al.  Validation of an algorithm combining haemoglobin A1c and fasting plasma glucose for diagnosis of diabetes mellitus in UK and Australian populations , 2009, Diabetic medicine : a journal of the British Diabetic Association.

[10]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[11]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[12]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[13]  S. Fowler,et al.  Reduction in the incidence of type 2 diabetes with lifestyle intervention or metformin. , 2002 .

[14]  R. Turner,et al.  Homeostasis model assessment: insulin resistance and β-cell function from fasting plasma glucose and insulin concentrations in man , 1985, Diabetologia.

[15]  Jaakko Tuomilehto,et al.  The diabetes risk score: a practical tool to predict type 2 diabetes risk. , 2003, Diabetes care.

[16]  M. Hanefeld,et al.  The Finnish Diabetes Risk Score is associated with insulin resistance and progression towards type 2 diabetes. , 2009, The Journal of clinical endocrinology and metabolism.

[17]  Nicholas J Wareham,et al.  A simple risk score identifies individuals at high risk of developing Type 2 diabetes: a prospective cohort study. , 2008, Family practice.

[18]  Thomas Lengauer,et al.  Diversity and complexity of HIV-1 drug resistance: A bioinformatics approach to predicting phenotype from genotype , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[19]  K. Chien An Accurate Risk Score Based on Anthropometric, Dietary, and Lifestyle Factors to Predict the Development of Type 2 Diabetes , 2007, Diabetes Care.

[20]  David B Sacks,et al.  A new look at screening and diagnosing diabetes mellitus. , 2008, The Journal of clinical endocrinology and metabolism.

[21]  A. von Eckardstein,et al.  Risk for diabetes mellitus in middle-aged Caucasian male participants of the PROCAM study: implications for the definition of impaired fasting glucose by the American Diabetes Association. Prospective Cardiovascular Münster. , 2000, The Journal of clinical endocrinology and metabolism.

[22]  Stefan R Bornstein,et al.  An Accurate Risk Score Based on Anthropometric, Dietary, and Lifestyle Factors to Predict the Development of Type 2 Diabetes , 2007, Diabetes Care.

[23]  J Tuomilehto,et al.  Diabetes risk score in Oman: a tool to identify prevalent type 2 diabetes among Arabs of the Middle East. , 2007, Diabetes research and clinical practice.