Machine learning for the prediction of atherosclerotic cardiovascular disease during 3‐year follow up in Chinese type 2 diabetes mellitus patients

AIMS/INTRODUCTION Clinical guidelines for the management of individuals with type 2 diabetes mellitus endorse the systematic assessment of atherosclerotic cardiovascular disease risk for early interventions. In this study, we aimed to develop machine learning models to predict 3-year atherosclerotic cardiovascular disease risk in Chinese type 2 diabetes mellitus patients. MATERIALS AND METHODS Clinical records of 4,722 individuals with type 2 diabetes mellitus admitted to 94 hospitals were used. The features included demographic information, disease histories, laboratory tests and physical examinations. Logistic regression, support vector machine, gradient boosting decision tree, random forest and adaptive boosting were applied for model construction. The performance of these models was evaluated using the area under the receiver operating characteristic curve. Additionally, we applied SHapley Additive exPlanation values to explain the prediction model. RESULTS All five models achieved good performance in both internal and external test sets (area under the receiver operating characteristic curve >0.8). Random forest showed the highest discrimination ability, with sensitivity and specificity being 0.838 and 0.814, respectively. The SHapley Additive exPlanation analyses showed that previous history of diabetic peripheral vascular disease, older populations and longer diabetes duration were the three most influential predictors. CONCLUSIONS The prediction models offer opportunities to personalize treatment and maximize the benefits of these medical interventions.

[1]  Agata M. Wijata,et al.  Machine learning predicts cardiovascular events in patients with diabetes: The Silesia Diabetes-Heart Project. , 2023, Current problems in cardiology.

[2]  Liangxiao Jiang,et al.  Rigorous non-disjoint discretization for naive Bayes , 2023, Pattern Recognit..

[3]  J. Chan,et al.  Pathophysiology, phenotypes and management of type 2 diabetes mellitus in Indian and Chinese populations , 2022, Nature Reviews Endocrinology.

[4]  Qi Li,et al.  Automating and improving cardiovascular disease prediction using Machine learning and EMR data features from a regional healthcare system , 2022, Int. J. Medical Informatics.

[5]  E. Huang,et al.  10. Cardiovascular Disease and Risk Management: Standards of Medical Care in Diabetes-2022. , 2021, Diabetes care.

[6]  V. Aboyans,et al.  Lower Extremity Peripheral Artery Disease: Contemporary Epidemiology, Management Gaps, and Future Directions: A Scientific Statement From the American Heart Association. , 2021, Circulation.

[7]  Puspanjali Mohapatra,et al.  An analytical study of modified multi-objective Harris Hawk Optimizer towards medical data feature selection , 2021, Comput. Biol. Medicine.

[8]  E. Ross,et al.  Leveraging Machine Learning and Artificial Intelligence to Improve Peripheral Artery Disease Detection, Treatment, and Outcomes. , 2021, Circulation research.

[9]  D. Telesca,et al.  Low HbA1c levels and all-cause or cardiovascular mortality among people without diabetes: the US National Health and Nutrition Examination Survey 1999-2015. , 2020, International journal of epidemiology.

[10]  Shengshou Hu,et al.  Cardiovascular risk factors in China: a nationwide population-based cohort study. , 2020, The Lancet. Public health.

[11]  Long Jiang Zhang,et al.  A clinically applicable deep-learning model for detecting intracranial aneurysm in computed tomography angiography images , 2020, Nature Communications.

[12]  C. Russo,et al.  The Framingham cardiovascular risk score and 5‐year progression of multiple sclerosis , 2020, European journal of neurology.

[13]  Li Yang,et al.  Study of cardiovascular disease prediction model based on random forest in eastern China , 2020, Scientific Reports.

[14]  S. Goldhaber,et al.  Peripheral Artery Disease: Past, Present, and Future. , 2019, The American journal of medicine.

[15]  Qifu Li,et al.  Standards of medical care for type 2 diabetes in China 2019 , 2019, Diabetes/metabolism research and reviews.

[16]  K. Ngiam,et al.  Big data and machine learning algorithms for health-care delivery. , 2019, The Lancet. Oncology.

[17]  A. Khera,et al.  2019 ACC/AHA Guideline on the Primary Prevention of Cardiovascular Disease: Executive Summary: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines. , 2019, Journal of the American College of Cardiology.

[18]  B. Davis,et al.  Prediction of individual life-years gained without cardiovascular events from lipid, blood pressure, glucose, and aspirin treatment based on data of more than 500 000 patients with Type 2 diabetes mellitus. , 2019, European heart journal.

[19]  Jim Warren,et al.  Cardiovascular disease risk prediction equations in 400 000 primary care patients in New Zealand: a derivation and validation study , 2018, The Lancet.

[20]  Max A. Viergever,et al.  A Recurrent CNN for Automatic Detection and Classification of Coronary Artery Plaque and Stenosis in Coronary CT Angiography , 2018, IEEE Transactions on Medical Imaging.

[21]  David Zidar,et al.  Accuracy of Cardiovascular Risk Prediction Varies by Neighborhood Socioeconomic Position , 2017, Annals of Internal Medicine.

[22]  Anna C. Porter,et al.  Coronary Artery Calcification and Risk of Cardiovascular Disease and Death Among Patients With Chronic Kidney Disease , 2017, JAMA cardiology.

[23]  Xueli Yang,et al.  Predicting the 10-Year Risks of Atherosclerotic Cardiovascular Disease in Chinese Population: The China-PAR Project (Prediction for ASCVD Risk in China) , 2016, Circulation.

[24]  R. Kronmal,et al.  10-Year Coronary Heart Disease Risk Prediction Using Coronary Artery Calcium and Traditional Risk Factors: Derivation in the MESA (Multi-Ethnic Study of Atherosclerosis) With Validation in the HNR (Heinz Nixdorf Recall) Study and the DHS (Dallas Heart Study). , 2015, Journal of the American College of Cardiology.

[25]  Sung Hee Choi,et al.  The ACC/AHA 2013 pooled cohort equations compared to a Korean Risk Prediction Model for atherosclerotic cardiovascular disease. , 2015, Atherosclerosis.

[26]  H. Nathoe,et al.  The Relation Between HbA1c and Cardiovascular Events in Patients With Type 2 Diabetes With and Without Vascular Disease , 2015, Diabetes Care.

[27]  Marja-Riitta Taskinen,et al.  New insights into the pathophysiology of dyslipidemia in type 2 diabetes. , 2015, Atherosclerosis.

[28]  John W McEvoy,et al.  An analysis of calibration and discrimination among multiple cardiovascular risk scores in a modern multiethnic cohort. , 2015, Annals of internal medicine.

[29]  Anette Varbo,et al.  Triglycerides and cardiovascular disease , 2014, The Lancet.

[30]  K. Fox,et al.  Primacy of the 3B approach to control risk factors for cardiovascular disease in type 2 diabetes patients. , 2013, The American journal of medicine.

[31]  R J Hinchliffe,et al.  Diagnosis and management of peripheral arterial disease , 2012, BMJ : British Medical Journal.

[32]  Rodney X. Sturdivant,et al.  Applied Logistic Regression: Hosmer/Applied Logistic Regression , 2005 .

[33]  Dong Zhao,et al.  Predictive value for the Chinese population of the Framingham CHD risk assessment tool compared with the Chinese Multi-Provincial Cohort Study. , 2004, JAMA.

[34]  L. Breiman Random Forests , 2001, Encyclopedia of Machine Learning and Data Mining.

[35]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[36]  R. D'Agostino,et al.  Validation of the Framingham coronary heart disease prediction scores: results of a multiple ethnic groups investigation. , 2001, JAMA.

[37]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[38]  J. Mckenney,et al.  Executive Summary of The Third Report of The National Cholesterol Education Program (NCEP) Expert Panel on Detection, Evaluation, And Treatment of High Blood Cholesterol In Adults (Adult Treatment Panel III). , 2001, JAMA.