Identifying relatively high-risk group of coronary artery calcification based on progression rate: Statistical and machine learning methods

Coronary artery calcification (CAC) score is an important predictor of coronary artery disease (CAD), which is the primary cause of death in advanced countries. Early prediction of high-risk of CAC based on progression rate enables people to prevent CAD from developing into severe symptoms and diseases. In this study, we developed various classifiers to identify patients in high risk of CAC using statistical and machine learning methods, and compared them with performance accuracy. For statistical approaches, linear regression based classifier and logistic regression model were developed. For machine learning approaches, we suggested three kinds of ensemble-based classifiers (best, top-k, and voting method) to deal with imbalanced distribution of our data set. Ensemble voting method outperformed all other methods including regression methods as AUC was 0.781.

[1]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[2]  Y. Freund,et al.  Discussion of the Paper \additive Logistic Regression: a Statistical View of Boosting" By , 2000 .

[3]  Michael Motro,et al.  Accelerated coronary artery calcification in mildly reduced renal function of high-risk hypertensives: a 3-year prospective observation , 2003, Journal of hypertension.

[4]  Robert Detrano,et al.  Relation of coronary calcium progression and control of lipids according to National Cholesterol Education Program guidelines. , 2004, The American journal of cardiology.

[5]  Ian H. Witten,et al.  WEKA - Experiences with a Java Open-Source Project , 2010, J. Mach. Learn. Res..

[6]  Helmut Schulte,et al.  Prediction of risk of coronary events in middle-aged men in the Prospective Cardiovascular Münster Study (PROCAM) using neural networks. , 2002, International journal of epidemiology.

[7]  Kevin B. Korb,et al.  Epidemiological data mining of cardiovascular Bayesian networks , 2006 .

[8]  P. Greenland,et al.  Coronary artery calcium score and risk classification for coronary heart disease prediction. , 2010, JAMA.

[9]  Iftikhar J. Kullo,et al.  Progression of Subclinical Coronary Atherosclerosis: Does Obesity Make a Difference? , 2005, Circulation.

[10]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[11]  R. Detrano,et al.  Coronary artery calcium score combined with Framingham score for risk prediction in asymptomatic individuals. , 2004, JAMA.

[12]  Steven Shea,et al.  Risk Factors for the Progression of Coronary Artery Calcification in Asymptomatic Subjects: Results From the Multi-Ethnic Study of Atherosclerosis (MESA) , 2007, Circulation.

[13]  Matthew J Budoff,et al.  Coronary calcium progression rates with a zero initial score by electron beam tomography. , 2007, International journal of cardiology.

[14]  Daniel S Berman,et al.  Determinants of coronary calcium conversion among patients with a normal coronary calcium scan: what is the "warranty period" for remaining normal? , 2010, Journal of the American College of Cardiology.

[15]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[16]  Daniel S Berman,et al.  Long-term prognosis associated with coronary calcification: observations from a registry of 25,253 patients. , 2007, Journal of the American College of Cardiology.

[17]  Irwin Feuerstein,et al.  Coronary calcium independently predicts incident premature coronary heart disease over measured cardiovascular risk factors: mean three-year outcomes in the Prospective Army Coronary Calcium (PACC) project. , 2005, Journal of the American College of Cardiology.

[18]  M. Budoff,et al.  Rates of progression of coronary calcium by electron beam tomography. , 2000, The American journal of cardiology.

[19]  usso,et al.  EFFECT OF HMG-CoA REDUCTASE INHIBITORS ON CORONARY ARTERY DISEASE AS ASSESSED BY ELECTRON-BEAM COMPUTED TOMOGRAPHY , 2000 .

[20]  Richard A. Kronmal,et al.  Distribution of Coronary Artery Calcium by Race, Gender, and Age: Results from the Multi-Ethnic Study of Atherosclerosis (MESA) , 2005, Circulation.

[21]  Geoffrey I. Webb,et al.  MultiBoosting: A Technique for Combining Boosting and Wagging , 2000, Machine Learning.

[22]  Jonathan G Goldin,et al.  Calcium begets calcium: progression of coronary artery calcification in asymptomatic subjects. , 2002, Radiology.