Prediction of 4-year risk for coronary artery calcification using ensemble-based classification

The progression of coronary artery calcification (CAC) has been regarded as an important risk factor of coronary artery disease (CAD), which is the biggest cause of death. Because CAC occurrence increases the risk of CAD by a factor of ten, the one whose coronary artery is calcified should pay more attention to the health management. However, performing the computerized tomography (CT) scan to check if coronary artery is calcified as a regular examination might be inefficient due to its high cost. Therefore, it is required to identify high risk persons who need regular follow-up checks of CAC or low risk ones who can avoid unnecessary CT scans. Due to this reason, we develop a 4-year prediction model for a new occurrence of CAC based on data collected by the regular health examination. We build the prediction model using ensemble-based methods to handle imbalanced dataset. Experimental results show that the developed prediction models provided a reasonable accuracy (AUC 75%), which is about 5% higher than the model built by the other imbalanced classification method.

[1]  Matthew J. Budoff,et al.  Prognostic Value of Coronary Artery Calcification , 2005 .

[2]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[3]  Geoff Holmes,et al.  Benchmarking Attribute Selection Techniques for Discrete Class Data Mining , 2003, IEEE Trans. Knowl. Data Eng..

[4]  Daniel S Berman,et al.  Determinants of coronary calcium conversion among patients with a normal coronary calcium scan: what is the "warranty period" for remaining normal? , 2010, Journal of the American College of Cardiology.

[5]  Ryan Jg Periodic health examination of the elderly. , 1978 .

[6]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[7]  Scott D Flamm,et al.  Role of noninvasive testing in the clinical evaluation of women with suspected coronary artery disease: Consensus statement from the Cardiac Imaging Committee, Council on Clinical Cardiology, and the Cardiovascular Imaging and Intervention Committee, Council on Cardiovascular Radiology and Intervent , 2005, Circulation.

[8]  M. Budoff,et al.  Rates of progression of coronary calcium by electron beam tomography. , 2000, The American journal of cardiology.

[9]  Geoffrey I. Webb,et al.  MultiBoosting: A Technique for Combining Boosting and Wagging , 2000, Machine Learning.

[10]  Jonathan G Goldin,et al.  Calcium begets calcium: progression of coronary artery calcification in asymptomatic subjects. , 2002, Radiology.

[11]  Zhi-Hua Zhou,et al.  Exploratory Undersampling for Class-Imbalance Learning , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[12]  Y. Freund,et al.  Discussion of the Paper \additive Logistic Regression: a Statistical View of Boosting" By , 2000 .

[13]  Hye Jin Kam,et al.  Identifying relatively high-risk group of coronary artery calcification based on progression rate: Statistical and machine learning methods , 2012, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.