hs-CRP is strongly associated with coronary heart disease (CHD): A data mining approach using decision tree algorithm

BACKGROUND AND AIMS Coronary heart disease (CHD) is an important public health problem globally. Algorithms incorporating the assessment of clinical biomarkers together with several established traditional risk factors can help clinicians to predict CHD and support clinical decision making with respect to interventions. Decision tree (DT) is a data mining model for extracting hidden knowledge from large databases. We aimed to establish a predictive model for coronary heart disease using a decision tree algorithm. METHODS Here we used a dataset of 2346 individuals including 1159 healthy participants and 1187 participant who had undergone coronary angiography (405 participants with negative angiography and 782 participants with positive angiography). We entered 10 variables of a total 12 variables into the DT algorithm (including age, sex, FBG, TG, hs-CRP, TC, HDL, LDL, SBP and DBP). RESULTS Our model could identify the associated risk factors of CHD with sensitivity, specificity, accuracy of 96%, 87%, 94% and respectively. Serum hs-CRP levels was at top of the tree in our model, following by FBG, gender and age. CONCLUSION Our model appears to be an accurate, specific and sensitive model for identifying the presence of CHD, but will require validation in prospective studies.

[1]  Maryam Tayefi,et al.  Serum high-sensitivity C-reactive protein as a biomarker in patients with metabolic syndrome: evidence-based study with 7284 subjects , 2016, European Journal of Clinical Nutrition.

[2]  Theo Stijnen,et al.  Genetic Variation, C-Reactive Protein Levels, and Incidence of Diabetes , 2007, Diabetes.

[3]  P. Malani Harrison’s Principles of Internal Medicine , 2012 .

[4]  Michael J. A. Berry,et al.  Data mining techniques - for marketing, sales, and customer support , 1997, Wiley computer publishing.

[5]  Anne Tybjærg-Hansen,et al.  Genetically Elevated C-Reactive Protein and Ischemic Vascular Disease , 2009 .

[6]  Dimitrios I. Fotiadis,et al.  Automated Diagnosis of Coronary Artery Disease Based on Data Mining and Fuzzy Modeling , 2008, IEEE Transactions on Information Technology in Biomedicine.

[7]  J. Kastelein,et al.  Controversies in cardiovascular medicine C-reactive protein is a mediator of cardiovascular disease , 2010 .

[8]  Habibollah Esmaily,et al.  Cytokine and growth factor profiling in patients with the metabolic syndrome , 2015, British Journal of Nutrition.

[9]  Constantinos S. Pattichis,et al.  Assessment of the Risk Factors of Coronary Heart Events Based on Data Mining With Decision Trees , 2010, IEEE Transactions on Information Technology in Biomedicine.

[10]  Mevlut Ture,et al.  Comparing performances of logistic regression, classification and regression tree, and neural networks for predicting coronary artery disease , 2008, Expert Syst. Appl..

[11]  Soni Jyoti,et al.  Predictive Data Mining for Medical Diagnosis: An Overview of Heart Disease Prediction , 2011 .

[12]  Mahmoud Ebrahimi,et al.  C-reactive protein associated with coronary artery disease in Iranian patients with angiographically defined coronary artery disease. , 2007, Clinical laboratory.

[13]  Wolfgang Koenig,et al.  High-sensitivity C-reactive protein and atherosclerotic disease: from improved risk prediction to risk-guided therapy. , 2013, International journal of cardiology.

[14]  Yi-Ping Phoebe Chen,et al.  Association rule mining to detect factors which contribute to heart disease in males and females , 2013, Expert Syst. Appl..

[15]  C Guijarro,et al.  High-sensitivity C-reactive protein: potential adjunct for global risk assessment in the primary prevention of cardiovascular disease. , 2001, Circulation.

[16]  Sulabha S. Apte,et al.  Improved Study of Heart Disease Prediction System using Data Mining Classification Techniques , 2012 .

[17]  Douglas L. Mann,et al.  Braunwald’s Heart Disease: A Textbook of Cardiovascular Medicine. 8th edition , 2018 .

[18]  Tze-Yun Leong,et al.  Predicting Coronary Artery Disease with Medical Profile and Gene Polymorphisms Data , 2007, MedInfo.

[19]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[20]  J. Manson,et al.  C-reactive protein, interleukin 6, and risk of developing type 2 diabetes mellitus. , 2001, JAMA.

[21]  Jamal Shahrabi,et al.  Applying decision tree for identification of a low risk population for type 2 diabetes. Tehran Lipid and Glucose Study. , 2014, Diabetes research and clinical practice.

[22]  Maryam Tayefi,et al.  The application of a decision tree to establish the parameters associated with hypertension , 2017, Comput. Methods Programs Biomed..

[23]  Imran Kurt,et al.  Analysis of intervariable relationships between major risk factors in the development of coronary artery disease: a classification tree approach. , 2007, Anadolu kardiyoloji dergisi : AKD = the Anatolian journal of cardiology.

[24]  Maryam Negahbani,et al.  Coronary Artery Disease Diagnosis Using Supervised Fuzzy C-Means with Differential Search Algorithm-based Generalized Minkowski Metrics , 2015 .

[25]  Zahi A Fayad,et al.  2010 ACCF/AHA guideline for assessment of cardiovascular risk in asymptomatic adults: a report of the American College of Cardiology Foundation/American Heart Association Task Force on Practice Guidelines. , 2010, Journal of the American College of Cardiology.

[26]  G. Moneta,et al.  Rosuvastatin to Prevent Vascular Events in Men and Women with Elevated C-Reactive Protein , 2009 .

[27]  Paul M Ridker,et al.  Inflammation in atherosclerosis: from pathophysiology to practice. , 2009, Journal of the American College of Cardiology.

[28]  Douglas P. Zipes,et al.  Braunwald's Heart Disease: A Textbook of Cardiovascular Medicine, 2-Volume Set, 10th Edition , 2011 .

[29]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[30]  Jafar Habibi,et al.  A data mining approach for diagnosis of coronary artery disease , 2013, Comput. Methods Programs Biomed..

[31]  Paul Schoenhagen,et al.  Statin therapy, LDL cholesterol, C-reactive protein, and coronary artery disease. , 2005, The New England journal of medicine.