Neural Network-Based Coronary Heart Disease Risk Prediction Using Feature Correlation Analysis

Background Of the machine learning techniques used in predicting coronary heart disease (CHD), neural network (NN) is popularly used to improve performance accuracy. Objective Even though NN-based systems provide meaningful results based on clinical experiments, medical experts are not satisfied with their predictive performances because NN is trained in a “black-box” style. Method We sought to devise an NN-based prediction of CHD risk using feature correlation analysis (NN-FCA) using two stages. First, the feature selection stage, which makes features acceding to the importance in predicting CHD risk, is ranked, and second, the feature correlation analysis stage, during which one learns about the existence of correlations between feature relations and the data of each NN predictor output, is determined. Result Of the 4146 individuals in the Korean dataset evaluated, 3031 had low CHD risk and 1115 had CHD high risk. The area under the receiver operating characteristic (ROC) curve of the proposed model (0.749 ± 0.010) was larger than the Framingham risk score (FRS) (0.393 ± 0.010). Conclusions The proposed NN-FCA, which utilizes feature correlation analysis, was found to be better than FRS in terms of CHD risk prediction. Furthermore, the proposed model resulted in a larger ROC curve and more accurate predictions of CHD risk in the Korean population than the FRS.

[1]  Roohallah Alizadehsani,et al.  Computer aided decision making for heart disease detection using hybrid neural network-Genetic algorithm , 2017, Comput. Methods Programs Biomed..

[2]  Andy P. Field,et al.  Discovering Statistics Using Ibm Spss Statistics , 2017 .

[3]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[4]  David Sussillo,et al.  Opening the Black Box: Low-Dimensional Dynamics in High-Dimensional Recurrent Neural Networks , 2013, Neural Computation.

[5]  Tribikram Pradhan,et al.  Application of Data Mining Techniques on Heart Disease Prediction: A Survey , 2016 .

[6]  John A. Swets,et al.  Signal Detection Theory and ROC Analysis in Psychology and Diagnostics: Collected Papers , 1996 .

[7]  Peter Jüni,et al.  Correlation of Intravascular Ultrasound Findings With Histopathological Analysis of Thrombus Aspirates in Patients With Very Late Drug-Eluting Stent Thrombosis , 2009, Circulation.

[8]  Zhidong Zhao,et al.  An Intelligent System for Noninvasive Diagnosis of Coronary Artery Disease with EMD-TEO and BP Neural Network , 2008, 2008 International Workshop on Education Technology and Training & 2008 International Workshop on Geoscience and Remote Sensing.

[9]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[10]  W. Baxt Application of artificial neural networks to clinical medicine , 1995, The Lancet.

[11]  R. Cabrera Sole,et al.  [PP.29.15] OBESITY INCREASES THE CENTRAL SYSTOLIC AND DIASTOLIC BLOOD PRESSURE DESPITE HAVING PROPER TREATMENT IN HYPERTENSIVE PATIENTS , 2016 .

[12]  Francesco Corea,et al.  Introduction to Data , 2017, IBM SPSS Essentials.

[13]  Kazuo Kimura,et al.  Prognostic impact of spontaneous coronary artery dissection in young female patients with acute myocardial infarction: A report from the Angina Pectoris-Myocardial Infarction Multicenter Investigators in Japan. , 2016, International journal of cardiology.

[14]  Qeethara Al-Shayea Artificial Neural Networks in Medical Diagnosis , 2024, International Journal of Research Publication and Reviews.

[15]  Craig E. Kuziemsky,et al.  Identifying high-cost patients using data mining techniques and a small set of non-trivial attributes , 2014, Comput. Biol. Medicine.

[16]  R. Detrano,et al.  International application of a new probability algorithm for the diagnosis of coronary artery disease. , 1989, The American journal of cardiology.

[17]  M. Carroll,et al.  Hypertension, abnormal cholesterol, and high body mass index among non-Hispanic Asian adults: United States, 2011-2012. , 2014, NCHS data brief.

[18]  William Peters,et al.  The next generation of clinical decision support: linking evidence to best practice. , 2002, Journal of healthcare information management : JHIM.

[19]  U. Rajendra Acharya,et al.  Automated identification of normal and diabetes heart rate signals using nonlinear measures , 2013, Comput. Biol. Medicine.

[20]  William H. Barker,et al.  CLINICAL EPIDEMIOLOGY: THE ESSENTIALS. , 1984 .

[21]  Yaowapa Maneerat,et al.  Intersected genes in hyperlipidemia and coronary bypass patients: Feasible biomarkers for coronary heart disease , 2016 .

[22]  José David Martín-Guerrero,et al.  A new machine learning approach for predicting the response to anemia treatment in a large cohort of End Stage Renal Disease patients undergoing dialysis , 2015, Comput. Biol. Medicine.

[23]  Metin Akay,et al.  Noninvasive diagnosis of coronary artery disease using a neural network algorithm , 1993, Biological Cybernetics.

[24]  E. Iso,et al.  Measurement Uncertainty and Probability: Guide to the Expression of Uncertainty in Measurement , 1995 .

[25]  Sangeet Srivastava,et al.  A Hybrid Data Mining Model to Predict Coronary Artery Disease Cases Using Non-Invasive Clinical Data , 2016, Journal of Medical Systems.

[26]  Ian Graham,et al.  Relationships between body mass index, cardiovascular mortality, and risk factors: a report from the SCORE investigators , 2011, European journal of cardiovascular prevention and rehabilitation : official journal of the European Society of Cardiology, Working Groups on Epidemiology & Prevention and Cardiac Rehabilitation and Exercise Physiology.

[27]  Eric J Topol,et al.  Effect of antihypertensive agents on cardiovascular events in patients with coronary disease and normal blood pressure: the CAMELOT study: a randomized controlled trial. , 2004, JAMA.

[28]  M J Legato,et al.  Women's perceptions of their general health, with special reference to their risk of coronary artery disease: results of a national telephone survey. , 1997, Journal of women's health.

[29]  A. von Eckardstein,et al.  High-density lipoprotein cholesterol as a predictor of coronary heart disease risk. The PROCAM experience and pathophysiological implications for reverse cholesterol transport. , 1996, Atherosclerosis.

[30]  E. Barrett-Connor,et al.  Why is diabetes mellitus a stronger risk factor for fatal ischemic heart disease in women than in men? The Rancho Bernardo Study. , 1991, JAMA.

[31]  Stephen V. Stehman,et al.  Selecting and interpreting measures of thematic classification accuracy , 1997 .

[32]  Majid Ezzati,et al.  Estimates of global and regional potential health gains from reducing multiple major risk factors , 2003, The Lancet.

[33]  B. Horne,et al.  Usefulness of high-sensitivity C-reactive protein in predicting long-term risk of death or acute myocardial infarction in patients with unstable or stable angina pectoris or acute myocardial infarction. , 2002, The American journal of cardiology.

[34]  Anna Shcherbina,et al.  Not Just a Black Box: Learning Important Features Through Propagating Activation Differences , 2016, ArXiv.

[35]  W. Willett,et al.  Relative and absolute excess risks of coronary heart disease among women who smoke cigarettes. , 1987, The New England journal of medicine.

[36]  Jae-Kwon Kim,et al.  Adaptive mining prediction model for content recommendation to coronary heart disease patients , 2014, Cluster Computing.

[37]  Arch G Mainous,et al.  A coronary heart disease risk score based on patient-reported information. , 2007, The American journal of cardiology.

[38]  Simon Capewell,et al.  Cardiovascular risk factor trends and potential for reducing coronary heart disease mortality in the United States of America. , 2010, Bulletin of the World Health Organization.

[39]  Barry Robson,et al.  Implementation of a web based universal exchange and inference language for medicine: Sparse data, probabilities and inference in data mining of clinical data repositories , 2015, Comput. Biol. Medicine.

[40]  R. Collins,et al.  Blood pressure, stroke, and coronary heart disease Part 1, prolonged differences in blood pressure: prospective observational studies corrected for the regression dilution bias , 1990, The Lancet.

[41]  Hector M. Romero Ugalde,et al.  Computational cost improvement of neural network models in black box nonlinear system identification , 2015, Neurocomputing.

[42]  Achal Kumar Goyal,et al.  Cardiovascular risk prediction: a comparative study of Framingham and quantum neural network based approach , 2016, Patient preference and adherence.

[43]  W. Kannel Blood pressure as a cardiovascular risk factor: prevention and treatment. , 1996, JAMA.

[44]  Igor Kononenko,et al.  Analysing and improving the diagnosis of ischaemic heart disease with machine learning , 1999, Artif. Intell. Medicine.

[45]  Enrique Sanchez-Delgado,et al.  Lifetime risk of developing coronary heart disease , 1999, The Lancet.

[46]  J. Havel,et al.  Artificial neural networks in medical diagnosis , 2013 .

[47]  W. Kannel,et al.  Electrocardiographic left ventricular hypertrophy and risk of coronary heart disease. The Framingham study. , 1970 .