Deep learning–based prediction model of occurrences of major adverse cardiac events during 1-year follow-up after hospital discharge in patients with AMI using knowledge mining

Traditional regression-based approaches do not provide good results in diagnosis and prediction of occurrences of cardiovascular diseases (CVD). Therefore, the goal of this paper is to propose a deep learning–based prediction model of occurrence of major adverse cardiac events (MACE) during the 1, 6, 12 month follow-up after hospital admission in acute myocardial infarction (AMI) patients using knowledge mining. We used the Korea Acute Myocardial Infarction Registry (KAMIR) dataset, a cardiovascular disease database registered in 52 hospitals in Korea between 1 January, 2005, and 31 December, 2008. Among 14,885 AMI patients, 10,813 subjects in age from 20 to 100 years with the 1-year follow-up traceability without coding errors were finally selected. For our experiment, the training/validation/test dataset split is 60/20/20 by random sampling without replacement. The preliminary deep learning model was first built by applying training and validation datasets and then a new preliminary deep learning model was generated using the best hyperparameters obtained from random hyperparameter grid search. Lastly, the preliminary prediction model of MACE occurrences in AMI patients is evaluated by test dataset. Compared with conventional regression-based models, the performances of machine/deep learning–based prediction models of the MACE occurrence in patients with AMI, including deep neural network (DNN), gradient boosting machine (GBM), and generalized linear model (GLM), are also evaluated through a matrix with sensitivity, specificity, overall accuracy, and the area under the ROC curve (AUC). The prediction results of the MACE occurrence during the 1, 6, and 12-month follow-up in AMI patients were the AUC of DNN (1 M 0.97, 6 M 0.94, 12 M 0.96), GBM (0.96, 0.95, 0.96), and GLM (0.76, 0.67, 0.72) in machine learning–based models as well as GRACE (0.75, 0.72, 0.76) in regression model. Compared with previous models, our deep learning–based prediction models significantly had the accuracy of 95% or higher and outperformed all machine learning and regression-based prediction models. This paper was the first trial of deep learning–based prediction model of the MACE occurrence in AMI clinical data. We found that the proposed prediction model applied different risk factors except the attribute “age” by using knowledge mining and directly used the raw data as input.

[1]  A. Sheikh,et al.  Predicting cardiovascular risk in England and Wales: prospective derivation and validation of QRISK2 , 2008, BMJ : British Medical Journal.

[2]  G. Moneta,et al.  Rosuvastatin to Prevent Vascular Events in Men and Women with Elevated C-Reactive Protein , 2009 .

[3]  Muhammad Saqlain,et al.  A Voting Ensemble Classifier for Wafer Map Defect Patterns Identification in Semiconductor Manufacturing , 2019, IEEE Transactions on Semiconductor Manufacturing.

[4]  E. Antman,et al.  The TIMI risk score for unstable angina/non-ST elevation MI: A method for prognostication and therapeutic decision making. , 2000, JAMA.

[5]  Bart De Moor,et al.  Hyperparameter Search in Machine Learning , 2015, ArXiv.

[6]  P. Lytsy,et al.  Adherence to and beliefs in lipid-lowering medical treatments: a structural equation modeling approach including the necessity-concern framework. , 2013, Patient education and counseling.

[7]  Tim Leiner,et al.  Dilated Convolutional Neural Networks for Cardiovascular MR Segmentation in Congenital Heart Disease , 2016, RAMBO+HVSMR@MICCAI.

[8]  Tom Fahey,et al.  Predictive accuracy of the Framingham coronary risk score in British men:prospective cohort study , 2003, BMJ : British Medical Journal.

[9]  Ashwini Shetty,et al.  Different Data Mining Approaches for Predicting Heart Disease , 2016 .

[10]  Gerhard Weiss,et al.  Machine learning techniques in eating behavior e-coaching , 2017, Personal and Ubiquitous Computing.

[11]  N J Wald,et al.  Use of blood pressure lowering drugs in the prevention of cardiovascular disease: meta-analysis of 147 randomised trials in the context of expectations from prospective epidemiological studies , 2009, BMJ : British Medical Journal.

[12]  K. Eagle,et al.  Prediction of risk of death and myocardial infarction in the six months after presentation with acute coronary syndrome: prospective multinational observational study (GRACE) , 2006, BMJ : British Medical Journal.

[13]  Henrik Madsen,et al.  Introduction to General and Generalized Linear Models , 2010 .

[14]  A. Coulter Can patients assess the quality of health care? , 2006, BMJ : British Medical Journal.

[15]  Daniel Levy,et al.  The Framingham Heart Study and the epidemiology of cardiovascular disease: a historical perspective , 2014, The Lancet.

[16]  J. Hippisley-Cox,et al.  Development and validation of QRISK3 risk prediction algorithms to estimate future risk of cardiovascular disease: prospective cohort study , 2017, British Medical Journal.

[17]  Z. Obermeyer,et al.  Predicting the Future - Big Data, Machine Learning, and Clinical Medicine. , 2016, The New England journal of medicine.

[18]  Nazar Abbas Saqib,et al.  Identification of Heart Failure by Using Unstructured Data of Cardiac Patients , 2016, 2016 45th International Conference on Parallel Processing Workshops (ICPPW).

[19]  Ton Dijkstra,et al.  Cross-validation of bimodal health-related stress assessment , 2013, Personal and Ubiquitous Computing.

[20]  Robert C Welsh,et al.  Validation of the Global Registry of Acute Coronary Event (GRACE) risk score for in-hospital mortality in patients with acute coronary syndrome in Canada. , 2009, American heart journal.

[21]  M. Pencina,et al.  General Cardiovascular Risk Profile for Use in Primary Care: The Framingham Heart Study , 2008, Circulation.

[22]  Basheer Mohammed Al-Maqaleh An Intelligent and Electronic System based Classification and Prediction for Heart Disease Diagnosis , 2016 .

[23]  Michel C. A. Klein,et al.  What technological features are used in smartphone apps that promote physical activity? A review and content analysis , 2017, Personal and Ubiquitous Computing.

[24]  S. A. Hannan,et al.  Mining Techniques to Find Out Heart Diseases : An Overview , 2012 .

[25]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[26]  D. McManus,et al.  Performance of the GRACE Risk Score 2.0 Simplified Algorithm for Predicting 1-Year Death After Hospitalization for an Acute Coronary Syndrome in a Contemporary Multiracial Cohort. , 2016, The American journal of cardiology.

[27]  Yoshua Bengio,et al.  Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..

[28]  D. Colville,et al.  Letter from Deb Colville and colleagues: women doctors , 2003 .

[29]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[30]  Thomas G. Dietterich Adaptive computation and machine learning , 1998 .

[31]  E. Antman,et al.  Dynamic TIMI Risk Score for STEMI , 2013, Journal of the American Heart Association.

[32]  M. Stone Cross‐Validatory Choice and Assessment of Statistical Predictions , 1976 .

[33]  Ju Han Kim,et al.  Current trend of acute myocardial infarction in Korea (from the Korea Acute Myocardial Infarction Registry from 2006 to 2013). , 2014, The American journal of cardiology.

[34]  O. Franco,et al.  Predictive Value of Updating Framingham Risk Scores with Novel Risk Markers in the U.S. General Population , 2014, PloS one.