PREDICTION OF CORONARY ARTERY DISEASE BASED ON ENSEMBLE LEARNING APPROACHES AND CO-EXPRESSED OBSERVATIONS

Background: Coronary artery disease (CAD) is one of the most representative cardiovascular diseases. Early and accurate prediction of CAD from physiological measurements allows timely intervention through medication, a healthy diet, and regular physical activity, which can reduce the risk of heart attack.

Methods: Four heart disease datasets from the UC Irvine Machine Learning Repository were combined and screened to remove incomplete entries, leaving a total of 822 cases for this study. Seven machine learning methods (Naive Bayes, artificial neural networks (ANNs), sequential minimal optimization (SMO), k-nearest neighbor (KNN), AdaBoost, J48, and random forest) were applied to the combined dataset for CAD prediction. By combining co-expressed observations with an ensemble voting mechanism, we designed and evaluated a new medical decision classifier for CAD prediction. The TOPSIS (Technique for Order Preference by Similarity to an Ideal Solution) algorithm was applied to determine the best prediction method for CAD diagnosis.

Results: Systolic blood pressure, cholesterol, heart rate, and ST depression were identified as the features that differ most significantly between patients with and without CAD. We show that the prediction capability of the seven machine learning classifiers can be enhanced by integrating combinations of observed co-expressed features. Finally, compared with any single classifier, the proposed voting mechanism achieved the best performance according to TOPSIS.
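The TOPSIS ranking step mentioned in the Methods can be illustrated with a minimal sketch. It assumes the standard formulation (vector normalization, weighted ideal and anti-ideal points, Euclidean distances); the abstract does not state which criteria, weights, or normalization the study used, so the function name `topsis`, the equal weights, and the metric values below are hypothetical placeholders and not results from the paper.

```python
import math


def topsis(matrix, weights, benefit):
    """Score alternatives (rows) over criteria (columns) with standard TOPSIS.

    matrix  : list of rows, one row per alternative (e.g. one per classifier)
    weights : one non-negative weight per criterion (rescaled to sum to 1)
    benefit : one bool per criterion, True if larger is better
    Returns one closeness score per alternative in [0, 1]; higher is better.
    """
    n_crit = len(weights)
    total = sum(weights)
    w = [x / total for x in weights]

    # Step 1: vector-normalize each column, then apply the criterion weight.
    norms = []
    for j in range(n_crit):
        norm = math.sqrt(sum(row[j] ** 2 for row in matrix))
        norms.append(norm if norm else 1.0)  # guard against an all-zero column
    v = [[w[j] * row[j] / norms[j] for j in range(n_crit)] for row in matrix]

    # Step 2: ideal and anti-ideal points, criterion by criterion.
    cols = list(zip(*v))
    ideal = [max(c) if b else min(c) for c, b in zip(cols, benefit)]
    anti = [min(c) if b else max(c) for c, b in zip(cols, benefit)]

    # Step 3: Euclidean distance to both points, then relative closeness.
    scores = []
    for row in v:
        d_pos = math.sqrt(sum((x - y) ** 2 for x, y in zip(row, ideal)))
        d_neg = math.sqrt(sum((x - y) ** 2 for x, y in zip(row, anti)))
        denom = d_pos + d_neg
        scores.append(d_neg / denom if denom else 0.5)  # all rows identical
    return scores


# Hypothetical example: three candidate classifiers scored on
# (accuracy, sensitivity, specificity). The numbers are made up.
candidates = ["classifier_a", "classifier_b", "voting_ensemble"]
metrics = [
    [0.80, 0.78, 0.82],
    [0.83, 0.85, 0.80],
    [0.86, 0.87, 0.85],
]
scores = topsis(metrics, weights=[1, 1, 1], benefit=[True, True, True])
ranking = sorted(zip(candidates, scores), key=lambda pair: pair[1], reverse=True)
```

In this sketch `ranking` lists the candidates from best to worst by closeness score. A candidate that is best on every criterion coincides with the ideal point and receives a score of 1; cost-type criteria (for example, error rate) would be marked `False` in `benefit`.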
