A Stacking-Based Model for Non-Invasive Detection of Coronary Heart Disease

Coronary arteriongraphy (CAG) is an accurate invasive technique for the diagnosis of coronary heart disease (CHD). However, its invasive procedure is not appropriate for the detection of CHD in the annual physical examination. With the successful application of machine learning (ML) in various fields, our goal is to perform selective integration of multiple ML algorithms and verify the validity of feature selection methods with personal clinical information commonly seen in the annual physical examination. In this study, a two level stacking based model is designed in which level 1 is base-level and level 2 is meta-level. The predictions of base-level classifiers is selected as the input of meta-level. The pearson correlation coefficient and maximum information coefficient are first calculated to find the classifier with the lowest correlation. Then enumeration algorithm is used to find the best combining classifiers which acquire the best result in the end. The Z-Alizadeh Sani CHD dataset which we use consists of 303 cases verified by CAG. Experimental results demonstrate that the proposed model obtains an accuracy, sensitivity and specificity of 95.43%, 95.84%, 94.44%, respectively for the detection of CHD. The proposed method can effectively aid clinicians to detect those with normal coronary arteries from those with CHD.

[1]  Yanhuang Jiang,et al.  LibEDM: A Platform for Ensemble Based Data Mining , 2014, 2014 IEEE International Conference on Data Mining Workshop.

[2]  C. Metz Basic principles of ROC analysis. , 1978, Seminars in nuclear medicine.

[3]  J. Zhu,et al.  Predicting corticosteroid‐free endoscopic remission with vedolizumab in ulcerative colitis , 2018, Alimentary pharmacology & therapeutics.

[4]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[5]  Giles M. Foody,et al.  Feature Selection for Classification of Hyperspectral Data by SVM , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[6]  Saeid Nahavandi,et al.  Non-invasive detection of coronary artery disease in high-risk patients based on the stenosis prediction of separate coronary arteries , 2018, Comput. Methods Programs Biomed..

[7]  Bhuvaneswari Ragothaman,et al.  A Multi-objective Non-Dominated Sorted Artificial Bee Colony Feature Selection Algorithm for Medical Datasets , 2016 .

[8]  M. Motwani,et al.  Machine learning for prediction of all-cause mortality in patients with suspected coronary artery disease: a 5-year multicentre prospective registry analysis , 2016, European heart journal.

[9]  U. Rajendra Acharya,et al.  Characterization of coronary artery disease using flexible analytic wavelet transform applied on ECG signals , 2017, Biomed. Signal Process. Control..

[10]  Ljupčo Todorovski,et al.  A comparison of stacking with meta decision trees to other combining methods , 2001 .

[11]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[12]  Ren Long,et al.  iRSpot-EL: identify recombination spots with an ensemble learning approach , 2017, Bioinform..

[13]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[14]  Shai Avidan,et al.  Ensemble Tracking , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Kindie Biredagn Nahato,et al.  Hybrid approach using fuzzy sets and extreme learning machine for classifying clinical datasets , 2016 .

[16]  Deepu Vijayasenan,et al.  A novel approach for classification of normal/abnormal phonocardiogram recordings using temporal signal analysis and machine learning , 2016, 2016 Computing in Cardiology Conference (CinC).

[17]  Gunnar Rätsch,et al.  Soft Margins for AdaBoost , 2001, Machine Learning.

[18]  Tom Mitchell,et al.  Exploring predictive and reproducible modeling with the single‐subject FIAC dataset , 2006, Human brain mapping.

[19]  Bernard Zenko,et al.  Is Combining Classifiers with Stacking Better than Selecting the Best One? , 2004, Machine Learning.

[20]  Phayung Meesad,et al.  A highly accurate firefly based algorithm for heart disease prediction , 2015, Expert Syst. Appl..

[21]  Vince D. Calhoun,et al.  An ensemble learning system for a 4-way classification of Alzheimer’s disease and mild cognitive impairment , 2018, Journal of Neuroscience Methods.

[22]  Joseph Sill,et al.  Feature-Weighted Linear Stacking , 2009, ArXiv.

[23]  Alper Ekrem Murat,et al.  A discrete particle swarm optimization method for feature selection in binary classification problems , 2010, Eur. J. Oper. Res..

[24]  Stephen J. Wright,et al.  Hogwild: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent , 2011, NIPS.

[25]  Babak Mohammadzadeh Asl,et al.  Automated diagnosis of coronary artery disease (CAD) patients using optimized SVM , 2017, Comput. Methods Programs Biomed..

[26]  F. Fleuret Fast Binary Feature Selection with Conditional Mutual Information , 2004, J. Mach. Learn. Res..

[27]  Sungzoon Cho,et al.  GA-SVM wrapper approach for feature subset selection in keystroke dynamics identity verification , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[28]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[29]  Mariette Awad,et al.  A local mixture based SVM for an efficient supervised binary classification , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).

[30]  Michael Mitzenmacher,et al.  Detecting Novel Associations in Large Data Sets , 2011, Science.

[31]  Ridvan Saraçoglu,et al.  Hidden Markov model-based classification of heart valve disease with PCA for dimension reduction , 2012, Eng. Appl. Artif. Intell..

[32]  Jacek M. Zurada,et al.  Normalized Mutual Information Feature Selection , 2009, IEEE Transactions on Neural Networks.

[33]  Divya Tomar,et al.  Feature Selection based Least Square Twin Support Vector Machine for Diagnosis of Heart Disease , 2014, BSBT 2014.

[34]  Jafar Habibi,et al.  A data mining approach for diagnosis of coronary artery disease , 2013, Comput. Methods Programs Biomed..

[35]  N. Stamatis,et al.  Forecasting chaotic cardiovascular time series with an adaptive slope multilayer perceptron neural network , 1999, IEEE Transactions on Biomedical Engineering.

[36]  Usman Qamar,et al.  A Multicriteria Weighted Vote‐Based Classifier Ensemble for Heart Disease Prediction , 2016, Comput. Intell..

[37]  Chang-Dong Wang,et al.  Locally Weighted Ensemble Clustering , 2016, IEEE Transactions on Cybernetics.

[38]  Sangeet Srivastava,et al.  A Hybrid Data Mining Model to Predict Coronary Artery Disease Cases Using Non-Invasive Clinical Data , 2016, Journal of Medical Systems.

[39]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[40]  Steven Shea,et al.  Cardiovascular Event Prediction by Machine Learning: The Multi-Ethnic Study of Atherosclerosis , 2017, Circulation research.

[41]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[42]  Yan Jiang,et al.  Application in Disease Classification based on KPCA-IBA-LSSVM , 2019 .

[43]  Lawrence Carin,et al.  Sparse multinomial logistic regression: fast algorithms and generalization bounds , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[44]  Majid Ghonji Feshki,et al.  Improving the heart disease diagnosis by evolutionary algorithm of PSO and Feed Forward Neural Network , 2016, 2016 Artificial Intelligence and Robotics (IRANOPEN).