APPLICATION OF ENSEMBLE ALGORITHM INTEGRATING MULTIPLE CRITERIA FEATURE SELECTION IN CORONARY HEART DISEASE DETECTION

Conventional coronary heart disease (CHD) detection methods are expensive, rely much on doctors’ subjective experience, and some of them have side effects. In order to obtain rapid, high-precision, low-cost, non-invasive detection results, several methods in machine learning were attempted for CHD detection in this paper. The paper adopted multiple evaluation criteria to measure features, combined with heuristic search strategy and seven common classification algorithms to verify the validity and the importance of feature selection (FS) in the Z-Alizadeh Sani CHD dataset. On this basis, a novelty algorithm integrating multiple FS methods into the ensemble algorithm (ensemble algorithm based on multiple feature selection, EA-MFS) was further proposed. The algorithm adopted Bagging approach to increase data diversity, used the aforementioned MFS methods for functional perturbation, employed major voting method to carry out the decision results, and performed selective integration in terms of the difference ...

[1]  Chong-Ho Choi,et al.  Input Feature Selection by Mutual Information Based on Parzen Window , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Sonali Agarwal,et al.  Hybrid Feature Selection Based Weighted Least Squares Twin Support Vector Machine Approach for Diagnosing Breast Cancer, Hepatitis, and Diabetes , 2015, Adv. Artif. Neural Syst..

[3]  Alex Alves Freitas,et al.  A Survey of Evolutionary Algorithms for Decision-Tree Induction , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[4]  Stefan Klein,et al.  Feature Selection Based on the SVM Weight Vector for Classification of Dementia , 2015, IEEE Journal of Biomedical and Health Informatics.

[5]  Carla E. Brodley,et al.  Feature Selection for Unsupervised Learning , 2004, J. Mach. Learn. Res..

[6]  Jafar Habibi,et al.  A data mining approach for diagnosis of coronary artery disease , 2013, Comput. Methods Programs Biomed..

[7]  María José del Jesús,et al.  Genetic feature selection in a fuzzy rule-based classification system learning process for high-dimensional problems , 2001, Inf. Sci..

[8]  D. Levy,et al.  Prediction of coronary heart disease using risk factor categories. , 1998, Circulation.

[9]  Usman Qamar,et al.  IntelliHealth: A medical decision support application using a novel weighted multi-layer classifier ensemble framework , 2016, J. Biomed. Informatics.

[10]  U. Rajendra Acharya,et al.  Automated characterization of coronary artery disease, myocardial infarction, and congestive heart failure using contourlet and shearlet transforms of electrocardiogram signal , 2017, Knowl. Based Syst..

[11]  H. S. Niranjana Murthy,et al.  ANN Model to Predict Coronary Heart Disease Based on Risk Factors , 2013 .

[12]  Li Song,et al.  Recognizing Common CT Imaging Signs of Lung Diseases Through a New Feature Selection Method Based on Fisher Criterion and Genetic Optimization , 2015, IEEE Journal of Biomedical and Health Informatics.

[13]  Sylvain Arlot,et al.  A survey of cross-validation procedures for model selection , 2009, 0907.4728.

[14]  Jafar Habibi,et al.  Coronary artery disease detection using computational intelligence methods , 2016, Knowl. Based Syst..

[15]  John Q. Gan,et al.  A filter-dominating hybrid sequential forward floating search method for feature subset selection in high-dimensional space , 2014, Int. J. Mach. Learn. Cybern..

[16]  Martin Plátek,et al.  A Scale of Context Sensitive Languages: Applications to Natural Language , 1978, Inf. Control..

[17]  S. Marso,et al.  Intravascular ultrasound radiofrequency analysis of coronary atherosclerosis: an emerging technology for the assessment of vulnerable plaque. , 2007, European heart journal.

[18]  Sotiris B. Kotsiantis,et al.  Machine learning: a review of classification and combining techniques , 2006, Artificial Intelligence Review.

[19]  Nor Ashidi Mat Isa,et al.  A GA-based feature selection and parameter optimization of an ANN in diagnosing breast cancer , 2015, Pattern Analysis and Applications.

[20]  Xiao Zhang,et al.  Feature selection in mixed data: A method using a novel fuzzy rough set-based information entropy , 2016, Pattern Recognit..

[21]  Martin Mozina,et al.  Orange: data mining toolbox in python , 2013, J. Mach. Learn. Res..

[22]  Vijander Singh,et al.  Wrapper based wavelet feature optimization for EEG signals , 2012 .

[23]  U. Rajendra Acharya,et al.  Automated characterization and classification of coronary artery disease and myocardial infarction by decomposition of ECG signals: A comparative study , 2017, Inf. Sci..

[24]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[25]  Chris H. Q. Ding,et al.  Minimum Redundancy Feature Selection from Microarray Gene Expression Data , 2005, J. Bioinform. Comput. Biol..

[26]  U. Rajendra Acharya,et al.  Application of higher-order spectra for the characterization of Coronary artery disease using electrocardiogram signals , 2017, Biomed. Signal Process. Control..

[27]  S. S. Iyengar,et al.  An Evaluation of Filter and Wrapper Methods for Feature Selection in Categorical Clustering , 2005, IDA.

[28]  Hari Kusnanto,et al.  Interpretation of Clinical Data Based on C4.5 Algorithm for the Diagnosis of Coronary Heart Disease , 2016, Healthcare informatics research.