A comparison of feature selection models utilizing binary particle swarm optimization and genetic algorithm in determining coronary artery disease using support vector machine

The aim of this study is to search the efficiency of binary particle swarm optimization (BPSO) and genetic algorithm (GA) techniques as feature selection models on determination of coronary artery disease (CAD) existence based upon exercise stress testing (EST) data. Also, increasing the classification performance of the classifier is another aim. The dataset having 23 features was obtained from patients who had performed EST and coronary angiography. Support vector machine (SVM) with k-fold cross-validation method is used as the classifier system of CAD existence in both BPSO and GA feature selection techniques. Classification results of feature selection technique using BPSO and GA are compared with each other and also with the results of the whole features using simple SVM model. The results show that feature selection technique using BPSO is more successful than feature selection technique using GA on determining CAD. Also with the new dataset composed by feature selection technique using BPSO, this study reached more accurate values of success on CAD existence research with more little complexity of classifier system and more little classification time compared with whole features used SVM.

[1]  Hong Zhu,et al.  Feature Selection for Microarray Data Analysis Using Mutual Information and Rough Set Theory , 2006, AIAI.

[2]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[3]  Senjian An,et al.  Fast cross-validation algorithms for least squares support vector machine and kernel ridge regression , 2007, Pattern Recognit..

[4]  J. Shao Linear Model Selection by Cross-validation , 1993 .

[5]  Li-Yeh Chuang,et al.  Improved binary PSO for feature selection using gene expression data , 2008, Comput. Biol. Chem..

[6]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[7]  Emre Çomak,et al.  A decision support system based on support vector machines for diagnosis of the heart valve diseases , 2007, Comput. Biol. Medicine.

[8]  Mevlut Ture,et al.  Comparing performances of logistic regression, classification and regression tree, and neural networks for predicting coronary artery disease , 2008, Expert Syst. Appl..

[9]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[10]  David Sprigings,et al.  Exercise stress testing , 2002 .

[11]  Hua-chao Yang,et al.  Research into a Feature Selection Method for Hyperspectral Imagery Using PSO and SVM , 2007 .

[12]  Harun Uguz,et al.  A biomedical system based on fuzzy discrete hidden Markov model for the diagnosis of the brain diseases , 2008, Expert Syst. Appl..

[13]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[14]  Russell C. Eberhart,et al.  A discrete binary version of the particle swarm algorithm , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[15]  Thomas H. Cormen,et al.  Introduction to algorithms [2nd ed.] , 2001 .

[16]  Zhao Zhidong Noninvasive Diagnosis of Coronary Artery Disease Based on Instantaneous Frequency of Diastolic Murmurs and SVM , 2005, 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference.

[17]  F. Fernández‐Avilés,et al.  Selection of the optimal stress test for the diagnosis of coronary artery disease , 1998, Heart.

[18]  Kuan-Yu Chen,et al.  A hybrid SARIMA and support vector machines in forecasting the production values of the machinery industry in Taiwan , 2007, Expert Syst. Appl..

[19]  Chartchalerm Isarankura-Na-Ayudhya,et al.  Identification of ischemic heart disease via machine learning analysis on magnetocardiograms , 2008, Comput. Biol. Medicine.

[20]  Bhaskar D. Kulkarni,et al.  Support vector classification with parameter tuning assisted by agent-based technique , 2004, Comput. Chem. Eng..

[21]  P. Wolf,et al.  Heart disease and stroke statistics--2006 update: a report from the American Heart Association Statistics Committee and Stroke Statistics Subcommittee. , 2006, Circulation.

[22]  Thierry Dutoit,et al.  Computerized screening of children congenital heart diseases , 2008, Comput. Methods Programs Biomed..

[23]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[24]  Clive Kelly Malignancies in Rheumatology , 2002 .