Optimal SVM parameters estimation using chaotic accelerated particle swarm optimization for genetic data classification

Microarray studies and gene expression analysis have received tremendous attention over the last few years and provide many promising paths toward the understanding of fundamental questions in biology and medicine. High throughput biological data need to be processed, analyzed, and interpreted to address problems in life sciences. Feature selection (FS) and clustering are among the methods used in the statistical analysis of these data types. Support Vector Machine (SVM) is very popular and powerful among learning methods. In general, SVM has a wide range of applications in pattern recognition and nonlinear fitting. Kernel functions have an important role in classification capability of SVM. In this study, at first, selection of appropriate and efficient features is performed using Fisher ranking method considering classification power. After that, balancing is performed between different classes based on the removal of samples of the majority class, which are far from the decision boundary. Optimization of SVM classifier parameters including penalty factor C and Radial Basis Function (RBF) parameter γ is performed utilizing chaotic accelerated particle swarm optimization (CAPSO). Simulation is applied on genetic data sets. The classification accuracy of the proposed method on the incorporated genetic data sets is more than 94%. Using CAPSO algorithm instead of PSO algorithm for optimizing SVM parameters resulted in improved classification accuracy on genetic data.

[1]  Chih-Fong Tsai,et al.  SVOIS: Support Vector Oriented Instance Selection for text classification , 2011, Inf. Syst..

[2]  T. Poggio,et al.  Prediction of central nervous system embryonal tumour outcome based on gene expression , 2002, Nature.

[3]  Shih-Wei Lin,et al.  Particle swarm optimization for parameter determination and feature selection of support vector machines , 2008, Expert Syst. Appl..

[4]  R. Ali Abbaspour,et al.  Evaluation of Genetic Algorithms for tuning SVM parameters in multi-class problems , 2010, 2010 11th International Symposium on Computational Intelligence and Informatics (CINTI).

[5]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[6]  Rui Zhang,et al.  Semi-Supervised Hyperspectral Image Classification Using Spatio-Spectral Laplacian Support Vector Machine , 2014, IEEE Geoscience and Remote Sensing Letters.

[7]  Massimiliano Pontil,et al.  Support Vector Machines for 3D Object Recognition , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Parag Pruthi,et al.  Modeling Packet Traffic with Chaotic Maps , 1994 .

[9]  Xin-She Yang,et al.  Chaos-enhanced accelerated particle swarm optimization , 2013, Commun. Nonlinear Sci. Numer. Simul..

[10]  Zeping Yang,et al.  An Active Under-Sampling Approach for Imbalanced Data Classification , 2012, 2012 Fifth International Symposium on Computational Intelligence and Design.

[11]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[12]  Fu-song Xiong,et al.  NLSSVM: Least Square Support Vector Machine based on Newton optimization , 2011, 2011 IEEE International Conference on Computer Science and Automation Engineering.

[13]  E. Lander,et al.  Gene expression correlates of clinical prostate cancer behavior. , 2002, Cancer cell.

[14]  Jieping Ye,et al.  Computational and Theoretical Analysis of Null Space and Orthogonal Linear Discriminant Analysis , 2006, J. Mach. Learn. Res..

[15]  Ching Y. Suen,et al.  Automatic model selection for the optimization of SVM kernels , 2005, Pattern Recognit..

[16]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[17]  Gregory Ditzler,et al.  Optimal v-SVM parameter estimation using multi objective evolutionary algorithms , 2010, IEEE Congress on Evolutionary Computation.

[18]  Todd,et al.  Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning , 2002, Nature Medicine.

[19]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Gregory Ditzler,et al.  Optimal-SVM Parameter Estimation using Multi Objective Evolutionary Algorithms , .

[21]  Leandro dos Santos Coelho,et al.  Fuzzy Identification Based on a Chaotic Particle Swarm Optimization Approach Applied to a Nonlinear Yo-yo Motion System , 2007, IEEE Transactions on Industrial Electronics.

[22]  Mohammad Hossein Moattar,et al.  Robust and stable feature selection by integrating ranking methods and wrapper technique in genetic data classification. , 2014, Biochemical and biophysical research communications.

[23]  Xin Yao,et al.  Feature Selection for Microarray Data Using Least Squares SVM and Particle Swarm Optimization , 2005, 2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology.

[24]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[25]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[26]  Xin-She Yang,et al.  Nature-Inspired Metaheuristic Algorithms , 2008 .

[27]  Bernardete Ribeiro,et al.  Distributed Text Classification With an Ensemble Kernel-Based Learning Approach , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[28]  Zhongyi Hu,et al.  A PSO and pattern search based memetic algorithm for SVMs parameters optimization , 2013, Neurocomputing.

[29]  Cheng-Lung Huang,et al.  A distributed PSO-SVM hybrid system with feature selection and parameter optimization , 2008, Appl. Soft Comput..

[30]  Xueying Zhang,et al.  Optimization of SVM Parameters Based on PSO Algorithm , 2009, 2009 Fifth International Conference on Natural Computation.