Improved salp swarm algorithm based on particle swarm optimization for feature selection

Feature selection (FS) is a machine learning process commonly used to reduce the high dimensionality problems of datasets. This task permits to extract the most representative information of high sized pools of data, reducing the computational effort in other tasks as classification. This article presents a hybrid optimization method for the FS problem; it combines the slap swarm algorithm (SSA) with the particle swarm optimization. The hybridization between both approaches creates an algorithm called SSAPSO, in which the efficacy of the exploration and the exploitation steps is improved. To verify the performance of the proposed algorithm, it is tested over two experimental series, in the first one, it is compared with other similar approaches using benchmark functions. Meanwhile, in the second set of experiments, the SSAPSO is used to determine the best set of features using different UCI datasets. Where the redundant or the confusing features are removed from the original dataset while keeping or yielding a better accuracy. The experimental results provide the evidence of the enhancement in the SSAPSO regarding the performance and the accuracy without affecting the computational effort.

[1]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[2]  Aboul Ella Hassanien,et al.  Multi-objective whale optimization algorithm for content-based image retrieval , 2018, Multimedia Tools and Applications.

[3]  Takio Kurita,et al.  Selection of Import Vectors via Binary Particle Swarm Optimization and Cross-Validation for Kernel Logistic Regression , 2007, 2007 International Joint Conference on Neural Networks.

[4]  Mohamed Elhoseny,et al.  Social-spider optimization algorithm for improving ANFIS to predict biochar yield , 2017, 2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT).

[5]  Wei Chang Yeh,et al.  A Hybrid Simplified Swarm Optimization Method for Imbalanced Data Feature Selection , 2017 .

[6]  Russell C. Eberhart,et al.  A new optimizer using particle swarm theory , 1995, MHS'95. Proceedings of the Sixth International Symposium on Micro Machine and Human Science.

[7]  Li-Yeh Chuang,et al.  Tabu Search and Binary Particle Swarm Optimization for Feature Selection Using Microarray Data , 2009, J. Comput. Biol..

[8]  Erik Valdemar Cuevas Jiménez,et al.  A new algorithm inspired in the behavior of the social-spider for constrained optimization , 2014, Expert Syst. Appl..

[9]  Yiming Yang,et al.  A Study of Approaches to Hypertext Categorization , 2002, Journal of Intelligent Information Systems.

[10]  Pei-Chann Chang,et al.  An attribute weight assignment and particle swarm optimization algorithm for medical database classifications , 2012, Comput. Methods Programs Biomed..

[11]  Aboul Ella Hassanien,et al.  Hybrid Swarms Optimization Based Image Segmentation , 2016 .

[12]  Aboul Ella Hassanien,et al.  Hybrid Learning Enhancement of RBF Network with Particle Swarm Optimization , 2009, Foundations of Computational Intelligence.

[13]  Hossam Faris,et al.  Salp Swarm Algorithm: A bio-inspired optimizer for engineering design problems , 2017, Adv. Eng. Softw..

[14]  L. Madin Aspects of jet propulsion in salps , 1990 .

[15]  K. Thangavel,et al.  Mammogram Image Analysis: Bio-inspired Computational Approach , 2011, SocProS.

[16]  Parham Moradi,et al.  Integration of graph clustering with ant colony optimization for feature selection , 2015, Knowl. Based Syst..

[17]  A. Butte,et al.  Microarrays for an Integrative Genomics , 2002 .

[18]  Mohamed Elhoseny,et al.  Prediction of biochar yield using adaptive neuro-fuzzy inference system with particle swarm optimization , 2017, 2017 IEEE PES PowerAfrica.

[19]  Aboul Ella Hassanien,et al.  Hybrid Monkey Algorithm with Krill Herd Algorithm optimization for feature selection , 2015, 2015 11th International Computer Engineering Conference (ICENCO).

[20]  Sun-Yuan Kung,et al.  Feature Selection for Genomic Signal Processing: Unsupervised, Supervised, and Self-Supervised Scenarios , 2010, J. Signal Process. Syst..

[21]  Shengwu Xiong,et al.  Multi-objective Whale Optimization Algorithm for Multilevel Thresholding Segmentation , 2018 .

[22]  Dervis Karaboga,et al.  A powerful and efficient algorithm for numerical function optimization: artificial bee colony (ABC) algorithm , 2007, J. Glob. Optim..

[23]  Huan Liu,et al.  Consistency-based search in feature selection , 2003, Artif. Intell..

[24]  Alper Ekrem Murat,et al.  A discrete particle swarm optimization method for feature selection in binary classification problems , 2010, Eur. J. Oper. Res..

[25]  Xiang Li,et al.  Optimal band selection for hyperspectral data with improved differential evolution , 2015, J. Ambient Intell. Humaniz. Comput..

[26]  Maciej Modrzejewski,et al.  Feature Selection Using Rough Sets Theory , 1993, ECML.

[27]  Mahmoud Al-Ayyoub,et al.  Feature extraction and selection for Arabic tweets authorship authentication , 2017, J. Ambient Intell. Humaniz. Comput..

[28]  G. Yamuna,et al.  HYBRID FIREFLY SWARM INTELLIGENCE BASED FEATURE SELECTION FOR MEDICAL DATA CLASSIFICATION AND SEGMENTATION IN SVD - NSCT DOMAIN. , 2016 .

[29]  Taghi M. Khoshgoftaar,et al.  A review of the stability of feature selection techniques for bioinformatics data , 2012, 2012 IEEE 13th International Conference on Information Reuse & Integration (IRI).

[30]  Loganathan Agilandeeswari,et al.  An intelligent lung cancer diagnosis system using cuckoo search optimization and support vector machine classifier , 2017, Journal of Ambient Intelligence and Humanized Computing.

[31]  Daniel Weihs,et al.  Hydrodynamic advantages of swimming by salp chains , 2017, Journal of The Royal Society Interface.

[32]  K. Thangavel,et al.  Ant colony Optimization for Feature Selection and Classification of Microcalcifications in Digital Mammograms , 2006, 2006 International Conference on Advanced Computing and Communications.

[33]  David E. Goldberg,et al.  Genetic algorithms and Machine Learning , 1988, Machine Learning.

[34]  Andrew Lewis,et al.  Grey Wolf Optimizer , 2014, Adv. Eng. Softw..

[35]  Ron Kohavi,et al.  Feature Subset Selection Using the Wrapper Method: Overfitting and Dynamic Search Space Topology , 1995, KDD.

[36]  Anthony J Richardson,et al.  Rethinking the Role of Salps in the Ocean. , 2016, Trends in ecology & evolution.

[37]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[38]  Larry A. Rendell,et al.  The Feature Selection Problem: Traditional Methods and a New Algorithm , 1992, AAAI.

[39]  Salim Chikhi,et al.  Clustered negative selection algorithm and fruit fly optimization for email spam detection , 2017, Journal of Ambient Intelligence and Humanized Computing.

[40]  Ning Zhong,et al.  Using Rough Sets with Heuristics for Feature Selection , 1999, Journal of Intelligent Information Systems.

[41]  Yiyu Yao,et al.  Information-Theoretic Measures for Knowledge Discovery and Data Mining , 2003 .

[42]  Q. Bone,et al.  Communication between individuals in salp chains. II. Physiology , 1980, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[43]  Gabriele Steidl,et al.  Combined SVM-Based Feature Selection and Classification , 2005, Machine Learning.

[44]  Anil K. Jain,et al.  Dimensionality reduction using genetic algorithms , 2000, IEEE Trans. Evol. Comput..

[45]  Eduardo Gasca,et al.  Eliminating redundancy and irrelevance using a new MLP-based feature selection method , 2006, Pattern Recognit..

[46]  Taher Niknam,et al.  An efficient hybrid approach based on PSO, ACO and k-means for cluster analysis , 2010, Appl. Soft Comput..

[47]  Ahmed A. Ewees,et al.  Improved grasshopper optimization algorithm using opposition-based learning , 2018, Expert Syst. Appl..

[48]  Jinyan Li,et al.  Identifying good diagnostic gene groups from gene expression profiles using the concept of emerging patterns. , 2002 .

[49]  Richard Jensen,et al.  Feature Selection and Linear/Nonlinear Regression Methods for the Accurate Prediction of Glycogen Synthase Kinase-3β Inhibitory Activities , 2009, J. Chem. Inf. Model..

[50]  Ahmad Taher Azar,et al.  Supervised hybrid feature selection based on PSO and rough sets for medical diagnosis , 2014, Comput. Methods Programs Biomed..

[51]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[52]  Gang Wang,et al.  Towards an optimal support vector machine classifier using a parallel particle swarm optimization strategy , 2014, Appl. Math. Comput..

[53]  Songfeng Lu,et al.  Galaxy images classification using hybrid brain storm optimization with moth flame optimization , 2018, Journal of Astronomical Telescopes, Instruments, and Systems.

[54]  Pengfei Duan,et al.  A Hybrid Method of Sine Cosine Algorithm and Differential Evolution for Feature Selection , 2017, ICONIP.

[55]  Shichao Kan,et al.  Compressed sensing based feature fusion for image retrieval , 2018, Journal of Ambient Intelligence and Humanized Computing.

[56]  Labiba Souici-Meslati,et al.  Hybrid ACO-PSO Based Approaches for Feature Selection , 2016 .

[57]  Sharifah Mumtazah Syed Ahmad,et al.  Smile detection using hybrid face representation , 2016, J. Ambient Intell. Humaniz. Comput..

[58]  Qinghua Hu,et al.  Improved Feature Selection Algorithm Based on SVM and Correlation , 2006, ISNN.

[59]  Aboul Ella Hassanien,et al.  Chaotic multi-verse optimizer-based feature selection , 2017, Neural Computing and Applications.

[60]  Hongbin Zhang,et al.  Feature selection using tabu search method , 2002, Pattern Recognit..

[61]  Hao Dong,et al.  An improved particle swarm optimization for feature selection , 2011 .

[62]  Marcel J. T. Reinders,et al.  Random subspace method for multivariate feature selection , 2006, Pattern Recognit. Lett..

[63]  Songfeng Lu,et al.  Feature Selection Based on Improved Runner-Root Algorithm Using Chaotic Singer Map and Opposition-Based Learning , 2017, ICONIP.

[64]  Xin-She Yang,et al.  A wrapper approach for feature selection based on Bat Algorithm and Optimum-Path Forest , 2014, Expert Syst. Appl..

[65]  R. Annamalai Saravanan,et al.  Enhanced text mining approach based on ontology for clustering research project selection , 2017 .