A Novel Feature Selection Method Based on Salp Swarm Algorithm

Biomedical and clinical data usually contain some redundant and irrelevant features, which may lead to misleading and over-fitting problems in the process of modeling algorithms. In order to effectively remove irrelevant or redundant features, the use of feature selection methods can reduce the number of features, improve the accuracy of the model, and reduce the running time. In recent years, Wrapper-based feature selection algorithms have received widespread attention because they can obtain better accuracy. This paper uses a wrapper feature selection algorithm FS_SSA based on Salp swarm. In the FS_SSA algorithm, the position of the follower salps is updated by the relative position of the Salp. The followers gradually moves to the leading Salp. The gradual movement of the follower salps can make the Salp Swarm Algorithm not easy to fall into a local optimal state. The two behaviors of exploration and development in subset searches are balanced, and the search process of feature subsets is prevented from falling into the local optimum. Experimental results based on public medical data sets show that the FS_SSA has better classification performance than other methods.

[1]  Bin Hu,et al.  Feature Selection for Optimized High-Dimensional Biomedical Data Using an Improved Shuffled Frog Leaping Algorithm , 2018, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[2]  S. P. Shantharajah,et al.  An optimized feature selection based on genetic approach and support vector machine for heart disease , 2018, Cluster Computing.

[3]  Richard Millham,et al.  Elitist Binary Wolf Search Algorithm for Heuristic Feature Selection in High-Dimensional Bioinformatics Datasets , 2017, Scientific Reports.

[4]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[5]  Diego Oliva,et al.  An improved runner-root algorithm for solving feature selection problems based on rough sets and neighborhood rough sets , 2020, Appl. Soft Comput..

[6]  Boran Sekeroglu,et al.  Lung Cancer Incidence Prediction Using Machine Learning Algorithms , 2020 .

[7]  Zhihong Man,et al.  Classification of microarray datasets using finite impulse response extreme learning machine for cancer diagnosis , 2011, IECON 2011 - 37th Annual Conference of the IEEE Industrial Electronics Society.

[8]  Chaokun Yan,et al.  Hybrid binary Coral Reefs Optimization algorithm with Simulated Annealing for Feature Selection in high-dimensional biomedical datasets , 2019, Chemometrics and Intelligent Laboratory Systems.

[9]  Aboul Ella Hassanien,et al.  Binary ant lion approaches for feature selection , 2016, Neurocomputing.

[10]  Amr Badr,et al.  A binary clonal flower pollination algorithm for feature selection , 2016, Pattern Recognit. Lett..

[11]  Shantharajah S. Periyasamy,et al.  An optimized feature selection based on genetic approach and support vector machine for heart disease , 2019, Clust. Comput..

[12]  Asif Ekbal,et al.  Joint model for feature selection and parameter optimization coupled with classifier ensemble in chemical mention recognition , 2015, Knowl. Based Syst..

[13]  Abdel Latif Abu Dalhoum,et al.  Evaluation Feature Selection Technique on Classification by Using Evolutionary ELM Wrapper Method with Features Priorities , 2021 .

[14]  Hossam Faris,et al.  Salp Swarm Algorithm: A bio-inspired optimizer for engineering design problems , 2017, Adv. Eng. Softw..

[15]  Huan Liu,et al.  Manipulating Data and Dimension Reduction Methods: Feature Selection , 2009, Encyclopedia of Complexity and Systems Science.