Improving nature-inspired algorithms for feature selection

Selecting highly discriminative features from a whole feature set has become an important research area. Not only can this improve the performance of classification, but it can also decrease the cost of system diagnoses when a large number of noisy, redundant features are excluded. Binary nature-inspired algorithms have been used as a feature selection procedure. Each of these algorithms requires an initial population to be set, and the appropriateness of the initialization plays a key role in the final result. At the stage of population initialization, the positions are initialized randomly by uniform distribution which leads to a high variability of the classification results. To avoid the randomness of the population generated and to take into account the relation between each feature and the class variable, parametric and non-parametric methods, such as the t-test and Wilcoxon rank sum test are proposed as an initial population in the binary nature-inspired algorithms. This modification can help these binary algorithms to enhance global exploration and local exploitation or exhibit a slow convergence speed compared with the standard procedure. The binary bat, gray wolf, and whale algorithms are considered. The performance of our proposed methods is evaluated on ten publicly available datasets with high-dimensional and low-dimensional data. The experimental results and statistical analysis confirm that the performance of our proposed methods compared with the standard algorithms is better in terms of classification accuracy, the number of selected features, running time, and feature selection stability.

[1]  Pei Hu,et al.  Improved Binary Grey Wolf Optimizer and Its application for feature selection , 2020, Knowl. Based Syst..

[2]  Jingjing Ma,et al.  A Novel Feature Selection Method for High-Dimensional Biomedical Data Based on an Improved Binary Clonal Flower Pollination Algorithm , 2019, Human Heredity.

[3]  Hossam Faris,et al.  Binary grasshopper optimisation algorithm approaches for feature selection problems , 2019, Expert Syst. Appl..

[4]  Gai-Ge Wang,et al.  Binary Moth Search Algorithm for Discounted {0-1} Knapsack Problem , 2018, IEEE Access.

[5]  Zakariya Yahya Algamal,et al.  Feature selection using particle swarm optimization-based logistic regression model , 2018 .

[6]  Anupam Shukla,et al.  A survey of nature-inspired algorithms for feature selection to identify Parkinson's disease , 2017, Comput. Methods Programs Biomed..

[7]  Sanjoy Das,et al.  A Survey on Nature-Inspired Optimization Algorithms and Their Application in Image Enhancement Domain , 2018, Archives of Computational Methods in Engineering.

[8]  Haithem Taha Mohammad Ali,et al.  QSAR classification model for diverse series of antifungal agents based on improved binary differential search algorithm , 2019, SAR and QSAR in environmental research.

[9]  S. Karthikeyan,et al.  A hybrid discrete firefly algorithm for multi-objective flexible job shop scheduling problem with limited resource constraints , 2014, The International Journal of Advanced Manufacturing Technology.

[10]  Sankalap Arora,et al.  Binary butterfly optimization approaches for feature selection , 2019, Expert Syst. Appl..

[11]  Chenye Qiu,et al.  A novel multi-swarm particle swarm optimization for feature selection , 2019, Genetic Programming and Evolvable Machines.

[12]  Dongsheng Zhao,et al.  Chaotic binary bat algorithm for analog test point selection , 2015, Analog Integrated Circuits and Signal Processing.

[13]  Sankalap Arora,et al.  Chaotic grey wolf optimization algorithm for constrained optimization problems , 2018, J. Comput. Des. Eng..

[14]  Zakariya Yahya Algamal,et al.  Feature Selection Using Different Transfer Functions for Binary Bat Algorithm , 2020 .

[15]  Manik Sharma,et al.  A Comprehensive Analysis of Nature-Inspired Meta-Heuristic Techniques for Feature Selection Problem , 2020 .

[16]  Maarouk Toufik Messaoud,et al.  A new binary grasshopper optimization algorithm for feature selection problem , 2019, J. King Saud Univ. Comput. Inf. Sci..

[17]  Andrew Lewis,et al.  S-shaped versus V-shaped transfer functions for binary Particle Swarm Optimization , 2013, Swarm Evol. Comput..

[18]  Mohammed Azmi Al-Betar,et al.  A novel gene selection method using modified MRMR and hybrid bat-inspired algorithm with β-hill climbing , 2018, Applied Intelligence.

[19]  Rohayanti Hassan,et al.  Selection and classification of gene expression in autism disorder: Use of a combination of statistical filters and a GBPSO-SVM algorithm , 2017, PloS one.

[20]  Sankalap Arora,et al.  Chaotic whale optimization algorithm , 2018, J. Comput. Des. Eng..

[21]  Saeid Barshandeh,et al.  A new hybrid chaotic atom search optimization based on tree-seed algorithm and Levy flight for solving optimization problems , 2020, Engineering computations.

[22]  Andrew Lewis,et al.  The Whale Optimization Algorithm , 2016, Adv. Eng. Softw..

[23]  Mohamed A. Tawhid,et al.  Feature selection based on rough set approach, wrapper approach, and binary whale optimization algorithm , 2020, Int. J. Mach. Learn. Cybern..

[24]  Zengyou He,et al.  Stable Feature Selection for Biomarker Discovery , 2010, Comput. Biol. Chem..

[25]  Zakariya Yahya Algamal,et al.  Improving whale optimization algorithm for feature selection with a time-varying transfer function , 2021, Numerical Algebra, Control & Optimization.

[26]  Ronghua Shang,et al.  Sparse and low-redundant subspace learning-based dual-graph regularized robust feature selection , 2020, Knowl. Based Syst..

[27]  Dinesh Kumar,et al.  Binary whale optimization algorithm and its application to unit commitment problem , 2018, Neural Computing and Applications.

[28]  Said Jadid Abdul Kadir,et al.  Binary Optimization Using Hybrid Grey Wolf Optimization for Feature Selection , 2019, IEEE Access.

[29]  Muhammad Hisyam Lee,et al.  A two-stage sparse logistic regression for optimal gene selection in high-dimensional microarray data classification , 2018, Advances in Data Analysis and Classification.

[30]  Kaiping Luo,et al.  A binary grey wolf optimizer for the multidimensional knapsack problem , 2019, Appl. Soft Comput..

[31]  Aboul Ella Hassanien,et al.  Binary grey wolf optimization approaches for feature selection , 2016, Neurocomputing.

[32]  Ronghua Shang,et al.  Local discriminative based sparse subspace learning for feature selection , 2019, Pattern Recognit..

[33]  Andrew Lewis,et al.  Grey Wolf Optimizer , 2014, Adv. Eng. Softw..

[34]  Xin-She Yang,et al.  A New Metaheuristic Bat-Inspired Algorithm , 2010, NICSO.

[35]  Ashraf Darwish,et al.  A New Chaotic Whale Optimization Algorithm for Features Selection , 2018, Journal of Classification.

[36]  K. Kathiravan,et al.  Assessment of ramping cost for independent power producers using firefly algorithm and gray wolf optimization , 2018, Cluster Computing.

[37]  Ying Huang,et al.  Chaos enhanced grey wolf optimization wrapped ELM for diagnosis of paraquat-poisoned patients , 2019, Comput. Biol. Chem..

[38]  Hossam Faris,et al.  Efficient Hybrid Nature-Inspired Binary Optimizers for Feature Selection , 2019, Cognitive Computation.

[39]  Xin-She Yang,et al.  Binary bat algorithm , 2013, Neural Computing and Applications.

[40]  Ronghua Shang,et al.  Non-Negative Spectral Learning and Sparse Regression-Based Dual-Graph Regularized Feature Selection , 2018, IEEE Transactions on Cybernetics.

[41]  Omar Saber Qasim,et al.  Feature selection based on chaotic binary black hole algorithm for data classification , 2020 .

[42]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[43]  Arun Kumar Sangaiah,et al.  A modified nature inspired meta-heuristic whale optimization algorithm for solving 0–1 knapsack problem , 2019, Int. J. Mach. Learn. Cybern..

[44]  R. M. Rizk-Allah,et al.  New binary bat algorithm for solving 0–1 knapsack problem , 2017, Complex & Intelligent Systems.

[46]  Jan Kalina,et al.  Classification methods for high-dimensional genetic data , 2014 .

[47]  Stjepan Oreski,et al.  Genetic algorithm-based heuristic for feature selection in credit risk assessment , 2014, Expert Syst. Appl..

[48]  Aboul Ella Hassanien,et al.  Firefly Optimization Algorithm for Feature Selection , 2015, BCI.

[49]  Haithem Taha Mohammad Ali,et al.  High-dimensional QSAR/QSPR classification modeling based on improving pigeon optimization algorithm , 2020 .

[50]  Qiang Li,et al.  An Enhanced Grey Wolf Optimization Based Feature Selection Wrapped Kernel Extreme Learning Machine for Medical Diagnosis , 2017, Comput. Math. Methods Medicine.