Binary Whale Optimization Algorithm for Dimensionality Reduction

Feature selection (FS) was regarded as a global combinatorial optimization problem. FS is used to simplify and enhance the quality of high-dimensional datasets by selecting prominent features and removing irrelevant and redundant data to provide good classification results. FS aims to reduce the dimensionality and improve the classification accuracy that is generally utilized with great importance in different fields such as pattern classification, data analysis, and data mining applications. The main problem is to find the best subset that contains the representative information of all the data. In order to overcome this problem, two binary variants of the whale optimization algorithm (WOA) are proposed, called bWOA-S and bWOA-V. They are used to decrease the complexity and increase the performance of a system by selecting significant features for classification purposes. The first bWOA-S version uses the Sigmoid transfer function to convert WOA values to binary ones, whereas the second bWOA-V version uses a hyperbolic tangent transfer function. Furthermore, the two binary variants introduced here were compared with three famous and well-known optimization algorithms in this domain, such as Particle Swarm Optimizer (PSO), three variants of binary ant lion (bALO1, bALO2, and bALO3), binary Dragonfly Algorithm (bDA) as well as the original WOA, over 24 benchmark datasets from the UCI repository. Eventually, a non-parametric test called Wilcoxon’s rank-sum was carried out at 5% significance to prove the powerfulness and effectiveness of the two proposed algorithms when compared with other algorithms statistically. The qualitative and quantitative results showed that the two introduced variants in the FS domain are able to minimize the selected feature number as well as maximize the accuracy of the classification within an appropriate time.

[1]  Mengjie Zhang,et al.  Particle Swarm Optimization for Feature Selection in Classification: A Multi-Objective Approach , 2013, IEEE Transactions on Cybernetics.

[2]  Siti Zaiton Mohd Hashim,et al.  BMOA: Binary Magnetic Optimization Algorithm , 2012 .

[3]  Aboul Ella Hassanien,et al.  New binary whale optimization algorithm for discrete optimization problems , 2020, Engineering Optimization.

[4]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[5]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[6]  Ferat Sahin,et al.  A survey on feature selection methods , 2014, Comput. Electr. Eng..

[7]  Xin Yao,et al.  A Survey on Evolutionary Computation Approaches to Feature Selection , 2016, IEEE Transactions on Evolutionary Computation.

[8]  Andrew Lewis,et al.  The Whale Optimization Algorithm , 2016, Adv. Eng. Softw..

[9]  Zhihua Cui,et al.  Monarch butterfly optimization , 2015, Neural Computing and Applications.

[10]  Amr Badr,et al.  A binary clonal flower pollination algorithm for feature selection , 2016, Pattern Recognit. Lett..

[11]  Mahmoud Hassaballah,et al.  A novel hybrid Harris hawks optimization and support vector machines for drug design and discovery , 2020, Comput. Chem. Eng..

[12]  Ash A. Alizadeh,et al.  Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling , 2000, Nature.

[13]  Fatma A. Hashim,et al.  A modified Henry gas solubility optimization for solving motif discovery problem , 2019, Neural Computing and Applications.

[14]  Majdi M. Mafarja,et al.  Hybrid Whale Optimization Algorithm with simulated annealing for feature selection , 2017, Neurocomputing.

[15]  Mohamed Amin,et al.  A comprehensive review of moth-flame optimisation: variants, hybrids, and applications , 2020, J. Exp. Theor. Artif. Intell..

[16]  Amir Hossein Alavi,et al.  An improved NSGA-III algorithm with adaptive mutation operator for Big Data optimization problems , 2018, Future Gener. Comput. Syst..

[17]  Li-Yeh Chuang,et al.  Improved binary PSO for feature selection using gene expression data , 2008, Comput. Biol. Chem..

[18]  Mahmoud Hassaballah,et al.  Lévy flight distribution: A new metaheuristic algorithm for solving engineering optimization problems , 2020, Eng. Appl. Artif. Intell..

[19]  Witold Pedrycz,et al.  Solving Fuzzy Job-Shop Scheduling Problem Using DE Algorithm Improved by a Selection Mechanism , 2020, IEEE Transactions on Fuzzy Systems.

[20]  Mohammad-Reza Feizi-Derakhshi,et al.  Feature selection using Forest Optimization Algorithm , 2016, Pattern Recognit..

[21]  Gaige Wang,et al.  Moth search algorithm: a bio-inspired metaheuristic algorithm for global optimization problems , 2016, Memetic Computing.

[22]  Selwyn Piramuthu Evaluating feature selection methods for learning in data mining applications , 2004, Eur. J. Oper. Res..

[23]  Aboul Ella Hassanien,et al.  Binary grey wolf optimization approaches for feature selection , 2016, Neurocomputing.

[24]  Huiling Chen,et al.  Crow Search Algorithm: Theory, Recent Advances, and Applications , 2020, IEEE Access.

[25]  Kashif Hussain,et al.  Optimal Sink Node Placement in Large Scale Wireless Sensor Networks Based on Harris’ Hawk Optimization Algorithm , 2020, IEEE Access.

[26]  Mohamed Amin,et al.  Ant Lion Optimization: Variants, Hybrids, and Applications , 2020, IEEE Access.

[27]  Nabil Neggaz,et al.  An efficient henry gas solubility optimization for feature selection , 2020, Expert Syst. Appl..

[28]  Ying Tan,et al.  Improving Metaheuristic Algorithms With Information Feedback Models , 2019, IEEE Transactions on Cybernetics.

[29]  Mohamed Elhoseny,et al.  Hybrid Harris hawks optimization with cuckoo search for drug design and discovery in chemoinformatics , 2020, Scientific Reports.

[30]  Amir Hossein Gandomi,et al.  Chaotic Krill Herd algorithm , 2014, Inf. Sci..

[31]  David A. Landgrebe,et al.  A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..

[32]  António Gaspar-Cunha,et al.  Using Multiobjective Evolutionary Algorithms in the Optimization of Operating Conditions of Polymer Injection Molding , 2010 .

[33]  Erik Cuevas,et al.  Comparison of Circular Symmetric Low-Pass Digital IIR Filter Design Using Evolutionary Computation Techniques , 2020, Mathematics.

[34]  Aboul Ella Hassanien,et al.  Binary ant lion approaches for feature selection , 2016, Neurocomputing.

[35]  Aboul Ella Hassanien,et al.  Maximizing lifetime of large-scale wireless sensor networks using multi-objective whale optimization algorithm , 2019, Telecommun. Syst..

[36]  Antanas Verikas,et al.  Feature selection with neural networks , 2002, Pattern Recognit. Lett..

[37]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[38]  Seyedali Mirjalili,et al.  Henry gas solubility optimization: A novel physics-based algorithm , 2019, Future Gener. Comput. Syst..

[39]  Huan Liu,et al.  Toward integrating feature selection algorithms for classification and clustering , 2005, IEEE Transactions on Knowledge and Data Engineering.

[40]  Serkan Günal,et al.  Subspace based feature selection for pattern recognition , 2008, Inf. Sci..

[41]  Francesco Palmieri,et al.  GGA: A modified genetic algorithm with gradient-based local search for solving constrained optimization problems , 2021, Inf. Sci..

[42]  Hussain Shareef,et al.  Lightning search algorithm , 2015, Appl. Soft Comput..

[43]  Hossein Nezamabadi-pour,et al.  BGSA: binary gravitational search algorithm , 2010, Natural Computing.

[44]  Kazuyuki Murase,et al.  A new hybrid ant colony optimization algorithm for feature selection , 2012, Expert Syst. Appl..