An Efficient Marine Predators Algorithm for Feature Selection

Feature Selection (F.S.) reduces the number of features by removing unnecessary, redundant, and noisy information while keeping a relatively decent classification accuracy. F.S. can be considered an optimization problem. As the problem is challenging and there are many local solutions, stochastic optimization algorithms may be beneficial. This paper proposes a novel approach to dimension reduction in feature selection. As a seminal attempt, this work uses binary variants of the recent Marine Predators Algorithm (MPA) to select the optimal feature subset to improve classification accuracy. MPA is a new and novel nature-inspired metaheuristic. This research proposes an algorithm that is a hybridization between MPA and k-Nearest Neighbors (k-NN) called MPA-KNN. K-Nearest Neighbors (k-NN) is used to evaluate the selected features on medical datasets with feature sizes ranging from tiny to massive. The proposed methods are evaluated on 18 well-known UCI medical dataset benchmarks and compared with eight well-regarded metaheuristic wrapper-based approaches. The core exploratory and exploitative processes are adapted in MPA to select the optimal and meaningful features for achieving the most accurate classification. The results show that the proposed MPA-KNN approach had a remarkable capability to select the optimal and significant features. It performed better than the well-established metaheuristic algorithms we tested. The algorithms we used for comparison are Grey Wolf Optimizer (GWO), MothFlame Optimization Algorithm (MFO), Sine Cosine Algorithm (SCA), Whale Optimization Algorithm (WOA), Slap Swarm Algorithm (SSA), Butterfly Optimization Algorithm (BFO), and Harris Hawks Optimization (HHO). This paper is the first work that implements MPA for Feature Selection problems. The results ensure that the proposed MPA-KNN approach has a remarkable capability to select the optimal and significant features and performed better than several metaheuristic algorithms. MPA-KNN achieves the best averages accuracy, Sensitivity, and Specificity rates of all datasets.

[1]  Russell C. Eberhart,et al.  A discrete binary version of the particle swarm algorithm , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[2]  R. Mantegna,et al.  Fast, accurate algorithm for numerical simulation of Lévy stable stochastic processes. , 1994, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[3]  Marko Robnik-Sikonja,et al.  Theoretical and Empirical Analysis of ReliefF and RReliefF , 2003, Machine Learning.

[4]  Anupam Shukla,et al.  A survey of nature-inspired algorithms for feature selection to identify Parkinson's disease , 2017, Comput. Methods Programs Biomed..

[5]  Hossam Faris,et al.  Time-varying hierarchical chains of salps with random weight networks for feature selection , 2020, Expert Syst. Appl..

[6]  Robertas Damasevicius,et al.  COVID-19 image classification using deep features and fractional-order marine predators algorithm , 2020, Scientific Reports.

[7]  Hossam Faris,et al.  Efficient Hybrid Nature-Inspired Binary Optimizers for Feature Selection , 2019, Cognitive Computation.

[8]  G. Viswanathan,et al.  Lévy flights and superdiffusion in the context of biological encounters and random searches , 2008 .

[9]  Satvir Singh,et al.  An improved butterfly optimization algorithm with chaos , 2017, J. Intell. Fuzzy Syst..

[10]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[11]  Ron Kohavi,et al.  Feature Selection for Knowledge Discovery and Data Mining , 1998 .

[12]  Seyedali Mirjalili,et al.  A Review of Grey Wolf Optimizer-Based Feature Selection Methods for Classification , 2019, Algorithms for Intelligent Systems.

[13]  Sadoullah Ebrahimnejad,et al.  Emperor Penguins Colony: a new metaheuristic algorithm for optimization , 2019, Evolutionary Intelligence.

[14]  Abdul Rahim Abdullah,et al.  A New Co-Evolution Binary Particle Swarm Optimization with Multiple Inertia Weight Strategy for Feature Selection , 2019, Informatics.

[15]  Majdi M. Mafarja,et al.  Hybrid Whale Optimization Algorithm with simulated annealing for feature selection , 2017, Neurocomputing.

[16]  Nabil Neggaz,et al.  An efficient henry gas solubility optimization for feature selection , 2020, Expert Syst. Appl..

[17]  Byung Ro Moon,et al.  Hybrid Genetic Algorithms for Feature Selection , 2004, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Xin-She Yang,et al.  Firefly Algorithm, Lévy Flights and Global Optimization , 2010, SGAI Conf..

[19]  S. C. Neoh,et al.  A Micro-GA Embedded PSO Feature Selection Approach to Intelligent Facial Emotion Recognition , 2017, IEEE Transactions on Cybernetics.

[20]  Feiping Nie,et al.  Efficient and Robust Feature Selection via Joint ℓ2, 1-Norms Minimization , 2010, NIPS.

[21]  Lucija Brezocnik Feature selection for classification using particle swarm optimization , 2017, IEEE EUROCON 2017 -17th International Conference on Smart Technologies.

[22]  Amir H. Gandomi,et al.  Marine Predators Algorithm: A nature-inspired metaheuristic , 2020, Expert Syst. Appl..

[23]  Mohammed A A Al-Qaness,et al.  Marine Predators Algorithm for Forecasting Confirmed Cases of COVID-19 in Italy, USA, Iran and Korea , 2020, International journal of environmental research and public health.

[24]  Taher A. Shehabeldeen,et al.  Utilization of Random Vector Functional Link integrated with Marine Predators Algorithm for tensile behavior prediction of dissimilar friction stir welded aluminum alloy joints , 2020 .

[25]  Siddhartha Bhattacharyya,et al.  S-shaped Binary Whale Optimization Algorithm for Feature Selection , 2019 .

[26]  Mohammad Alshinwan,et al.  Moth–flame optimization algorithm: variants and applications , 2019, Neural Computing and Applications.

[27]  Hossam Faris,et al.  Salp Swarm Algorithm: A bio-inspired optimizer for engineering design problems , 2017, Adv. Eng. Softw..

[28]  Stjepan Oreski,et al.  Genetic algorithm-based heuristic for feature selection in credit risk assessment , 2014, Expert Syst. Appl..

[29]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[30]  Tongyi Zheng,et al.  An Improved Squirrel Search Algorithm for Optimization , 2019, Complex..

[31]  A. Rezaee Jordehi,et al.  An efficient chaotic water cycle algorithm for optimization tasks , 2015, Neural Computing and Applications.

[32]  Kedar Nath Das,et al.  A memory based differential evolution algorithm for unconstrained optimization , 2016, Appl. Soft Comput..

[33]  Said Jadid Abdul Kadir,et al.  Binary Optimization Using Hybrid Grey Wolf Optimization for Feature Selection , 2019, IEEE Access.

[34]  H. Stanley,et al.  Optimizing the success of random searches , 1999, Nature.

[35]  Mohamed Abd Elaziz,et al.  Modeling of friction stir welding process using adaptive neuro-fuzzy inference system integrated with harris hawks optimizer , 2019, Journal of Materials Research and Technology.

[36]  Ram Rup Sarkar,et al.  Binary Genetic Swarm Optimization: A Combination of GA and PSO for Feature Selection , 2019, J. Intell. Syst..

[37]  H. L. Dryden,et al.  Investigations on the Theory of the Brownian Movement , 1957 .

[38]  Nasser R. Sabar,et al.  Rank based binary particle swarm optimisation for feature selection in classification , 2018, ICFNDS.

[39]  Yafei Zhang,et al.  Dynamic Adaboost learning with feature selection based on parallel genetic algorithm for image annotation , 2010, Knowl. Based Syst..

[40]  Naindar Afdanny,et al.  Killer Whale Algorithm: An Algorithm Inspired by the Life of Killer Whale , 2017 .

[41]  Dan Boneh,et al.  On genetic algorithms , 1995, COLT '95.

[42]  Hamza Turabieh,et al.  Enhanced Binary Moth Flame Optimization as a Feature Selection Algorithm to Predict Software Fault Prediction , 2020, IEEE Access.

[43]  Seyedali Mirjalili,et al.  Dynamic Butterfly Optimization Algorithm for Feature Selection , 2020, IEEE Access.

[44]  Hossam Faris,et al.  An efficient binary Salp Swarm Algorithm with crossover scheme for feature selection problems , 2018, Knowl. Based Syst..

[45]  Jae-Hyun Seo,et al.  Feature Selection for Very Short-Term Heavy Rainfall Prediction Using Evolutionary Computation , 2014 .

[46]  Silvia Casado Yusta,et al.  Different metaheuristic strategies to solve the feature selection problem , 2009, Pattern Recognit. Lett..

[47]  Mohamed Abd Elaziz,et al.  A review on meta-heuristics methods for estimating parameters of solar cells , 2019, Journal of Power Sources.

[48]  Kenji Suzuki,et al.  Binary coordinate ascent: An efficient optimization technique for feature subset selection for machine learning , 2016, Knowl. Based Syst..

[49]  John David Filmalter,et al.  First Descriptions of the Behavior of Silky Sharks, Carcharhinus Falciformis, Around Drifting Fish Aggregating Devices in the Indian Ocean , 2011 .

[50]  Dr.E. Ilavarasan,et al.  A Novel Hybrid Algorithm to Classify Spam Profiles in Twitter , 2020 .

[51]  Dalia Yousri,et al.  An Improved Marine Predators Algorithm With Fuzzy Entropy for Multi-Level Thresholding: Real World Example of COVID-19 CT Image Segmentation , 2020, IEEE Access.

[52]  Nicolas E. Humphries,et al.  Environmental context explains Lévy and Brownian movement patterns of marine predators , 2010, Nature.

[53]  Shulin Wang,et al.  Feature selection in machine learning: A new perspective , 2018, Neurocomputing.

[54]  Wang Bao,et al.  Integration of unsupervised and supervised machine learning algorithms for credit risk assessment , 2019, Expert Syst. Appl..

[55]  Huan Liu,et al.  Chi2: feature selection and discretization of numeric attributes , 1995, Proceedings of 7th IEEE International Conference on Tools with Artificial Intelligence.

[56]  Yanjiao Wang,et al.  An Improved Squirrel Search Algorithm for Global Function Optimization , 2019, Algorithms.

[57]  Seyed Mohammad Mirjalili,et al.  Whale optimization approaches for wrapper feature selection , 2018, Appl. Soft Comput..

[58]  Mohamed H. Haggag,et al.  A novel chaotic salp swarm algorithm for global optimization and feature selection , 2018, Applied Intelligence.

[59]  Ravi Shankar,et al.  A Firefly Algorithm Based Wrapper-Penalty Feature Selection Method for Cancer Diagnosis , 2018, ICCSA.

[60]  Frederic Bartumeus,et al.  ANIMAL SEARCH STRATEGIES: A QUANTITATIVE RANDOM‐WALK ANALYSIS , 2005 .

[61]  Huiling Chen,et al.  Slime mould algorithm: A new method for stochastic optimization , 2020, Future Gener. Comput. Syst..

[62]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[63]  Nabil Neggaz,et al.  Boosting salp swarm algorithm by sine cosine algorithm and disrupt operator for feature selection , 2020, Expert Syst. Appl..

[64]  Mahmoud Hassaballah,et al.  A novel hybrid Harris hawks optimization and support vector machines for drug design and discovery , 2020, Comput. Chem. Eng..

[65]  Dario Izzo,et al.  A survey on artificial intelligence trends in spacecraft guidance dynamics and control , 2018, Astrodynamics.

[66]  Aboul Ella Hassanien,et al.  Binary ant lion approaches for feature selection , 2016, Neurocomputing.

[67]  Hossam Faris,et al.  Evolving Support Vector Machines using Whale Optimization Algorithm for spam profiles detection on online social networks in different lingual contexts , 2018, Knowl. Based Syst..

[68]  Nicolas E. Humphries,et al.  Scaling laws of marine predator search behaviour , 2008, Nature.

[69]  Rossitza Setchi,et al.  Feature selection using Joint Mutual Information Maximisation , 2015, Expert Syst. Appl..

[70]  Myong Kee Jeong,et al.  An evolutionary algorithm with the partial sequential forward floating search mutation for large-scale feature selection problems , 2015, J. Oper. Res. Soc..

[71]  Mohamed A. Tawhid,et al.  Hybrid Binary Bat Enhanced Particle Swarm Optimization Algorithm for solving feature selection problems , 2018 .

[72]  Xuelong Li,et al.  Unsupervised Feature Selection via Adaptive Multimeasure Fusion , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[73]  Yi Yang,et al.  Semisupervised Feature Selection via Spline Regression for Video Semantic Recognition , 2015, IEEE Transactions on Neural Networks and Learning Systems.