A Hybrid Swarm Intelligence Based Feature Selection Algorithm for High Dimensional Datasets

High-dimensional datasets pose a critical obstacle in machine learning. Feature selection overcomes this obstacle by eliminating redundant and irrelevant features from the dataset to increase the robustness of learning algorithms. This paper introduces a binary version of a hybrid swarm intelligence approach as a wrapper method for feature selection that combines the strengths of the grey wolf and particle swarm optimizers. This approach is named Improved Binary Grey Wolf Optimization (IBGWO). The original version of this hybrid approach was proposed in the literature with a continuous search space as a low-level hybrid form, which merges the functionality of the two optimizers rather than running them one after the other. Two types of transfer functions, S-shaped and V-shaped, are applied in this work to convert continuous values into binary ones. Nine high-dimensional, small-instance medical datasets are used to assess the proposed approach. The experimental results demonstrate that IBGWO based on the S-shaped function (IBGWO-S) outperforms the binary particle swarm and binary grey wolf optimizers on six out of nine datasets in terms of classification accuracy and fitness values. IBGWO-S selects the fewest features on 100% of the datasets. IBGWO based on the V-shaped function (IBGWO-V) outperforms the binary particle swarm and binary grey wolf optimizers on five datasets in terms of classification accuracy and fitness values. The results indicate that IBGWO-S outperforms IBGWO-V on all studied evaluation metrics. The results also show that IBGWO-S and IBGWO-V outperform eight meta-heuristics known in the literature in selecting the relevant features with acceptable classification accuracy.
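
The role of the two transfer-function families can be illustrated with a minimal sketch. The abstract does not state which formulas are used, so the sigmoid (as an S-shaped function) and |tanh| (as a V-shaped function) are assumptions here, as are the update rules and the helper names `s_shaped`, `v_shaped`, `binarize_s`, and `binarize_v`. In this sketch, the S-shaped value is read as the probability that a feature bit is set to 1, while the V-shaped value is read as the probability that the previous bit is flipped.

```python
import math
import random


def s_shaped(x):
    """Sigmoid: maps a continuous value to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def v_shaped(x):
    """|tanh|: maps a continuous value to a probability in [0, 1)."""
    return abs(math.tanh(x))


def binarize_s(position, rng):
    """Assumed S-shaped rule: set each bit to 1 with probability s_shaped(x)."""
    return [1 if rng.random() < s_shaped(x) else 0 for x in position]


def binarize_v(position, previous_bits, rng):
    """Assumed V-shaped rule: flip each previous bit with probability v_shaped(x)."""
    return [1 - b if rng.random() < v_shaped(x) else b
            for x, b in zip(position, previous_bits)]


rng = random.Random(0)                       # fixed seed for repeatability
position = [-6.0, -0.5, 0.0, 0.5, 6.0]       # hypothetical continuous position
previous_bits = [1, 0, 1, 0, 1]              # hypothetical current feature mask

mask_s = binarize_s(position, rng)
mask_v = binarize_v(position, previous_bits, rng)
print("S-shaped mask:", mask_s)
print("V-shaped mask:", mask_v)
```

In a wrapper setting, a bit equal to 1 would mark the corresponding feature as selected before the classifier is evaluated. Note that under the V-shaped rule a position value of zero never flips the bit, whereas under the S-shaped rule it gives an even chance of selecting the feature.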
