A novel multi-objective forest optimization algorithm for wrapper feature selection

Abstract Feature selection is one of the important techniques of dimensionality reduction in data preprocessing because datasets generally have redundant and irrelevant features that adversely affect the performance and complexity of classification models. Feature selection has two main objectives, i.e., reducing the number of features and increasing classification performance due to its inherent nature. In this paper, we propose a multi-objective feature selection algorithm based on forest optimization algorithm (FOA) using the archive, grid, and region-based selection concepts. For this purpose, two versions of the proposed algorithm are developed using continuous and binary representations. The performance of the proposed algorithms is investigated on nine UCI datasets and two microarray datasets. Next, the obtained results are compared with seven traditional single-objective and five multi-objective methods. Based on the results, both proposed algorithms have reached the same performance or even outperformed the single-objective methods. Compared with other multi-objective algorithms, MOFOA with continuous representation has managed to reduce the classification error in most cases by selecting less number of features than other methods.

[1]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[2]  Janez Brest,et al.  Multi-Objective Differential Evolution for feature selection in Facial Expression Recognition systems , 2017, Expert Syst. Appl..

[3]  Keinosuke Fukunaga,et al.  A Branch and Bound Algorithm for Feature Subset Selection , 1977, IEEE Transactions on Computers.

[4]  Mengjie Zhang,et al.  Multi-objective particle swarm optimisation (PSO) for feature selection , 2012, GECCO '12.

[5]  Mikkel T. Jensen,et al.  Reducing the run-time complexity of multiobjective EAs: The NSGA-II and other algorithms , 2003, IEEE Trans. Evol. Comput..

[6]  Sai Wang,et al.  A Hybrid Genetic Algorithm With Wrapper-Embedded Approaches for Feature Selection , 2018, IEEE Access.

[7]  S. Kanmani,et al.  A hybrid algorithm using ant and bee colony optimization for feature selection and classification (AC-ABC Hybrid) , 2017, Swarm Evol. Comput..

[8]  A. R. Baig,et al.  Multi-Objective Feature Subset Selection using Non-dominated Sorting Genetic Algorithm , 2015 .

[9]  Mengjie Zhang,et al.  Particle swarm optimisation for feature selection in classification: Novel initialisation and updating mechanisms , 2014, Appl. Soft Comput..

[10]  Carlos A. Coello Coello,et al.  Improving PSO-Based Multi-objective Optimization Using Crowding, Mutation and epsilon-Dominance , 2005, EMO.

[11]  Rinkle Rani,et al.  C-HMOSHSSA: Gene selection for cancer classification using multi-objective meta-heuristic and machine learning methods , 2019, Comput. Methods Programs Biomed..

[12]  Gang Wang,et al.  A Novel Hybrid Algorithm for Feature Selection Based on Whale Optimization Algorithm , 2019, IEEE Access.

[13]  Anne Auger,et al.  Theory of the hypervolume indicator: optimal μ-distributions and the choice of the reference point , 2009, FOGA '09.

[14]  Majdi M. Mafarja,et al.  Hybrid Whale Optimization Algorithm with simulated annealing for feature selection , 2017, Neurocomputing.

[15]  Marco Laumanns,et al.  SPEA2: Improving the strength pareto evolutionary algorithm , 2001 .

[16]  Mohammad-Reza Feizi-Derakhshi,et al.  Feature selection using Forest Optimization Algorithm , 2016, Pattern Recognit..

[17]  Mohammad-Reza Feizi-Derakhshi,et al.  Forest Optimization Algorithm , 2014, Expert Syst. Appl..

[18]  A. Wayne Whitney,et al.  A Direct Method of Nonparametric Measurement Selection , 1971, IEEE Transactions on Computers.

[19]  Xingcheng Liu,et al.  Hierarchy Strengthened Grey Wolf Optimizer for Numerical Optimization and Feature Selection , 2019, IEEE Access.

[20]  Harpreet Singh,et al.  A New Hybrid Algorithm Based on Grey Wolf Optimization and Crow Search Algorithm for Unconstrained Function Optimization and Feature Selection , 2019, IEEE Access.

[21]  Seyed Hamid Zahiri,et al.  An optimal SVM with feature selection using multi-objective PSO , 2016, 2016 1st Conference on Swarm Intelligence and Evolutionary Computation (CSIEC).

[22]  Xiaoyan Sun,et al.  Multi-objective feature selection based on artificial bee colony: An acceleration approach with variable sample size , 2020, Appl. Soft Comput..

[23]  Li-Yeh Chuang,et al.  Improved binary particle swarm optimization using catfish effect for feature selection , 2011, Expert Syst. Appl..

[24]  Yasushi Makihara,et al.  Object recognition supported by user interaction for service robots , 2002, Object recognition supported by user interaction for service robots.

[25]  Ujjwal Maulik,et al.  A Survey of Multiobjective Evolutionary Algorithms for Data Mining: Part I , 2014, IEEE Transactions on Evolutionary Computation.

[26]  Behrouz Minaei-Bidgoli,et al.  Optimizing multi-objective PSO based feature selection method using a feature elitism mechanism , 2018, Expert Syst. Appl..

[27]  Mohamed Touahria,et al.  Feature selection for intrusion detection using new multi-objective estimation of distribution algorithms , 2019, Applied Intelligence.

[28]  Luis E. Zárate,et al.  Multi-Objective Genetic Algorithm for Feature Selection in a Protein Function Prediction Context , 2018, 2018 IEEE Congress on Evolutionary Computation (CEC).

[29]  Zhixin Sun,et al.  An Improved Feature Selection Algorithm Based on Ant Colony Optimization , 2018, IEEE Access.

[30]  S. B. Singh,et al.  Hybrid Algorithm of Particle Swarm Optimization and Grey Wolf Optimizer for Improving Convergence Performance , 2017, J. Appl. Math..

[31]  Martin J. Oates,et al.  PESA-II: region-based selection in evolutionary multiobjective optimization , 2001 .

[32]  Josef Kittler,et al.  Floating search methods in feature selection , 1994, Pattern Recognit. Lett..

[33]  Zixing Cai,et al.  A Fast Method of Constructing the Non-dominated Set: Arena's Principle , 2008, 2008 Fourth International Conference on Natural Computation.

[34]  Umi Kalsom Yusof,et al.  Filter-Based Multi-Objective Feature Selection Using NSGA III and Cuckoo Optimization Algorithm , 2020, IEEE Access.

[35]  Kalyanmoy Deb,et al.  Multi-objective Optimization , 2014 .

[36]  Said Jadid Abdul Kadir,et al.  Binary Optimization Using Hybrid Grey Wolf Optimization for Feature Selection , 2019, IEEE Access.

[37]  Václav Snásel,et al.  Large-dimensionality small-instance set feature selection: A hybrid bio-inspired heuristic approach , 2018, Swarm Evol. Comput..

[38]  Roberto Battiti,et al.  Using mutual information for selecting features in supervised neural net learning , 1994, IEEE Trans. Neural Networks.

[39]  Ferat Sahin,et al.  A survey on feature selection methods , 2014, Comput. Electr. Eng..

[40]  Aboul Ella Hassanien,et al.  Binary grey wolf optimization approaches for feature selection , 2016, Neurocomputing.

[41]  Nikhil R. Pal,et al.  Genetic programming for simultaneous feature selection and classifier design , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[42]  Mengjie Zhang,et al.  Pareto front feature selection based on artificial bee colony optimization , 2018, Inf. Sci..

[43]  Mohammad Karim Sohrabi,et al.  Multi-objective feature selection for warfarin dose prediction , 2017, Comput. Biol. Chem..

[44]  Dun-Wei Gong,et al.  A multi-objective feature selection based on differential evolution , 2015, 2015 International Conference on Control, Automation and Information Sciences (ICCAIS).

[45]  Mengjie Zhang,et al.  Differential evolution (DE) for multi-objective feature selection in classification , 2014, GECCO.

[46]  Y. Ong,et al.  Feature Selection Using Single/Multi-Objective Memetic Frameworks , 2009 .

[47]  Zhong Yan,et al.  Ant Colony Optimization for Feature Selection in Face Recognition , 2004, ICBA.

[48]  George K. Karagiannidis,et al.  Efficient Machine Learning for Big Data: A Review , 2015, Big Data Res..

[49]  Anshuman Panda,et al.  A Modern Approach for Load Balancing Using Forest Optimization Algorithm , 2018, 2018 Second International Conference on Computing Methodologies and Communication (ICCMC).

[50]  Seyedali Mirjalili,et al.  Approaches to Multi-Objective Feature Selection: A Systematic Literature Review , 2020, IEEE Access.

[51]  S. Boucheron,et al.  Theory of classification : a survey of some recent advances , 2005 .

[52]  Amr Badr,et al.  A binary clonal flower pollination algorithm for feature selection , 2016, Pattern Recognit. Lett..

[53]  Jacob Scharcanski,et al.  Feature selection for face recognition based on multi-objective evolutionary wrappers , 2013, Expert Syst. Appl..

[54]  Bhuvaneswari Ragothaman,et al.  A Multi-objective Non-Dominated Sorted Artificial Bee Colony Feature Selection Algorithm for Medical Datasets , 2016 .

[55]  Edoardo Amaldi,et al.  On the Approximability of Minimizing Nonzero Variables or Unsatisfied Relations in Linear Systems , 1998, Theor. Comput. Sci..

[56]  Fernando Jiménez,et al.  Multi-objective evolutionary feature selection for online sales forecasting , 2017, Neurocomputing.

[57]  Huan Liu,et al.  Toward integrating feature selection algorithms for classification and clustering , 2005, IEEE Transactions on Knowledge and Data Engineering.

[58]  Carlos A. Coello Coello,et al.  Handling multiple objectives with particle swarm optimization , 2004, IEEE Transactions on Evolutionary Computation.

[59]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[60]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[61]  Seyedali Mirjalili,et al.  Binary Multi-Objective Grey Wolf Optimizer for Feature Selection in Classification , 2020, IEEE Access.

[62]  Seyed Hamid Zahiri,et al.  An optimal SVM with feature selection using multi-objective PSO , 2016 .

[63]  Mengjie Zhang,et al.  Multi-objective Feature Selection in Classification: A Differential Evolution Approach , 2014, SEAL.

[64]  Silvia Casado Yusta,et al.  Different metaheuristic strategies to solve the feature selection problem , 2009, Pattern Recognit. Lett..

[65]  Eibe Frank,et al.  Large-scale attribute selection using wrappers , 2009, 2009 IEEE Symposium on Computational Intelligence and Data Mining.

[66]  Sambit Bakshi,et al.  Analysis of high-dimensional biomedical data using an evolutionary multi-objective emperor penguin optimizer , 2019, Swarm Evol. Comput..

[67]  Mengjie Zhang,et al.  Particle Swarm Optimization for Feature Selection in Classification: A Multi-Objective Approach , 2013, IEEE Transactions on Cybernetics.

[68]  Jon Louis Bentley,et al.  Multidimensional divide-and-conquer , 1980, CACM.

[69]  Chih-Cheng Hung,et al.  A Multi-objective hybrid filter-wrapper evolutionary approach for feature selection , 2018, Memetic Comput..

[70]  M. Tahar Kechadi,et al.  Multi-objective feature selection by using NSGA-II for customer churn prediction in telecommunications , 2010, Expert Syst. Appl..

[71]  Hua Xu,et al.  An improved NSGA-III procedure for evolutionary many-objective optimization , 2014, GECCO.

[72]  Parham Moradi,et al.  An unsupervised feature selection algorithm based on ant colony optimization , 2014, Eng. Appl. Artif. Intell..

[73]  Fakhri Karray,et al.  Multi-objective Feature Selection with NSGA II , 2007, ICANNGA.

[74]  Salma Jamoussi,et al.  A multi-objective genetic algorithm for simultaneous model and feature selection for support vector machines , 2017, Artificial Intelligence Review.

[75]  Luiz Eduardo Soares de Oliveira,et al.  Feature selection using multi-objective genetic algorithms for handwritten digit recognition , 2002, Object recognition supported by user interaction for service robots.

[76]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[77]  Byung Ro Moon,et al.  Hybrid Genetic Algorithms for Feature Selection , 2004, IEEE Trans. Pattern Anal. Mach. Intell..

[78]  G. Geoffrey Vining,et al.  Taguchi's parameter design: a panel discussion , 1992 .

[79]  Jian Cheng,et al.  Multi-Objective Particle Swarm Optimization Approach for Cost-Based Feature Selection in Classification , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[80]  Frank Neumann,et al.  Analyzing Hypervolume Indicator Based Algorithms , 2008, PPSN.

[81]  Seyed Mohammad Mirjalili,et al.  Whale optimization approaches for wrapper feature selection , 2018, Appl. Soft Comput..

[82]  Dunwei Gong,et al.  Binary differential evolution with self-learning for multi-objective feature selection , 2020, Inf. Sci..

[83]  Thomas Marill,et al.  On the effectiveness of receptors in recognition systems , 1963, IEEE Trans. Inf. Theory.

[84]  Mengjie Zhang,et al.  A multi-objective artificial bee colony approach to feature selection using fuzzy mutual information , 2015, 2015 IEEE Congress on Evolutionary Computation (CEC).

[85]  Zexuan Zhu,et al.  Markov blanket-embedded genetic algorithm for gene selection , 2007, Pattern Recognit..

[86]  E.J. Delp,et al.  A Comparison of Feature Selection Methods for the Detection of Breast Cancers in Mammograms: Adaptive Sequential Floating Search vs. Genetic Algorithm , 2005, 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference.

[87]  Xin Yao,et al.  A Survey on Evolutionary Computation Approaches to Feature Selection , 2016, IEEE Transactions on Evolutionary Computation.

[88]  Mengjie Zhang,et al.  A binary ABC algorithm based on advanced similarity scheme for feature selection , 2015, Appl. Soft Comput..

[89]  Luis Javier Rodríguez-Fuentes,et al.  Feature Selection Based on Genetic Algorithms for Speaker Recognition , 2006, 2006 IEEE Odyssey - The Speaker and Language Recognition Workshop.

[90]  Zhong Ming,et al.  An improved NSGA-III algorithm for feature selection used in intrusion detection , 2017, Knowl. Based Syst..

[91]  Larry A. Rendell,et al.  A Practical Approach to Feature Selection , 1992, ML.

[92]  Mengjie Zhang,et al.  New mechanism for archive maintenance in PSO-based multi-objective feature selection , 2016, Soft Comput..

[93]  Rich Caruana,et al.  Greedy Attribute Selection , 1994, ICML.

[94]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[95]  Hossein Nezamabadi-pour,et al.  An advanced ACO algorithm for feature subset selection , 2015, Neurocomputing.

[96]  Andrew Hunter,et al.  A multi-objective genetic algorithm approach to feature selection in neural and fuzzy modeling , 2001 .

[97]  Parham Moradi,et al.  A hybrid particle swarm optimization for feature subset selection by integrating a novel local search strategy , 2016, Appl. Soft Comput..

[98]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[99]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..