Differential Evolution-Based Feature Selection: A Niching-Based Multiobjective Approach

Feature selection is to reduce both the dimensionality of data and the classification error rate (i.e., increase the classification accuracy) of a learning algorithm. The two objectives are often conflicting, thus a multiobjective feature selection method can obtain a set of nondominated feature subsets. Each solution in the set has a different size and a corresponding classification error rate. However, most existing feature selection algorithms have ignored that, for a given size, there can be different feature subsets with very similar or the same accuracy. This article introduces a niching-based multiobjective feature selection method that simultaneously minimizes the number of selected features and the classification error rate. The proposed method conceives to identify: 1) a set of feature subsets with good convergence and distribution and 2) multiple feature subsets choosing the same number of features with almost the same lowest classification error rate. The contributions of this article are threefold. First, a niching and global interaction mutation operator is proposed that can produce promising feature subsets. Second, a newly developed environmental selection mechanism allows equal informative feature subsets to be stored by relaxing the Pareto-dominance relationship. Finally, the proposed subset repairing mechanism can generate better feature subsets and further remove the redundant features. The proposed method is compared against seven multiobjective feature selection algorithms on 19 datasets, including both binary and multiclass classification tasks. The results show that the proposed method can evolve a rich and diverse set of nondominated solutions for different feature selection tasks, and their availability helps in understanding the relationships between features.

[1]  Bing Xue,et al.  A Grid-dominance based Multi-objective Algorithm for Feature Selection in Classification , 2021, 2021 IEEE Congress on Evolutionary Computation (CEC).

[2]  Bing Xue,et al.  Improved Crowding Distance in Multi-objective Optimization for Feature Selection in Classification , 2021, EvoApplications.

[3]  Jing J. Liang,et al.  Differential evolution using improved crowding distance for multimodal multiobjective optimization , 2021, Swarm Evol. Comput..

[4]  Fei Han,et al.  Multi-objective particle swarm optimization with adaptive strategies for feature selection , 2021, Swarm Evol. Comput..

[5]  D. Gong,et al.  A Fast Hybrid Feature Selection Based on Correlation-Guided Clustering and Particle Swarm Optimization for High-Dimensional Data , 2021, IEEE Transactions on Cybernetics.

[6]  Fan Cheng,et al.  A Steering-Matrix-Based Multiobjective Evolutionary Algorithm for High-Dimensional Feature Selection , 2021, IEEE Transactions on Cybernetics.

[7]  Feiping Nie,et al.  A General Framework for Feature Selection Under Orthogonal Regression With Global Redundancy Minimization , 2021, IEEE Transactions on Knowledge and Data Engineering.

[8]  Kanchan Jha,et al.  Incorporation of multimodal multiobjective optimization in designing a filter based feature selection technique , 2020, Appl. Soft Comput..

[9]  Yong Zhang,et al.  Multiobjective Particle Swarm Optimization for Feature Selection With Fuzzy Cost , 2020, IEEE Transactions on Cybernetics.

[10]  Bing Xue,et al.  A Duplication Analysis-Based Evolutionary Algorithm for Biobjective Feature Selection , 2020, IEEE Transactions on Evolutionary Computation.

[11]  Ying Lin,et al.  Local Binary Pattern-Based Adaptive Differential Evolution for Multimodal Optimization Problems , 2020, IEEE Transactions on Cybernetics.

[12]  Bing Xue,et al.  Segmented initialization and offspring modification in evolutionary algorithms for bi-objective feature selection , 2020, GECCO.

[13]  Xiaoping Li,et al.  A Survey on Sparse Learning Models for Feature Selection , 2020, IEEE Transactions on Cybernetics.

[14]  Ye Tian,et al.  An Evolutionary Algorithm for Large-Scale Sparse Multiobjective Optimization Problems , 2020, IEEE Transactions on Evolutionary Computation.

[15]  Hisao Ishibuchi,et al.  A Review of Evolutionary Multimodal Multiobjective Optimization , 2020, IEEE Transactions on Evolutionary Computation.

[16]  Hisao Ishibuchi,et al.  Multiple Reference Points-Based Decomposition for Multiobjective Feature Selection in Classification: Static and Dynamic Mechanisms , 2020, IEEE Transactions on Evolutionary Computation.

[17]  Jing J. Liang,et al.  Multimodal Multiobjective Optimization in Feature Selection , 2019, 2019 IEEE Congress on Evolutionary Computation (CEC).

[18]  Mengjie Zhang,et al.  Variable-Length Particle Swarm Optimization for Feature Selection on High-Dimensional Classification , 2019, IEEE Transactions on Evolutionary Computation.

[19]  Shahryar Rahnamayan,et al.  A Novel Multi-objective Binary Differential Evolution Algorithm for Multi-label Feature Selection , 2019, 2019 IEEE Congress on Evolutionary Computation (CEC).

[20]  Xingquan Zuo,et al.  Barebones Particle Swarm Optimization with a Neighborhood Search Strategy for Feature Selection , 2018, BIC-TA.

[21]  Xin Feng,et al.  Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances. , 2018, Journal of visualized experiments : JoVE.

[22]  Jing J. Liang,et al.  A Multiobjective Particle Swarm Optimizer Using Ring Topology for Solving Multimodal Multiobjective Problems , 2018, IEEE Transactions on Evolutionary Computation.

[23]  Emrah Hancer,et al.  A differential evolution approach for simultaneous clustering and feature selection , 2018, 2018 International Conference on Artificial Intelligence and Data Processing (IDAP).

[24]  Tapabrata Ray,et al.  Evolutionary Algorithms for Finding Nash Equilibria in Electricity Markets , 2018, IEEE Transactions on Evolutionary Computation.

[25]  Fengfeng Zhou,et al.  Multiple similarly effective solutions exist for biomedical feature selection and classification problems , 2017, Scientific Reports.

[26]  Alexander Pletzer,et al.  How NeSI Helps Users Run Better and Faster on New Zealand's Supercomputing Platforms , 2017, 2017 IEEE 13th International Conference on e-Science (e-Science).

[27]  Xiaodong Li,et al.  Seeking Multiple Solutions: An Updated Survey on Niching Methods and Their Applications , 2017, IEEE Transactions on Evolutionary Computation.

[28]  Mengjie Zhang,et al.  Particle Swarm Optimisation with genetic operators for feature selection , 2017, 2017 IEEE Congress on Evolutionary Computation (CEC).

[29]  Ye Tian,et al.  PlatEMO: A MATLAB Platform for Evolutionary Multi-Objective Optimization [Educational Forum] , 2017, IEEE Computational Intelligence Magazine.

[30]  Xin Yao,et al.  A Survey on Evolutionary Computation Approaches to Feature Selection , 2016, IEEE Transactions on Evolutionary Computation.

[31]  Selin Damla Ahipasaoglu,et al.  Identifying (Quasi) Equally Informative Subsets in Feature Selection Problems for Classification: A Max-Relevance Min-Redundancy Approach , 2016, IEEE Transactions on Cybernetics.

[32]  Enhong Chen,et al.  Feature Selection with Integrated Relevance and Redundancy Optimization , 2015, 2015 IEEE International Conference on Data Mining.

[33]  Mengjie Zhang,et al.  Multi-objective Feature Selection in Classification: A Differential Evolution Approach , 2014, SEAL.

[34]  Jian Zhuang,et al.  Combining Crowding Estimation in Objective and Decision Space With Multiple Selection and Search Strategies for Multi-Objective Evolutionary Optimization , 2014, IEEE Transactions on Cybernetics.

[35]  Mengjie Zhang,et al.  Particle Swarm Optimization for Feature Selection in Classification: A Multi-Objective Approach , 2013, IEEE Transactions on Cybernetics.

[36]  Feiping Nie,et al.  Efficient and Robust Feature Selection via Joint ℓ2, 1-Norms Minimization , 2010, NIPS.

[37]  Lei Wang,et al.  Efficient Spectral Feature Selection with Minimum Redundancy , 2010, AAAI.

[38]  Kalyanmoy Deb,et al.  Omni-optimizer: A generic evolutionary algorithm for single and multi-objective optimization , 2008, Eur. J. Oper. Res..

[39]  Gavin C. Cawley,et al.  Sparse Multinomial Logistic Regression via Bayesian L1 Regularisation , 2006, NIPS.

[40]  Huan Liu,et al.  Efficient Feature Selection via Analysis of Relevance and Redundancy , 2004, J. Mach. Learn. Res..

[41]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[42]  Lloyd A. Smith,et al.  Feature Selection for Machine Learning: Comparing a Correlation-Based Filter Approach to the Wrapper , 1999, FLAIRS.

[43]  Hiroshi Motoda,et al.  Feature Extraction, Construction and Selection: A Data Mining Perspective , 1998 .

[44]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[45]  S. Ronald Finding multiple solutions with an evolutionary algorithm , 1995, Proceedings of 1995 IEEE International Conference on Evolutionary Computation.

[46]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[47]  Josef Kittler,et al.  Floating search methods in feature selection , 1994, Pattern Recognit. Lett..

[48]  Thomas G. Dietterich,et al.  Learning Boolean Concepts in the Presence of Many Irrelevant Features , 1994, Artif. Intell..

[49]  Darrell Whitley,et al.  A genetic algorithm tutorial , 1994, Statistics and Computing.

[50]  Igor Kononenko,et al.  Estimating Attributes: Analysis and Extensions of RELIEF , 1994, ECML.

[51]  Larry A. Rendell,et al.  A Practical Approach to Feature Selection , 1992, ML.

[52]  David E. Goldberg,et al.  Genetic Algorithms with Sharing for Multimodalfunction Optimization , 1987, ICGA.

[53]  Dunwei Gong,et al.  Binary differential evolution with self-learning for multi-objective feature selection , 2020, Inf. Sci..

[54]  Mahdi Eftekhari,et al.  Feature selection using multimodal optimization techniques , 2016, Neurocomputing.

[55]  Huan Liu,et al.  Feature Selection for High-Dimensional Data: A Fast Correlation-Based Filter Solution , 2003, ICML.

[56]  P. John Clarkson,et al.  A Species Conserving Genetic Algorithm for Multimodal Function Optimization , 2002, Evolutionary Computation.

[57]  Stanley J. Reeves,et al.  Sequential algorithms for observation selection , 1999, IEEE Trans. Signal Process..

[58]  K. Dejong,et al.  An analysis of the behavior of a class of genetic adaptive systems , 1975 .