Information-Theory-based Nondominated Sorting Ant Colony Optimization for Multiobjective Feature Selection in Classification

Feature selection (FS) has received significant attention since the use of a well-selected subset of features may achieve better classification performance than that of full features in many real-world applications. It can be considered as a multiobjective optimization consisting of two objectives: 1) minimizing the number of selected features and 2) maximizing classification performance. Ant colony optimization (ACO) has shown its effectiveness in FS due to its problem-guided search operator and flexible graph representation. However, there lacks an effective ACO-based approach for multiobjective FS to handle the problematic characteristics originated from the feature interactions and highly discontinuous Pareto fronts. This article presents an Information-theory-based Nondominated Sorting ACO (called INSA) to solve the aforementioned difficulties. First, the probabilistic function in ACO is modified based on the information theory to identify the importance of features; second, a new ACO strategy is designed to construct solutions; and third, a novel pheromone updating strategy is devised to ensure the high diversity of tradeoff solutions. INSA’s performance is compared with four machine-learning-based methods, four representative single-objective evolutionary algorithms, and six state-of-the-art multiobjective ones on 13 benchmark classification datasets, which consist of both low and high-dimensional samples. The empirical results verify that INSA is able to obtain solutions with better classification performance using features whose count is similar to or less than those obtained by its peers.

[1]  Mengchu Zhou,et al.  Competition-Driven Multimodal Multiobjective Optimization and Its Application to Feature Selection for Credit Card Fraud Detection , 2022, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[2]  Mengjie Zhang,et al.  Multiobjective Differential Evolution for Feature Selection in Classification , 2021, IEEE Transactions on Cybernetics.

[3]  J. Zhong,et al.  An efficient multi-objective ant colony optimization for task allocation of heterogeneous unmanned aerial vehicles , 2021, J. Comput. Sci..

[4]  Jun Tang,et al.  A Review on Representative Swarm Intelligence Algorithms for Solving Optimization Problems: Applications and Trends , 2021, IEEE/CAA Journal of Automatica Sinica.

[5]  Xin Luo,et al.  A Latent Factor Analysis-Based Approach to Online Sparse Streaming Feature Selection , 2021, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[6]  Qinqin Fan,et al.  Zoning Search With Adaptive Resource Allocating Method for Balanced and Imbalanced Multimodal Multi-Objective Optimization , 2021, IEEE/CAA Journal of Automatica Sinica.

[7]  Quanwang Wu,et al.  A Hybrid Probabilistic Multiobjective Evolutionary Algorithm for Commercial Recommendation Systems , 2021, IEEE Transactions on Computational Social Systems.

[8]  MengChu Zhou,et al.  Dual-Objective Mixed Integer Linear Program and Memetic Algorithm for an Industrial Group Scheduling Problem , 2021, IEEE/CAA Journal of Automatica Sinica.

[9]  F. Nie,et al.  Semisupervised Feature Selection With Sparse Discriminative Least Squares Regression , 2021, IEEE Transactions on Cybernetics.

[10]  Qingfu Zhang,et al.  Investigating the Properties of Indicators and an Evolutionary Many-Objective Algorithm Using Promising Regions , 2021, IEEE Transactions on Evolutionary Computation.

[11]  Yaochu Jin,et al.  A Survey of Evolutionary Algorithms for Multi-Objective Optimization Problems With Irregular Pareto Fronts , 2021, IEEE/CAA Journal of Automatica Sinica.

[12]  Feiping Nie,et al.  Subspace Sparse Discriminative Feature Selection , 2020, IEEE Transactions on Cybernetics.

[13]  José Pérez-Parras Toledano,et al.  Grab’Em: A Novel Graph-Based Method for Combining Feature Subset Selectors , 2020, IEEE Transactions on Cybernetics.

[14]  Bing Xue,et al.  A Duplication Analysis-Based Evolutionary Algorithm for Biobjective Feature Selection , 2020, IEEE Transactions on Evolutionary Computation.

[15]  Kalyanmoy Deb,et al.  Neural Architecture Transfer , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Ram Sarkar,et al.  Introducing clustering based population in Binary Gravitational Search Algorithm for Feature Selection , 2020, Appl. Soft Comput..

[17]  Mengjie Zhang,et al.  A survey on swarm intelligence approaches to feature selection in data mining , 2020, Swarm Evol. Comput..

[18]  Ye Tian,et al.  An Evolutionary Algorithm for Large-Scale Sparse Multiobjective Optimization Problems , 2020, IEEE Transactions on Evolutionary Computation.

[19]  Kay Chen Tan,et al.  Solving Large-Scale Multiobjective Optimization Problems With Sparse Optimal Solutions via Unsupervised Neural Networks , 2020, IEEE Transactions on Cybernetics.

[20]  Hisao Ishibuchi,et al.  Multiple Reference Points-Based Decomposition for Multiobjective Feature Selection in Classification: Static and Dynamic Mechanisms , 2020, IEEE Transactions on Evolutionary Computation.

[21]  Xiaoyan Sun,et al.  Variable-Size Cooperative Coevolutionary Particle Swarm Optimization for Feature Selection on High-Dimensional Data , 2020, IEEE Transactions on Evolutionary Computation.

[22]  Canhua Wang,et al.  Functional connectivity-based classification of autism and control using SVM-RFECV on rs-fMRI data. , 2019, Physica medica : PM : an international journal devoted to the applications of physics to medicine and biology : official journal of the Italian Association of Biomedical Physics.

[23]  Mengjie Zhang,et al.  Variable-Length Particle Swarm Optimization for Feature Selection on High-Dimensional Classification , 2019, IEEE Transactions on Evolutionary Computation.

[24]  Ram Sarkar,et al.  A wrapper-filter feature selection technique based on ant colony optimization , 2019, Neural Computing and Applications.

[25]  Ye Tian,et al.  A Strengthened Dominance Relation Considering Convergence and Diversity for Evolutionary Many-Objective Optimization , 2019, IEEE Transactions on Evolutionary Computation.

[26]  MengChu Zhou,et al.  An embedded feature selection method for imbalanced data classification , 2019, IEEE/CAA Journal of Automatica Sinica.

[27]  Xiangxiang Zeng,et al.  MOEA/HD: A Multiobjective Evolutionary Algorithm Based on Hierarchical Decomposition , 2019, IEEE Transactions on Cybernetics.

[28]  Jiujun Cheng,et al.  Dendritic Neuron Model With Effective Learning Algorithms for Classification, Approximation, and Prediction , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[29]  Bo Tang,et al.  Spectrally Constrained MIMO Radar Waveform Design Based on Mutual Information , 2019, IEEE Transactions on Signal Processing.

[30]  Ye Tian,et al.  An Indicator-Based Multiobjective Evolutionary Algorithm With Reference Point Adaptation for Better Versatility , 2018, IEEE Transactions on Evolutionary Computation.

[31]  Hongmei He,et al.  Information gain directed genetic algorithm wrapper feature selection for credit rating , 2018, Appl. Soft Comput..

[32]  Dun-Wei Gong,et al.  A return-cost-based binary firefly algorithm for feature selection , 2017, Inf. Sci..

[33]  Majdi M. Mafarja,et al.  Hybrid Whale Optimization Algorithm with simulated annealing for feature selection , 2017, Neurocomputing.

[34]  Cheng Wang,et al.  A filter feature selection method based on the Maximal Information Coefficient and Gram-Schmidt Orthogonalization for biomedical data mining , 2017, Comput. Biol. Medicine.

[35]  Ye Tian,et al.  PlatEMO: A MATLAB Platform for Evolutionary Multi-Objective Optimization [Educational Forum] , 2017, IEEE Computational Intelligence Magazine.

[36]  Zhiwei Ye,et al.  A feature selection method based on modified binary coded ant colony optimization algorithm , 2016, Appl. Soft Comput..

[37]  Xin Yao,et al.  A Survey on Evolutionary Computation Approaches to Feature Selection , 2016, IEEE Transactions on Evolutionary Computation.

[38]  Jiujun Cheng,et al.  Ant colony optimization with clustering for solving the dynamic location routing problem , 2016, Appl. Math. Comput..

[39]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[40]  Qingfu Zhang,et al.  Decomposition-Based Algorithms Using Pareto Adaptive Scalarizing Methods , 2016, IEEE Transactions on Evolutionary Computation.

[41]  Jie Zhang,et al.  A Simple and Fast Hypervolume Indicator-Based Multiobjective Evolutionary Algorithm , 2015, IEEE Transactions on Cybernetics.

[42]  Halife Kodaz,et al.  A new hybrid method based on Particle Swarm Optimization, Ant Colony Optimization and 3-Opt algorithms for Traveling Salesman Problem , 2015, Appl. Soft Comput..

[43]  Hisao Ishibuchi,et al.  Modified Distance Calculation in Generational Distance and Inverted Generational Distance , 2015, EMO.

[44]  Jon Atli Benediktsson,et al.  Feature Selection Based on Hybridization of Genetic Algorithm and Particle Swarm Optimization , 2015, IEEE Geoscience and Remote Sensing Letters.

[45]  Hossein Nezamabadi-pour,et al.  An advanced ACO algorithm for feature subset selection , 2015, Neurocomputing.

[46]  Kalyanmoy Deb,et al.  An Evolutionary Many-Objective Optimization Algorithm Using Reference-Point-Based Nondominated Sorting Approach, Part I: Solving Problems With Box Constraints , 2014, IEEE Transactions on Evolutionary Computation.

[47]  Parham Moradi,et al.  An unsupervised feature selection algorithm based on ant colony optimization , 2014, Eng. Appl. Artif. Intell..

[48]  Mengjie Zhang,et al.  Particle Swarm Optimization for Feature Selection in Classification: A Multi-Objective Approach , 2013, IEEE Transactions on Cybernetics.

[49]  U. Maulik,et al.  An SVM-Wrapped Multiobjective Evolutionary Feature Selection Approach for Identifying Cancer-MicroRNA Markers , 2013, IEEE Transactions on NanoBioscience.

[50]  Mengjie Zhang,et al.  Enhanced feature selection for biomarker discovery in LC-MS data using GP , 2013, 2013 IEEE Congress on Evolutionary Computation.

[51]  Ashish Ghosh,et al.  Self-adaptive differential evolution for feature selection in hyperspectral image data , 2013, Appl. Soft Comput..

[52]  Hong Li,et al.  A modification to MOEA/D-DE for multiobjective optimization problems with complicated Pareto sets , 2012, Inf. Sci..

[53]  Jieping Ye,et al.  Feature grouping and selection over an undirected graph , 2012, KDD.

[54]  Fikret S. Gürgen,et al.  A feature selection method based on kernel canonical correlation analysis and the minimum Redundancy-Maximum Relevance filter method , 2012, Expert Syst. Appl..

[55]  Alper Ekrem Murat,et al.  A discrete particle swarm optimization method for feature selection in binary classification problems , 2010, Eur. J. Oper. Res..

[56]  Kazuyuki Murase,et al.  A new wrapper feature selection approach using neural network , 2010, Neurocomputing.

[57]  Pedro Antonio Gutiérrez,et al.  Sensitivity Versus Accuracy in Multiclass Problems Using Memetic Pareto Evolutionary Neural Networks , 2010, IEEE Transactions on Neural Networks.

[58]  Nasser Ghasem-Aghaee,et al.  A novel ACO-GA hybrid algorithm for feature selection in protein function prediction , 2009, Expert Syst. Appl..

[59]  Yongming Li,et al.  Research of multi-population agent genetic algorithm for feature selection , 2009, Expert Syst. Appl..

[60]  Nasser Ghasem-Aghaee,et al.  Text feature selection using ant colony optimization , 2009, Expert Syst. Appl..

[61]  Qingfu Zhang,et al.  Multiobjective Optimization Problems With Complicated Pareto Sets, MOEA/D and NSGA-II , 2009, IEEE Transactions on Evolutionary Computation.

[62]  Liang-Hsuan Chen,et al.  Feature selection to diagnose a business crisis by using a real GA-based support vector machine: An empirical study , 2008, Expert Syst. Appl..

[63]  Cheng-Lung Huang,et al.  A distributed PSO-SVM hybrid system with feature selection and parameter optimization , 2008, Appl. Soft Comput..

[64]  Ahmed Al-Ani,et al.  Feature Subset Selection Using Ant Colony Optimization , 2008 .

[65]  Qingfu Zhang,et al.  MOEA/D: A Multiobjective Evolutionary Algorithm Based on Decomposition , 2007, IEEE Transactions on Evolutionary Computation.

[66]  Sushmita Mitra,et al.  Evolutionary Rough Feature Selection in Gene Expression Data , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[67]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[68]  R. Lyndon While,et al.  A faster algorithm for calculating hypervolume , 2006, IEEE Transactions on Evolutionary Computation.

[69]  Henry W. Altland,et al.  Computer-Based Robust Engineering: Essentials for DFSS , 2006, Technometrics.

[70]  Huan Liu,et al.  Toward integrating feature selection algorithms for classification and clustering , 2005, IEEE Transactions on Knowledge and Data Engineering.

[71]  Alice E. Smith,et al.  An ant colony optimization algorithm for the redundancy allocation problem (RAP) , 2004, IEEE Transactions on Reliability.

[72]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[73]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[74]  L. Breiman Random Forests , 2001, Encyclopedia of Machine Learning and Data Mining.

[75]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[76]  Thomas Stützle,et al.  MAX-MIN Ant System , 2000, Future Gener. Comput. Syst..

[77]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[78]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..

[79]  T. Stützle,et al.  MAX-MIN Ant System and local search for the traveling salesman problem , 1997, Proceedings of 1997 IEEE International Conference on Evolutionary Computation (ICEC '97).

[80]  Roberto Battiti,et al.  Using mutual information for selecting features in supervised neural net learning , 1994, IEEE Trans. Neural Networks.

[81]  Sahibsingh A. Dudani The Distance-Weighted k-Nearest-Neighbor Rule , 1976, IEEE Transactions on Systems, Man, and Cybernetics.

[82]  Seyed Mohammad Mirjalili,et al.  Whale optimization approaches for wrapper feature selection , 2018, Appl. Soft Comput..

[83]  Heba Abusamra,et al.  A Comparative Study of Feature Selection and Classification Methods for Gene Expression Data of Glioma , 2013 .

[84]  Manuel López-Ibáñez,et al.  Ant colony optimization , 2010, GECCO '10.

[85]  Oleg V. Favorov,et al.  Using covariates for improving the minimum redundancy maximum relevance feature selection method , 2010 .

[86]  Leslie S. Smith,et al.  Feature subset selection in large dimensionality domains , 2010, Pattern Recognit..

[87]  Thomas A. Runkler,et al.  Multi-Criteria Ant Feature Selection Using Fuzzy Classifiers , 2009 .

[88]  Yanqing Zhang,et al.  A genetic algorithm-based method for feature subset selection , 2008, Soft Comput..