A Multi-Label Feature Selection Based on Mutual Information and Ant Colony Optimization

Multi-label learning aims at assigning more than a class label to each instance. Due to the development of digital technologies, in many real-world applications that are high-dimensional. Feature selection methods are widely used in multi-label learning to reduce the dimensionality of the data. In this paper, multivariate and filter feature selection method based on the ant colony optimization and mutual information is proposed. The proposed method employs the ant colony optimization to rank the features based on their importance. To this end, first the search space is mapped to a graph and then each ant moves through the graph to select a predefined number of features. Moreover, a novel information-theoretic measure is proposed to evaluate the features selected by each ant. This measure uses the concept of mutual information to calculate the relevancy of each feature with a set of labels. This measure is also employed to assess the redundancy between selected features. The pheromones of features are assigned based on the quality of solutions founded by the ants. Finally, the features are sorted based on their assigned pheromones, and then those of top features are chosen as the final feature set. The proposed method is performed on some real-world datasets and the results show the superiority of the proposed method in comparison with some well-known and state-of-the-art methods.

[1]  Fardin Ahmadizar,et al.  A novel multivariate filter method for feature selection in text classification problems , 2018, Eng. Appl. Artif. Intell..

[2]  Rebecca S. Wills Google's page rank: the Math behind the search engine , 2006 .

[3]  Huan Liu,et al.  Toward integrating feature selection algorithms for classification and clustering , 2005, IEEE Transactions on Knowledge and Data Engineering.

[4]  Vinod Kumar Jain,et al.  Correlation feature selection based improved-Binary Particle Swarm Optimization for gene selection and cancer classification , 2018, Appl. Soft Comput..

[5]  Yong Zhang,et al.  A PSO-based multi-objective multi-label feature selection method in classification , 2017, Scientific Reports.

[6]  Michel Verleysen,et al.  Feature Selection for Multi-label Classification Problems , 2011, IWANN.

[7]  Witold Pedrycz,et al.  Granular multi-label feature selection based on mutual information , 2017, Pattern Recognit..

[8]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[9]  M Dorigo,et al.  Ant colonies for the travelling salesman problem. , 1997, Bio Systems.

[10]  Qinghua Hu,et al.  Multi-label feature selection based on max-dependency and min-redundancy , 2015, Neurocomputing.

[11]  Parham Moradi,et al.  An unsupervised feature selection algorithm based on ant colony optimization , 2014, Eng. Appl. Artif. Intell..

[12]  Zhi-Hua Zhou,et al.  A Unified View of Multi-Label Performance Measures , 2016, ICML.

[13]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[14]  Parham Moradi,et al.  Integration of graph clustering with ant colony optimization for feature selection , 2015, Knowl. Based Syst..

[15]  Dae-Won Kim,et al.  Fast multi-label feature selection based on information-theoretic feature ranking , 2015, Pattern Recognit..

[16]  Hossein Nezamabadi-pour,et al.  MGFS: A multi-label graph-based feature selection algorithm via PageRank centrality , 2020, Expert Syst. Appl..

[17]  Dae-Won Kim,et al.  Feature selection for multi-label classification using multivariate mutual information , 2013, Pattern Recognit. Lett..

[18]  Qinghua Hu,et al.  Multi-label feature selection with missing labels , 2018, Pattern Recognit..

[19]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[20]  Zhiming Luo,et al.  Manifold regularized discriminative feature selection for multi-label learning , 2019, Pattern Recognit..

[21]  Everton Alvares Cherman,et al.  Multi-label Problem Transformation Methods: a Case Study , 2011, CLEI Electron. J..

[22]  Rui Huang,et al.  Manifold-based constraint Laplacian score for multi-label feature selection , 2018, Pattern Recognit. Lett..

[23]  Rebecca S. Wills Google’s pagerank , 2006 .

[24]  Hojat Ghimatgar,et al.  An improved feature selection algorithm based on graph clustering and ant colony optimization , 2018, Knowl. Based Syst..

[25]  Chiman Salavati,et al.  Fast unsupervised feature selection based on the improved binary ant system and mutation strategy , 2019, Neural Computing and Applications.