Feature selection based on label distribution and fuzzy mutual information

Abstract In multi-label learning, high-dimensionality is the most prominent characteristic of the data. An efficient pre-processing step, named feature selection, is required to reduce “the curse of dimensionality” caused by irrelevant and redundant features in the high-dimensional feature space. However, the difference in significance of the related labels of an instance is ubiquitous in most practical applications. Motivated by that, in this paper, the label distribution learning is integrated into multi-label feature selection, which is proposed to mine the more supervised information ignored by equivalence relations in the label space of multi-label data. With the perspective of granular computing, a novel label enhancement algorithm is presented based on the fuzzy similarity relation, which utilizes the similarity between instances to explore the hidden label relevance and transform the logical label in multi-label data into a label distribution. Then, a label distribution feature selection algorithm is presented to measure the significance of features with the fuzzy mutual information framework. Moreover, on twelve publicly available multi-label datasets, the presented algorithm is compared with six state-of-the-art multi-label feature selection algorithms. As indicated in the experimental results, the presented algorithm achieves significant improvement over the extant algorithms.

[1]  Witold Pedrycz,et al.  Granular multi-label feature selection based on mutual information , 2017, Pattern Recognit..

[2]  Xin Geng,et al.  Indoor Crowd Counting by Mixture of Gaussians Label Distribution Learning , 2019, IEEE Transactions on Image Processing.

[3]  Hossein Nezamabadi-pour,et al.  Multilabel feature selection: A comprehensive review and guiding experiments , 2018, WIREs Data Mining Knowl. Discov..

[4]  Zhiming Luo,et al.  Manifold regularized discriminative feature selection for multi-label learning , 2019, Pattern Recognit..

[5]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[6]  Xin Geng,et al.  Soft Facial Landmark Detection by Label Distribution Learning , 2019, AAAI.

[7]  Xin Geng,et al.  Hierarchical Classification Based on Label Distribution Learning , 2019, AAAI.

[8]  Xiao Zhang,et al.  Active Incremental Feature Selection Using a Fuzzy-Rough-Set-Based Information Entropy , 2020, IEEE Transactions on Fuzzy Systems.

[9]  Dae-Won Kim,et al.  Memetic feature selection for multilabel text categorization using label frequency difference , 2019, Inf. Sci..

[10]  Zhi-Hua Zhou,et al.  Label Distribution Learning by Optimal Transport , 2018, AAAI.

[11]  Hua Li,et al.  A novel attribute reduction approach for multi-label data based on rough set theory , 2016, Inf. Sci..

[12]  Qinghua Hu,et al.  Multi-label feature selection based on max-dependency and min-redundancy , 2015, Neurocomputing.

[13]  Jiucheng Xu,et al.  Multilabel feature selection using ML-ReliefF and neighborhood mutual information for multilabel neighborhood decision systems , 2020, Inf. Sci..

[14]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[15]  Jianhua Dai,et al.  Feature selection via normative fuzzy information weight with application into tumor classification , 2020, Appl. Soft Comput..

[16]  Craig A. Knoblock,et al.  A Survey of Digital Map Processing Techniques , 2014, ACM Comput. Surv..

[17]  Newton Spolaôr,et al.  ReliefF for Multi-label Feature Selection , 2013, 2013 Brazilian Conference on Intelligent Systems.

[18]  Manli Zhou,et al.  hMuLab: A Biomedical Hybrid MUlti-LABel Classifier Based on Multiple Linear Regression , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[19]  Duoqian Miao,et al.  Class-specific information measures and attribute reducts for hierarchy and systematicness , 2021, Inf. Sci..

[20]  Ning Xu,et al.  Partial Multi-Label Learning with Label Distribution , 2020, AAAI.

[21]  Qinghua Hu,et al.  Streaming Feature Selection for Multilabel Learning Based on Fuzzy Mutual Information , 2017, IEEE Transactions on Fuzzy Systems.

[22]  Li Yu,et al.  FM-ECG: A fine-grained multi-label framework for ECG image classification , 2021, Inf. Sci..

[23]  Bin Fang,et al.  DSRPH: Deep semantic-aware ranking preserving hashing for efficient multi-label image retrieval , 2020, Inf. Sci..

[24]  Yongming Han,et al.  An asymmetric knowledge representation learning in manifold space , 2020, Inf. Sci..

[25]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[26]  Weiwei Li,et al.  Label Distribution Learning with Label Correlations via Low-Rank Approximation , 2019, IJCAI.

[27]  Dae-Won Kim,et al.  Feature selection for multi-label classification using multivariate mutual information , 2013, Pattern Recognit. Lett..

[28]  D. Dubois,et al.  ROUGH FUZZY SETS AND FUZZY ROUGH SETS , 1990 .

[29]  Yu Zhang,et al.  Label Distribution for Learning with Noisy Labels , 2020, IJCAI.

[30]  Lin Sun,et al.  Feature selection using neighborhood entropy-based uncertainty measures for gene expression data classification , 2019, Inf. Sci..

[31]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[32]  Ju-Sheng Mi,et al.  A novel approach for learning label correlation with application to feature selection of multi-label data , 2020, Inf. Sci..

[33]  Zhi-Hua Zhou,et al.  Multilabel dimensionality reduction via dependence maximization , 2008, TKDD.

[34]  Víctor Robles,et al.  Feature selection for multi-label naive Bayes classification , 2009, Inf. Sci..

[35]  Alberto Cano,et al.  Distributed Selection of Continuous Features in Multilabel Classification Using Mutual Information , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[36]  Xin Geng,et al.  Label Distribution Learning , 2013, 2013 IEEE 13th International Conference on Data Mining Workshops.

[37]  Jing Wang,et al.  Classification with Label Distribution Learning , 2019, IJCAI.

[38]  Witold Pedrycz,et al.  Granular structure-based incremental updating for multi-label classification , 2020, Knowl. Based Syst..

[39]  Zhiqiang Geng,et al.  Joint entity and relation extraction model based on rich semantics , 2021, Neurocomputing.

[40]  Yu Zhang,et al.  Label Enhancement for Label Distribution Learning via Prior Knowledge , 2020, IJCAI.

[41]  Ning Xu,et al.  Label Enhancement for Label Distribution Learning , 2019 .

[42]  Wanfu Gao,et al.  Multi-label feature selection based on the division of label topics , 2021, Inf. Sci..

[43]  M. Friedman A Comparison of Alternative Tests of Significance for the Problem of $m$ Rankings , 1940 .

[44]  Zehong Cao,et al.  Attribute reduction with fuzzy rough self-information measures , 2021, Inf. Sci..

[45]  Sebastián Ventura,et al.  A Tutorial on Multilabel Learning , 2015, ACM Comput. Surv..

[46]  Yongming Han,et al.  Semantic relation extraction using sequential and tree-structured LSTM with attention , 2020, Inf. Sci..

[47]  Yuwen Li,et al.  Attribute reduction for multi-label learning with fuzzy rough set , 2018, Knowl. Based Syst..

[48]  Zhi-Hua Zhou,et al.  Multi-Label Learning with Global and Local Label Correlation , 2017, IEEE Transactions on Knowledge and Data Engineering.

[49]  Xiangyang Luo,et al.  Gated Neural Network with Regularized Loss for Multi-label Text Classification , 2019, 2019 International Joint Conference on Neural Networks (IJCNN).

[50]  Hong Chen,et al.  Incremental feature selection based on fuzzy rough sets , 2020, Inf. Sci..

[51]  Rui Huang,et al.  Manifold-based constraint Laplacian score for multi-label feature selection , 2018, Pattern Recognit. Lett..