Multi-label feature selection via adaptive label correlation estimation

In multi-label learning, each instance is associated with multiple labels simultaneously. Multi-label data often has noisy, irrelevant, and redundant features of high dimensionality. Multi-label feature selection has received considerable attention as an effective means for dealing with high-dimensional multi-label data. Many multi-label feature selection methods exploit label correlations to help select features. However, finding label correlations and selecting features in existing multi-label feature selection methods are often two separate processes, the existence of noises and outliers in training data makes the label correlations exploited from label space less reliable. Therefore, the learned label correlations may mislead the feature selection process and result in the selection of less informative features. This paper proposes a novel algorithm named ROAD, i.e., multi-label featuRe selectiOn via ADaptive label correlation estimation. ROAD jointly performs adaptive label correlation exploration and feature selection with alternating optimization to obtain reliable estimation of label correlations, which can more effectively reveal the intrinsic manifold structure among labels and lead to the selection of a more proper feature subset. Comprehensive experiments on several frequently used data sets validate the superiority of ROAD against the state-of-the-art multi-label feature selection algorithms.

[1]  Yang Wang,et al.  Online Multi-Label Streaming Feature Selection With Label Correlation , 2023, IEEE Transactions on Knowledge and Data Engineering.

[2]  Hongbo Zhang,et al.  Fuzzy Mutual Information-Based Multilabel Feature Selection With Label Dependency and Streaming Labels , 2023, IEEE Transactions on Fuzzy Systems.

[3]  M. Rafsanjani,et al.  Ensemble feature selection for multi‐label text classification: An intelligent order statistics approach , 2022, Int. J. Intell. Syst..

[4]  Liang Hu,et al.  Multi-label feature selection via robust flexible sparse regularization , 2022, Pattern Recognit..

[5]  M. B. Dowlatshahi,et al.  MSSL: a memetic-based sparse subspace learning algorithm for multi-label classification , 2022, International Journal of Machine Learning and Cybernetics.

[6]  Liang Hu,et al.  Label correlations variation for robust multi-label feature selection , 2022, Inf. Sci..

[7]  Jiuyong Li,et al.  Integrating Global and Local Feature Selection for Multi-Label Learning , 2022, ACM Trans. Knowl. Discov. Data.

[8]  Liang Hu,et al.  Feature-specific mutual information variation for multi-label feature selection , 2022, Inf. Sci..

[9]  Wei Weng,et al.  Multi-label feature selection based on label correlations and feature redundancy , 2022, Knowl. Based Syst..

[10]  Hossein Nezamabadi-pour,et al.  An efficient Pareto-based feature selection algorithm for multi-label classification , 2021, Inf. Sci..

[11]  Qinghua Hu,et al.  MULFE: Multi-Label Learning via Label-Specific Feature Space Ensemble , 2021, ACM Trans. Knowl. Discov. Data.

[12]  D. Gong,et al.  A Fast Hybrid Feature Selection Based on Correlation-Guided Clustering and Particle Swarm Optimization for High-Dimensional Data , 2021, IEEE Transactions on Cybernetics.

[13]  Hossein Nezamabadi-pour,et al.  MFS-MCDM: Multi-label feature selection using multi-criteria decision making , 2020, Knowl. Based Syst..

[14]  Yong Zhang,et al.  Multiobjective Particle Swarm Optimization for Feature Selection With Fuzzy Cost , 2020, IEEE Transactions on Cybernetics.

[15]  Kay Chen Tan,et al.  Multi-label Feature Selection via Global Relevance and Redundancy Optimization , 2020, IJCAI.

[16]  Hossein Nezamabadi-pour,et al.  MLACO: A multi-label feature selection algorithm based on ant colony optimization , 2020, Knowl. Based Syst..

[17]  Ju-Sheng Mi,et al.  A novel approach for learning label correlation with application to feature selection of multi-label data , 2020, Inf. Sci..

[18]  Zhengming Ding,et al.  Generative Correlation Discovery Network for Multi-label Learning , 2019, 2019 IEEE International Conference on Data Mining (ICDM).

[19]  Zhiming Luo,et al.  Manifold regularized discriminative feature selection for multi-label learning , 2019, Pattern Recognit..

[20]  Tong Wei,et al.  Learning Compact Model for Large-Scale Multi-Label Data , 2019, AAAI.

[21]  Rui Huang,et al.  Manifold-based constraint Laplacian score for multi-label feature selection , 2018, Pattern Recognit. Lett..

[22]  William Zhu,et al.  Multi-label feature selection via feature manifold learning and sparsity regularization , 2018, Int. J. Mach. Learn. Cybern..

[23]  Feiping Nie,et al.  SVM based multi-label learning with missing labels for image annotation , 2018, Pattern Recognit..

[24]  Qinghua Hu,et al.  Multi-label feature selection with missing labels , 2018, Pattern Recognit..

[25]  Qinghua Hu,et al.  Streaming Feature Selection for Multilabel Learning Based on Fuzzy Mutual Information , 2017, IEEE Transactions on Fuzzy Systems.

[26]  Zhi-Hua Zhou,et al.  Multi-Label Learning with Global and Local Label Correlation , 2017, IEEE Transactions on Knowledge and Data Engineering.

[27]  Xindong Wu,et al.  Learning Label-Specific Features and Class-Dependent Labels for Multi-Label Classification , 2016, IEEE Transactions on Knowledge and Data Engineering.

[28]  Zhi-Hua Zhou,et al.  A Unified View of Multi-Label Performance Measures , 2016, ICML.

[29]  Xin Yao,et al.  A Survey on Evolutionary Computation Approaches to Feature Selection , 2016, IEEE Transactions on Evolutionary Computation.

[30]  Huan Liu,et al.  Multi-Label Informed Feature Selection , 2016, IJCAI.

[31]  Qinghua Hu,et al.  Multi-label feature selection based on max-dependency and min-redundancy , 2015, Neurocomputing.

[32]  Feiping Nie,et al.  Clustering and projected clustering with adaptive neighbors , 2014, KDD.

[33]  Min-Ling Zhang,et al.  A Review on Multi-Label Learning Algorithms , 2014, IEEE Transactions on Knowledge and Data Engineering.

[34]  Witold Pedrycz,et al.  Multi-label classification by exploiting label correlations , 2014, Expert Syst. Appl..

[35]  Feiping Nie,et al.  Efficient and Robust Feature Selection via Joint ℓ2, 1-Norms Minimization , 2010, NIPS.

[36]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[37]  J. Demšar Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[38]  Richard H. Bartels,et al.  Algorithm 432 [C2]: Solution of the matrix equation AX + XB = C [F4] , 1972, Commun. ACM.

[39]  O. J. Dunn Multiple Comparisons among Means , 1961 .

[40]  Kui Yu,et al.  Causal Feature Selection with Missing Data , 2022, ACM Trans. Knowl. Discov. Data.