Cost-sensitive feature selection on multi-label data via neighborhood granularity and label enhancement

Multi-label feature selection, which is an efficient and effective pre-processing step in machine learning and data mining, can select a feature subset that contains more contributions for multi-label classification while improving the performance of the classifiers. In real-world applications, an instance may be associated with multiple related labels with different relative importances, and the process of obtaining different features usually requires different costs, containing money, and time, etc. However, most existing works with regard to multi-label feature selection do not take into consideration the above two critical issues simultaneously. Therefore, in this paper, we exploit the idea of neighborhood granularity to enhance the traditional logical labels into label distribution forms to excavate the deeper supervised information hidden in multi-label data, and further consider the effect of the test cost under three different distributions, simultaneously. Motivated by these issues, a novel test cost multi-label feature selection algorithm with label enhancement and neighborhood granularity is designed. Moreover, the proposed algorithm is tested upon ten publicly available benchmark multi-label datasets with six widely-used metrics from two different aspects. Finally, two groups of experimental results demonstrate that the proposed algorithm achieves the satisfactory and superior performance over other four state-of-the-art comparing algorithms, and it is effective for improving the learning performance and decreasing the total test costs of the selected feature subset.

[1]  Witold Pedrycz,et al.  Granular multi-label feature selection based on mutual information , 2017, Pattern Recognit..

[2]  Witold Pedrycz,et al.  Measuring relevance between discrete and continuous features based on neighborhood mutual information , 2011, Expert Syst. Appl..

[3]  Hamido Fujita,et al.  An incremental attribute reduction approach based on knowledge granularity with a multi-granulation view , 2017, Inf. Sci..

[4]  Grigorios Tsoumakas,et al.  A systematic review of multi-label feature selection and a new method based on label construction , 2016, Neurocomputing.

[5]  Ning Xu,et al.  Label Enhancement for Label Distribution Learning , 2019 .

[6]  Xin Geng,et al.  A multi-task model for simultaneous face identification and facial expression recognition , 2016, Neurocomputing.

[7]  André Carlos Ponce de Leon Ferreira de Carvalho,et al.  Reduction strategies for hierarchical multi-label classification in protein function prediction , 2016, BMC Bioinformatics.

[8]  Dae-Won Kim,et al.  Feature selection for multi-label classification using multivariate mutual information , 2013, Pattern Recognit. Lett..

[9]  Ju-Sheng Mi,et al.  A novel approach for learning label correlation with application to feature selection of multi-label data , 2020, Inf. Sci..

[10]  Huanyu Zhao,et al.  Nonlinear system modeling using self-organizing fuzzy neural networks for industrial applications , 2020, Applied Intelligence.

[11]  Hossein Nezamabadi-pour,et al.  MGFS: A multi-label graph-based feature selection algorithm via PageRank centrality , 2020, Expert Syst. Appl..

[12]  Michael K. Ng,et al.  ML-FOREST: A Multi-Label Tree Ensemble Method for Multi-Label Classification , 2016, IEEE Transactions on Knowledge and Data Engineering.

[13]  Qinghua Hu,et al.  Multi-label feature selection based on max-dependency and min-redundancy , 2015, Neurocomputing.

[14]  Sebastián Ventura,et al.  Distributed multi-label feature selection using individual mutual information measures , 2020, Knowl. Based Syst..

[15]  Youlong Yang,et al.  Multi-label imbalanced classification based on assessments of cost and value , 2018, Applied Intelligence.

[16]  Xin Yang,et al.  A temporal-spatial composite sequential approach of three-way granular computing , 2019, Inf. Sci..

[17]  Haytham Elghazel,et al.  Ensemble multi-label text categorization based on rotation forest and latent semantic indexing , 2016, Expert Syst. Appl..

[18]  Xin Jin,et al.  A Multi-Label Supervised Topic Model Conditioned on Arbitrary Features for Gene Function Prediction , 2019, Genes.

[19]  Weiwei Liu,et al.  Multi-Label Image Classification by Feature Attention Network , 2019, IEEE Access.

[20]  Jie Duan,et al.  Multi-label feature selection based on neighborhood mutual information , 2016, Appl. Soft Comput..

[21]  Fan Min,et al.  Representative-based classification through covering-based neighborhood rough sets , 2015, Applied Intelligence.

[22]  Robert E. Schapire,et al.  Hierarchical multi-label prediction of gene function , 2006, Bioinform..

[23]  Wu Liu,et al.  DELTA: A deep dual-stream network for multi-label image classification , 2019, Pattern Recognit..

[24]  Hossein Nezamabadi-pour,et al.  Multilabel feature selection: A comprehensive review and guiding experiments , 2018, WIREs Data Mining Knowl. Discov..

[25]  Hamido Fujita,et al.  An efficient selector for multi-granularity attribute reduction , 2019, Inf. Sci..

[26]  Hossein Nezamabadi-pour,et al.  A label-specific multi-label feature selection algorithm based on the Pareto dominance concept , 2019, Pattern Recognit..

[27]  Jinghua Liu,et al.  Feature selection for multi-label learning with missing labels , 2019, Applied Intelligence.

[28]  Qing Wang,et al.  A filter-based bare-bone particle swarm optimization algorithm for unsupervised feature selection , 2019, Applied Intelligence.

[29]  Hong Zhao,et al.  Test-cost-sensitive attribute reduction of data with normal distribution measurement errors , 2012, ArXiv.

[30]  Dae-Won Kim,et al.  Mutual Information-based multi-label feature selection using interaction information , 2015, Expert Syst. Appl..

[31]  Dongyan Zhao,et al.  Optimizing Multi-Query Evaluation in Federated RDF Systems , 2021, IEEE Transactions on Knowledge and Data Engineering.

[32]  Yuwen Li,et al.  Attribute reduction for multi-label learning with fuzzy rough set , 2018, Knowl. Based Syst..

[33]  Zhi-Hua Zhou,et al.  Multilabel dimensionality reduction via dependence maximization , 2008, TKDD.

[34]  Ming Yang,et al.  Sparse and low-rank representation for multi-label classification , 2018, Applied Intelligence.

[35]  Yehuda Koren,et al.  Factor in the neighbors: Scalable and accurate collaborative filtering , 2010, TKDD.

[36]  Shunxiang Wu,et al.  Online multi-label streaming feature selection based on neighborhood rough set , 2018, Pattern Recognit..

[37]  Jia Zhang,et al.  Mutual information based multi-label feature selection via constrained convex optimization , 2019, Neurocomputing.

[38]  O. J. Dunn Multiple Comparisons among Means , 1961 .

[39]  Kezhu Tan,et al.  Neighborhood mutual information and its application on hyperspectral band selection for classification , 2016 .

[40]  Na Li,et al.  Multi-label text categorization using L21-norm minimization extreme learning machine , 2017, Neurocomputing.

[41]  William Zhu,et al.  Attribute reduction of data with error ranges and test costs , 2012, Inf. Sci..

[42]  Dae-Won Kim,et al.  MFC: Initialization method for multi-label feature selection based on conditional mutual information , 2020, Neurocomputing.

[43]  Qi Wu,et al.  Attend and Imagine: Multi-Label Image Classification With Visual Attention and Recurrent Neural Networks , 2019, IEEE Transactions on Multimedia.

[44]  Liang Hu,et al.  Feature redundancy term variation for mutual information-based feature selection , 2020, Applied Intelligence.

[45]  Qinghua Hu,et al.  Neighborhood rough set based heterogeneous feature subset selection , 2008, Inf. Sci..

[46]  Sebastián Ventura,et al.  Scalable extensions of the ReliefF algorithm for weighting and selecting features on the multi-label learning context , 2015, Neurocomputing.

[47]  Hamido Fujita,et al.  Attribute group for attribute reduction , 2020, Inf. Sci..

[48]  Xin Yang,et al.  Local temporal-spatial multi-granularity learning for sequential three-way granular computing , 2020, Inf. Sci..

[49]  Shuzhi Sam Ge,et al.  Small traffic sign detection from large image , 2019, Applied Intelligence.

[50]  Xin Geng,et al.  Label Distribution Learning , 2013, 2013 IEEE 13th International Conference on Data Mining Workshops.

[51]  Philip S. Yu,et al.  Hierarchical Taxonomy-Aware and Attentional Graph Capsule RCNNs for Large-Scale Multi-Label Text Classification , 2019, IEEE Transactions on Knowledge and Data Engineering.

[52]  Zhi-Hua Zhou,et al.  Facial Age Estimation by Learning from Label Distributions , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[53]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[54]  Yuhua Qian,et al.  Accelerator for supervised neighborhood based attribute reduction , 2020, Int. J. Approx. Reason..

[55]  Angelo Gaeta,et al.  Resilience Analysis of Critical Infrastructures: A Cognitive Approach Based on Granular Computing , 2019, IEEE Transactions on Cybernetics.

[56]  Ping Zhang,et al.  Distinguishing two types of labels for multi-label feature selection , 2019, Pattern Recognit..

[57]  Hossein Nezamabadi-pour,et al.  MLACO: A multi-label feature selection algorithm based on ant colony optimization , 2020, Knowl. Based Syst..

[58]  M. Friedman A Comparison of Alternative Tests of Significance for the Problem of $m$ Rankings , 1940 .

[59]  Na Li,et al.  Multi-label Text Categorization Using $$L_{21}$$-norm Minimization Extreme Learning Machine , 2017 .

[60]  Qinghua Hu,et al.  Neighborhood classifiers , 2008, Expert Syst. Appl..

[61]  Yuhua Qian,et al.  Test-cost-sensitive attribute reduction , 2011, Inf. Sci..