Feature selection for multi-label learning with streaming label

Abstract Multi-label feature selection has drawn wide attention in recent years. The existing multi-label feature selection algorithms mainly assume that the labels of the training data are obtained before feature selection takes place. However, this assumption does not always founded because the acquisition of labeling data is costly. In real-world applications, the available labels usually arrive one by one over time. To address this problem, we develop a novel multi-label feature selection method under the circumstance of streaming label to select a set of the most relevant and discriminative features. Specifically, we firstly select label-specific features for each newly-arrived label by designing inter-class discrimination and intra-class neighbor recognition. Then, a feature conversion is created to fuse the generated label-specific feature sets. Comprehensive experiments on a series of benchmark data sets clearly demonstrate the superiority of the proposed method against other state-of-the-art multi-label feature selection methods.

[1]  Jiye Liang,et al.  Fuzzy-rough feature selection accelerator , 2015, Fuzzy Sets Syst..

[2]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[3]  Zhiming Luo,et al.  Manifold regularized discriminative feature selection for multi-label learning , 2019, Pattern Recognit..

[4]  Ivor W. Tsang,et al.  Making Trillion Correlations Feasible in Feature Grouping and Selection , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  M. Friedman A Comparison of Alternative Tests of Significance for the Problem of $m$ Rankings , 1940 .

[6]  Grigorios Tsoumakas,et al.  MULAN: A Java Library for Multi-Label Learning , 2011, J. Mach. Learn. Res..

[7]  Jie Duan,et al.  Multi-label feature selection based on neighborhood mutual information , 2016, Appl. Soft Comput..

[8]  Yun Li,et al.  Graph-Margin Based Multi-label Feature Selection , 2016, ECML/PKDD.

[9]  Dae-Won Kim,et al.  Feature selection for multi-label classification using multivariate mutual information , 2013, Pattern Recognit. Lett..

[10]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[11]  Jinghua Liu,et al.  Feature selection for multi-label learning with missing labels , 2019, Applied Intelligence.

[12]  Zhi-Hua Zhou,et al.  Multilabel dimensionality reduction via dependence maximization , 2008, TKDD.

[13]  Dacheng Tao,et al.  Streaming Label Learning for Modeling Labels on the Fly , 2016, ArXiv.

[14]  Weiwei Liu,et al.  Deep Discrete Prototype Multilabel Learning , 2018, IJCAI.

[15]  Chris H. Q. Ding,et al.  Multi-label ReliefF and F-statistic feature selections for image annotation , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Yaojin Lin,et al.  Feature selection based on quality of information , 2017, Neurocomputing.

[17]  Ping Zhang,et al.  Distinguishing two types of labels for multi-label feature selection , 2019, Pattern Recognit..

[18]  Shunxiang Wu,et al.  Feature selection for multi-label learning based on kernelized fuzzy rough sets , 2018, Neurocomputing.

[19]  Huan Liu,et al.  Toward integrating feature selection algorithms for classification and clustering , 2005, IEEE Transactions on Knowledge and Data Engineering.

[20]  Witold Pedrycz,et al.  Large-Scale Multimodality Attribute Reduction With Multi-Kernel Fuzzy Rough Sets , 2018, IEEE Transactions on Fuzzy Systems.

[21]  Zhi-Hua Zhou,et al.  Multi-Label Learning with Emerging New Labels , 2018, IEEE Transactions on Knowledge and Data Engineering.

[22]  Huan Liu,et al.  Multi-Label Informed Feature Selection , 2016, IJCAI.

[23]  Yiming Yang,et al.  RCV1: A New Benchmark Collection for Text Categorization Research , 2004, J. Mach. Learn. Res..

[24]  Witold Pedrycz,et al.  Granular multi-label feature selection based on mutual information , 2017, Pattern Recognit..

[25]  Min-Ling Zhang,et al.  A Review on Multi-Label Learning Algorithms , 2014, IEEE Transactions on Knowledge and Data Engineering.

[26]  Qinghua Hu,et al.  Streaming Feature Selection for Multilabel Learning Based on Fuzzy Mutual Information , 2017, IEEE Transactions on Fuzzy Systems.

[27]  Jason Weston,et al.  A kernel method for multi-labelled classification , 2001, NIPS.

[28]  Shunxiang Wu,et al.  Online multi-label streaming feature selection based on neighborhood rough set , 2018, Pattern Recognit..

[29]  Jiawei Han,et al.  Correlated multi-label feature selection , 2011, CIKM '11.

[30]  Qinghua Hu,et al.  A Fitting Model for Feature Selection With Fuzzy Rough Sets , 2017, IEEE Transactions on Fuzzy Systems.

[31]  Víctor Robles,et al.  Feature selection for multi-label naive Bayes classification , 2009, Inf. Sci..

[32]  Yuwen Li,et al.  Attribute reduction for multi-label learning with fuzzy rough set , 2018, Knowl. Based Syst..

[33]  Qinghua Hu,et al.  Multi-label feature selection based on max-dependency and min-redundancy , 2015, Neurocomputing.

[34]  Qinghua Hu,et al.  Multi-label feature selection with streaming labels , 2016, Inf. Sci..

[35]  Jianhua Xu,et al.  A weighted linear discriminant analysis framework for multi-label feature extraction , 2018, Neurocomputing.

[36]  Zhiming Luo,et al.  Towards a unified multi-source-based optimization framework for multi-label learning , 2019, Appl. Soft Comput..

[37]  Haytham Elghazel,et al.  A Comparison of Multi-Label Feature Selection Methods Using the Random Forest Paradigm , 2014, Canadian Conference on AI.

[38]  Newton Spolaôr,et al.  ReliefF for Multi-label Feature Selection , 2013, 2013 Brazilian Conference on Intelligent Systems.

[39]  David D. Lewis,et al.  Feature Selection and Feature Extraction for Text Categorization , 1992, HLT.

[40]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[41]  Lei Wu,et al.  Lift: Multi-Label Learning with Label-Specific Features , 2015, IEEE Trans. Pattern Anal. Mach. Intell..

[42]  Pei Zhang,et al.  Classifying Categorical Data by Rule-Based Neighbors , 2011, 2011 IEEE 11th International Conference on Data Mining.

[43]  Shunxiang Wu,et al.  Different classes' ratio fuzzy rough set based robust feature selection , 2017, Knowl. Based Syst..

[44]  Alex Alves Freitas,et al.  Two Extensions to Multi-label Correlation-Based Feature Selection: A Case Study in Bioinformatics , 2013, 2013 IEEE International Conference on Systems, Man, and Cybernetics.

[45]  Shunxiang Wu,et al.  Online Multi-label Group Feature Selection , 2017, Knowl. Based Syst..

[46]  O. J. Dunn Multiple Comparisons among Means , 1961 .

[47]  Dae-Won Kim,et al.  SCLS: Multi-label feature selection based on scalable criterion for large label set , 2017, Pattern Recognit..

[48]  Weiwei Liu,et al.  Multilabel Prediction via Cross-View Search , 2018, IEEE Transactions on Neural Networks and Learning Systems.