Multi-Label Informed Feature Selection

Multi-label learning has been extensively studied in the area of bioinformatics, information retrieval, multimedia annotation, etc. In multi-label learning, each instance is associated with multiple interdependent class labels, the label information can be noisy and incomplete. In addition, multilabeled data often has noisy, irrelevant and redundant features of high dimensionality. As an effective data preprocessing step, feature selection has shown its effectiveness to prepare high-dimensional data for numerous data mining and machine learning tasks. Most of existing multi-label feature selection algorithms either boil down to solving multiple single-labeled feature selection problems or directly make use of imperfect labels. Therefore, they may not be able to find discriminative features that are shared by multiple labels. In this paper, we propose a novel multi-label informed feature selection framework MIFS, which exploits label correlations to select discriminative features across multiple labels. Specifically, to reduce the negative effects of imperfect label information in finding label correlations, we decompose the multi-label information into a low-dimensional space and then employ the reduced space to steer the feature selection process. Empirical studies on real-world datasets demonstrate the effectiveness and efficiency of the proposed framework.

[1]  Yi Yang,et al.  A Convex Formulation for Semi-Supervised Multi-Label Feature Selection , 2014, AAAI.

[2]  Zhi-Hua Zhou,et al.  Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization , 2006, IEEE Transactions on Knowledge and Data Engineering.

[3]  Huan Liu,et al.  Robust Unsupervised Feature Selection on Networked Data , 2016, SDM.

[4]  Stefan C. Kremer,et al.  Neural and Statistical Classification to Families of Bio-sequences , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[5]  Deng Cai,et al.  Unsupervised feature selection for multi-cluster data , 2010, KDD.

[6]  M.N.S. Swamy,et al.  IEEE TRANSACTIONS ON MULTIMEDIA STEERING COMMITTEE MEMBERS , 2005 .

[7]  Jiebo Luo,et al.  Multilabel machine learning and its application to semantic scene classification , 2003, IS&T/SPIE Electronic Imaging.

[8]  IEEE Transactions on Knowledge and Data Engineering, Vol. 14 , 2002 .

[9]  Hiroshi Motoda,et al.  Computational Methods of Feature Selection , 2007 .

[10]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[11]  Jieping Ye,et al.  Extracting shared subspace for multi-label classification , 2008, KDD.

[12]  Yiming Yang,et al.  Multilabel classification with meta-level features , 2010, SIGIR.

[13]  Jiawei Han,et al.  ACM Transactions on Knowledge Discovery from Data: Introduction , 2007 .

[14]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[15]  David J. Spiegelhalter,et al.  Machine Learning, Neural and Statistical Classification , 2009 .

[16]  Huan Liu,et al.  Unsupervised Streaming Feature Selection in Social Media , 2015, CIKM.

[17]  Zhi-Hua Zhou,et al.  Multi-Label Learning by Exploiting Label Correlations Locally , 2012, AAAI.

[18]  Yiming Yang,et al.  RCV1: A New Benchmark Collection for Text Categorization Research , 2004, J. Mach. Learn. Res..

[19]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Jason Weston,et al.  A kernel method for multi-labelled classification , 2001, NIPS.

[21]  Dimitri P. Bertsekas,et al.  Nonlinear Programming , 1997 .

[22]  Stefien Bickel,et al.  ECML-PKDD Discovery Challenge 2006 Overview , 2006 .

[23]  Grigorios Tsoumakas,et al.  Multilabel Text Classification for Automated Tag Suggestion , 2008 .

[24]  Zi Huang,et al.  Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence ℓ2,1-Norm Regularized Discriminative Feature Selection for Unsupervised Learning , 2022 .

[25]  Kewei Cheng,et al.  Feature Selection , 2016, ACM Comput. Surv..

[26]  David G. Stork,et al.  Pattern Classification , 1973 .

[27]  Xian-Sheng Hua,et al.  Online multi-label active annotation: towards large-scale content-based video search , 2008, ACM Multimedia.

[28]  Michael K. Buckland,et al.  Annual Review of Information Science and Technology , 2006, J. Documentation.

[29]  Yang Song,et al.  A sparse gaussian processes classification framework for fast tag suggestions , 2008, CIKM '08.

[30]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[31]  S. Dumais Latent Semantic Analysis. , 2005 .

[32]  Nicu Sebe,et al.  Web Image Annotation Via Subspace-Sparsity Collaborated Feature Selection , 2012, IEEE Transactions on Multimedia.

[33]  Volker Tresp,et al.  Multi-label informed latent semantic indexing , 2005, SIGIR '05.

[34]  Feiping Nie,et al.  Efficient and Robust Feature Selection via Joint ℓ2, 1-Norms Minimization , 2010, NIPS.

[35]  Sui-Yu Wang Huan Liu and Hiroshi Motoda (eds): Computational methods of feature selection , 2008, Pattern Analysis and Applications.

[36]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[37]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[38]  Zhi-Hua Zhou,et al.  Multilabel dimensionality reduction via dependence maximization , 2008, TKDD.

[39]  Lei Tang,et al.  Large scale multi-label classification via metalabeler , 2009, WWW '09.

[40]  Jieping Ye,et al.  Multi-Task Feature Learning Via Efficient l2, 1-Norm Minimization , 2009, UAI.