Semi-supervised multi-label classification using incomplete label information

Abstract Classifying multi-label instances using incompletely labeled instances is one of the fundamental tasks in multi-label learning. Most existing methods treat this task as a supervised weak-label learning problem and assume that sufficient partially labeled instances are available. However, collecting or annotating such instances is expensive and time-consuming, whereas abundant unlabeled instances are easy to accumulate. Some recent methods exploit unlabeled instances and perform transductive multi-label classification, but they cannot be applied directly to new instances that are not available during training. In this paper, we propose an approach called Semi-supervised multi-label classification using incomplete label information (SMILE for short). SMILE first estimates label correlation from partially labeled instances and replenishes the missing labels of these instances. It then uses both labeled and unlabeled instances to construct a neighborhood graph. Next, the known and replenished labels of the labeled instances, along with the unlabeled instances, are exploited to train a graph-based semi-supervised linear classifier. SMILE can further replenish the missing labels of training instances based on the adopted neighborhood graph, and it can directly predict the labels of completely unlabeled new instances. An empirical study on multi-label datasets shows that SMILE performs significantly better than other related methods across various evaluation criteria, and that leveraging unlabeled data together with label correlation is important for multi-label classification.
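
The pipeline described in the abstract can be illustrated with a minimal Python sketch. The abstract gives no formulas, so every concrete choice here is an assumption made for illustration and not the authors' formulation: cosine similarity between label columns as the correlation estimate, an averaged correlation score for replenishing labels, a Gaussian-weighted k-nearest-neighbor graph, and a Laplacian-regularized least-squares linear classifier. All function names and the parameters `k`, `alpha`, and `beta` are hypothetical.

```python
import numpy as np

def label_correlation(Y):
    """Cosine similarity between label columns (assumed correlation measure)."""
    norms = np.linalg.norm(Y, axis=0)
    norms[norms == 0] = 1.0              # avoid division by zero for unused labels
    C = (Y.T @ Y) / np.outer(norms, norms)
    np.fill_diagonal(C, 0.0)             # a label does not vote for itself
    return C

def replenish(Y, C):
    """Give possibly missing labels a soft score in [0, 1]; known labels stay 1."""
    known = np.maximum(Y.sum(axis=1, keepdims=True), 1.0)
    soft = (Y @ C) / known               # average correlation with the known labels
    return np.maximum(Y, soft)

def knn_graph(X, k=5):
    """Symmetric k-nearest-neighbor graph with Gaussian edge weights."""
    sq = np.sum(X ** 2, axis=1)
    D = np.maximum(sq[:, None] + sq[None, :] - 2.0 * X @ X.T, 0.0)
    order = D.copy()
    np.fill_diagonal(order, -np.inf)     # force each point to sort first in its row
    idx = np.argsort(order, axis=1)[:, 1:k + 1]   # drop self, keep k neighbors
    n = X.shape[0]
    rows = np.repeat(np.arange(n), k)
    cols = idx.ravel()
    W = np.zeros((n, n))
    W[rows, cols] = np.exp(-D[rows, cols] / (D.mean() + 1e-12))
    return np.maximum(W, W.T)

def fit_linear(X_lab, Y_lab, X_all, W, alpha=1.0, beta=0.1):
    """Closed-form Laplacian-regularized least squares (assumed objective):
    ||X_lab B - Y_lab||^2 + alpha ||B||^2 + beta tr(B^T X_all^T L X_all B)."""
    L = np.diag(W.sum(axis=1)) - W       # unnormalized graph Laplacian
    d = X_all.shape[1]
    A = X_lab.T @ X_lab + alpha * np.eye(d) + beta * X_all.T @ L @ X_all
    return np.linalg.solve(A, X_lab.T @ Y_lab)

# Toy data: the first 10 of 30 instances carry (incomplete) labels.
rng = np.random.default_rng(0)
X_all = rng.normal(size=(30, 4))
Y_lab = (rng.random((10, 3)) > 0.5).astype(float)

C = label_correlation(Y_lab)
Y_hat = replenish(Y_lab, C)
W = knn_graph(X_all, k=5)
B = fit_linear(X_all[:10], Y_hat, X_all, W)

# Because the classifier is linear in the features, it applies to unseen instances.
x_new = rng.normal(size=(1, 4))
scores_new = x_new @ B
```

The last two lines reflect the inductive property the abstract emphasizes: once the coefficient matrix is learned, scoring a new instance needs only its feature vector, not its position in the training graph.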
