论文信息 - Conditional Restricted Boltzmann Machines for Multi-label Learning with Incomplete Labels

Conditional Restricted Boltzmann Machines for Multi-label Learning with Incomplete Labels

Standard multi-label learning methods assume fully labeled training data. This assumption however is impractical in many application domains where labels are dicult to collect and missing labels are prevalent. In this paper, we develop a novel conditional restricted Boltzmann machine model to address multi-label learning with incomplete labels. It uses a restricted Boltzmann machine to capture the high-order label dependence relationships in the output space, aiming to enhance the capacity of recovering missing labels and learning high quality multi-label prediction models. Moreover, it also incorporates label co-occurrence information retrieved from auxiliary resources as prior knowledge. We perform model training by maximizing the regularized marginal conditional likelihood of the label vectors given the input features, and develop a Viterbi style EM algorithm to solve the induced optimization problem. The proposed approach is evaluated on four real word multi-label data sets by comparing to a number of state-of-the-art methods. The experimental results show it outperforms all the other comparison methods across the applied data sets.

Xin Li | Yuhong Guo | Feipeng Zhao

[1] Grigorios Tsoumakas,et al. Effective and Efficient Multilabel Classification in Domains with Large Number of Labels , 2008 .

[2] Concha Bielza,et al. Bayesian Chain Classifiers for Multidimensional Classification , 2011, IJCAI.

[3] Inderjit S. Dhillon,et al. Large-scale Multi-label Learning with Missing Labels , 2013, ICML.

[4] Honglak Lee,et al. Augmenting CRFs with Boltzmann Machine Shape Priors for Image Labeling , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[5] Geoffrey E. Hinton. Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[6] Paul Smolensky,et al. Information processing in dynamical systems: foundations of harmony theory , 1986 .

[7] Antonio Torralba,et al. Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[8] David A. Forsyth,et al. Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary , 2002, ECCV.

[9] Marcel Worring,et al. The challenge problem for automated detection of 101 semantic concepts in multimedia , 2006, MM '06.

[10] Theodora Tsikrika,et al. The Wikipedia Image Retrieval Task , 2010, ImageCLEF.

[11] Hsuan-Tien Lin,et al. Multilabel Classification with Principal Label Space Transformation , 2012, Neural Computation.