论文信息 - Semi-Supervised Low-Rank Semantics Grouping for Zero-Shot Learning

Semi-Supervised Low-Rank Semantics Grouping for Zero-Shot Learning

Zero-shot learning has received great interest in visual recognition community. It aims to classify new unobserved classes based on the model learned from observed classes. Most zero-shot learning methods require pre-provided semantic attributes as the mid-level information to discover the intrinsic relationship between observed and unobserved categories. However, it is impractical to annotate the enriched label information of the observed objects in real-world applications, which would extremely hurt the performance of zero-shot learning with limited labeled seen data. To overcome this obstacle, we develop a Low-rank Semantics Grouping (LSG) method for zero-shot learning in a semi-supervised fashion, which attempts to jointly uncover the intrinsic relationship across visual and semantic information and recover the missing label information from seen classes. Specifically, the visual-semantic encoder is utilized as projection model, low-rank semantic grouping scheme is explored to capture the intrinsic attributes correlations and a Laplacian graph is constructed from the visual features to guide the label propagation from labeled instances to unlabeled ones. Experiments have been conducted on several standard zero-shot learning benchmarks, which demonstrate the efficiency of the proposed method by comparing with state-of-the-art methods. Our model is robust to different levels of missing label settings. Also visualized results prove that the LSG can distinguish the test unseen classes more discriminative.

[1] Zhengming Ding,et al. Marginalized Latent Semantic Encoder for Zero-Shot Learning , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[2] Xiaojin Zhu,et al. --1 CONTENTS , 2006 .

[3] Yoshua Bengio,et al. Zero-data Learning of New Tasks , 2008, AAAI.

[4] Feiping Nie,et al. New Graph Structured Sparsity Model for Multi-label Image Annotations , 2013, 2013 IEEE International Conference on Computer Vision.

[5] Philip H. S. Torr,et al. An embarrassingly simple approach to zero-shot learning , 2015, ICML.

[6] Kristen Grauman,et al. Zero-shot recognition with unreliable attributes , 2014, NIPS.

[7] Rong Jin,et al. Multi-label learning with incomplete class assignments , 2011, CVPR 2011.

[8] Yong Yu,et al. Robust Recovery of Subspace Structures by Low-Rank Representation , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9] Emmanuel J. Candès,et al. A Singular Value Thresholding Algorithm for Matrix Completion , 2008, SIAM J. Optim..

[10] Bernt Schiele,et al. Evaluation of output embeddings for fine-grained image classification , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11] Yue Gao,et al. Zero-Shot Learning With Transferred Samples , 2017, IEEE Transactions on Image Processing.

[12] Yannis Avrithis,et al. Label Propagation for Deep Semi-Supervised Learning , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[13] Weiping Wang,et al. Multi-Class Learning using Unlabeled Samples: Theory and Algorithm , 2019, IJCAI.

[14] Bernt Schiele,et al. Latent Embeddings for Zero-Shot Classification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15] Liang Wang,et al. Deep Unbiased Embedding Transfer for Zero-Shot Learning , 2020, IEEE Transactions on Image Processing.

[16] Venkatesh Saligrama,et al. Zero-Shot Learning via Semantic Similarity Embedding , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[17] Venkatesh Saligrama,et al. Zero-Shot Learning via Joint Latent Similarity Embedding , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[19] Andrew Zisserman,et al. Learning Visual Attributes , 2007, NIPS.

[20] Ming Shao,et al. Missing Modality Transfer Learning via Latent Low-Rank Constraint , 2015, IEEE Transactions on Image Processing.

[21] Ling Shao,et al. Triple Verification Network for Generalized Zero-Shot Learning , 2019, IEEE Transactions on Image Processing.

[22] Cees Snoek,et al. Attributes Make Sense on Segmented Objects , 2014, ECCV.

[23] Rainer Stiefelhagen,et al. Recovering the Missing Link: Predicting Class-Attribute Associations for Unsupervised Zero-Shot Learning , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24] Wei-Lun Chao,et al. Synthesized Classifiers for Zero-Shot Learning , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25] FrankEibe,et al. Classifier chains for multi-label classification , 2011 .

[26] Jun Yu,et al. Zero-Shot Learning via Robust Latent Representation and Manifold Regularization , 2019, IEEE Transactions on Image Processing.

[27] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[28] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29] Bernt Schiele,et al. Transfer Learning in a Transductive Setting , 2013, NIPS.

[30] Cordelia Schmid,et al. Label-Embedding for Attribute-Based Classification , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[31] Shuicheng Yan,et al. Smoothed Low Rank and Sparse Matrix Recovery by Iteratively Reweighted Least Squares Minimization , 2014, IEEE Transactions on Image Processing.

[32] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33] Dale Schuurmans,et al. Semi-Supervised Zero-Shot Classification with Label Representation Learning , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[34] Geoffrey E. Hinton,et al. Zero-shot Learning with Semantic Output Codes , 2009, NIPS.

[35] Xin Li,et al. Max-Margin Zero-Shot Learning for Multi-class Classification , 2015, AISTATS.

[36] Gabriela Csurka,et al. Metric Learning for Large Scale Image Classification: Generalizing to New Classes at Near-Zero Cost , 2012, ECCV.

[37] Chen Xu,et al. The SUN Attribute Database: Beyond Categories for Deeper Scene Understanding , 2014, International Journal of Computer Vision.

[38] Trevor Darrell,et al. Generalized Zero- and Few-Shot Learning via Aligned Variational Autoencoders , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[39] Samy Bengio,et al. Zero-Shot Learning by Convex Combination of Semantic Embeddings , 2013, ICLR.

[40] Yu-Chiang Frank Wang,et al. Robust Face Recognition With Structurally Incoherent Low-Rank Matrix Decomposition , 2014, IEEE Transactions on Image Processing.

[41] Christoph H. Lampert,et al. Attribute-Based Classification for Zero-Shot Visual Object Categorization , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42] Ming Yang,et al. Mining partially annotated images , 2011, KDD.

[43] Richard H. Bartels,et al. Algorithm 432 [C2]: Solution of the matrix equation AX + XB = C [F4] , 1972, Commun. ACM.

[44] Ling Shao,et al. Zero-VAE-GAN: Generating Unseen Features for Generalized and Transductive Zero-Shot Learning , 2020, IEEE Transactions on Image Processing.

[45] Christoph H. Lampert,et al. Zero-Shot Learning—A Comprehensive Evaluation of the Good, the Bad and the Ugly , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46] Shaogang Gong,et al. Semantic Autoencoder for Zero-Shot Learning , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[47] Hongguang Zhang,et al. Zero-Shot Kernel Learning , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[48] Fatih Porikli,et al. A Unified Approach for Conventional Zero-Shot, Generalized Zero-Shot, and Few-Shot Learning , 2017, IEEE Transactions on Image Processing.

[49] Eyke Hüllermeier,et al. Multilabel classification via calibrated label ranking , 2008, Machine Learning.

[50] Ming Shao,et al. Deep Robust Encoder Through Locality Preserving Low-Rank Dictionary , 2016, ECCV.

[51] Jianmin Wang,et al. Transfer Learning with Graph Co-Regularization , 2012, IEEE Transactions on Knowledge and Data Engineering.

[52] Miao Xu,et al. Incomplete Label Distribution Learning , 2017, IJCAI.

[53] Geoff Holmes,et al. Classifier chains for multi-label classification , 2009, Machine Learning.

[54] Christoph H. Lampert,et al. Learning to detect unseen object classes by between-class attribute transfer , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[55] Ali Farhadi,et al. Describing objects by their attributes , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[56] Yuhong Guo,et al. Semi-Supervised Multi-Label Learning with Incomplete Labels , 2015, IJCAI.

[57] Junfeng Yang,et al. A New Alternating Minimization Algorithm for Total Variation Image Reconstruction , 2008, SIAM J. Imaging Sci..

[58] Shuicheng Yan,et al. Latent Low-Rank Representation for subspace segmentation and feature extraction , 2011, 2011 International Conference on Computer Vision.

[59] Ming Shao,et al. Low-Rank Embedded Ensemble Semantic Dictionary for Zero-Shot Learning , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[60] Pietro Perona,et al. The Caltech-UCSD Birds-200-2011 Dataset , 2011 .

[61] Shaogang Gong,et al. Transductive Multi-view Embedding for Zero-Shot Recognition and Annotation , 2014, ECCV.

[62] Wei Liu,et al. Multi-label Learning with Missing Labels Using Mixed Dependency Graphs , 2018, International Journal of Computer Vision.

[63] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[64] Chris H. Q. Ding,et al. Convex and Semi-Nonnegative Matrix Factorizations , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.