论文信息 - Multiview self-learning

Multiview self-learning

In many applications, observations are available with different views. This is, for example, the case with image-text classification, multilingual document classification or document classification on the web. In addition, unlabeled multiview examples can be easily acquired, but assigning labels to these examples is usually a time consuming task. We describe a multiview self-learning strategy which trains different voting classifiers on different views. The margin distributions over the unlabeled training data, obtained with each view-specific classifier are then used to estimate an upper-bound on their transductive Bayes error. Minimizing this upper-bound provides an automatic margin-threshold which is used to assign pseudo-labels to unlabeled examples. Final class labels are then assigned to these examples, by taking a vote on the pool of the previous pseudo-labels. New view-specific classifiers are then trained using the labeled and pseudo-labeled training data. We consider applications to image-text classification and to multilingual document classification. We present experimental results on the NUS-WIDE collection and on Reuters RCV1-RCV2 which show that despite its simplicity, our approach is competitive with other state-of-the-art techniques.

[1] Meng Wang,et al. Semisupervised Multiview Distance Metric Learning for Cartoon Synthesis , 2012, IEEE Transactions on Image Processing.

[2] Sebastian Thrun,et al. Learning to Classify Text from Labeled and Unlabeled Documents , 1998, AAAI/IAAI.

[3] Thorsten Joachims,et al. Transductive Inference for Text Classification using Support Vector Machines , 1999, ICML.

[4] Yiming Yang,et al. RCV1: A New Benchmark Collection for Text Categorization Research , 2004, J. Mach. Learn. Res..

[5] Robert E. Schapire,et al. Theoretical Views of Boosting , 1999, EuroCOLT.

[6] Xiaojin Zhu,et al. --1 CONTENTS , 2006 .

[7] Chee Sun Won,et al. Efficient use of local edge histogram descriptor , 2000, MULTIMEDIA '00.

[8] Massih-Reza Amini,et al. Combining coregularization and consensus-based self-training for multilingual text categorization , 2010, SIGIR.

[9] David G. Lowe,et al. Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[10] Markus A. Stricker,et al. Similarity of color images , 1995, Electronic Imaging.

[11] Yongdong Zhang,et al. Multiview Spectral Embedding , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[12] François Laviolette,et al. A Transductive Bound for the Voted Classifier with an Application to Semi-supervised Learning , 2008, NIPS.

[13] Linda G. Shapiro,et al. Computer Vision , 2001 .

[14] Gholamreza Haffari,et al. Transductive learning for statistical machine translation , 2007, ACL.

[15] Yoram Singer,et al. Unsupervised Models for Named Entity Classification , 1999, EMNLP.

[16] Massih-Reza Amini,et al. Multiview Semi-supervised Learning for Ranking Multilingual Documents , 2011, ECML/PKDD.

[17] David S. Rosenberg,et al. The rademacher complexity of coregularized kernel classes , 2007 .

[18] Jiao Wang,et al. Exploiting Ensemble Method in Semi-Supervised Learning , 2006, 2006 International Conference on Machine Learning and Cybernetics.

[19] Ilya Narsky,et al. Reducing Multiclass to Binary , 2013 .

[20] Tat-Seng Chua,et al. NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.

[21] Thorsten Joachims,et al. Training linear SVMs in linear time , 2006, KDD '06.

[22] Mikhail Belkin,et al. Semi-Supervised Learning on Riemannian Manifolds , 2004, Machine Learning.

[23] Massih-Reza Amini,et al. Learning from Multiple Partially Observed Views - an Application to Multilingual Text Categorization , 2009, NIPS.

[24] Jing Huang,et al. Image indexing using color correlograms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[25] David Yarowsky,et al. Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[26] Michael I. Jordan,et al. Multiple kernel learning, conic duality, and the SMO algorithm , 2004, ICML.

[27] Cordelia Schmid,et al. Multimodal semi-supervised learning for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[28] Yoram Singer,et al. Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers , 2000, J. Mach. Learn. Res..

[29] Avrim Blum,et al. The Bottleneck , 2021, Monopsony Capitalism.

[30] Zhi-Hua Zhou,et al. Tri-training: exploiting unlabeled data using three classifiers , 2005, IEEE Transactions on Knowledge and Data Engineering.

[31] Chong-Wah Ngo,et al. Click-through-based cross-view learning for image search , 2014, SIGIR.

[32] B. S. Manjunath,et al. Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[33] Mikhail Belkin,et al. Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[34] Sham M. Kakade,et al. Multi-view Regression Via Canonical Correlation Analysis , 2007, COLT.

[35] Jun Yu,et al. Click Prediction for Web Image Reranking Using Multimodal Sparse Coding , 2014, IEEE Transactions on Image Processing.

[36] Tom Diethe,et al. Multiview Fisher Discriminant Analysis , 2008 .