论文信息 - A New Analysis of Co-Training - 字舞流文

A New Analysis of Co-Training

In this paper, we present a new analysis on co-training, a representative paradigm of disagreement-based semi-supervised learning methods. In our analysis the co-training process is viewed as a combinative label propagation over two views; this provides a possibility to bring the graph-based and disagreement-based semi-supervised methods into a unified framework. With the analysis we get some insight that has not been disclosed by previous theoretical studies. In particular, we provide the sufficient and necessary condition for co-training to succeed. We also discuss the relationship to previous theoretical results and give some other interesting implications of our results, such as combination of weight matrices and view split.

Zhi-Hua Zhou | Wei Wang | Zhi-Hua Zhou | Wei Wang

[1] Steven P. Abney,et al. Bootstrapping , 2002, ACL.

[2] Ulf Brefeld,et al. Multi-view Discriminative Sequential Learning , 2005, ECML.

[3] Zhi-Hua Zhou,et al. Semi-supervised learning by disagreement , 2010, Knowledge and Information Systems.

[4] Ronald Rosenfeld,et al. Semi-supervised learning with graphs , 2005 .

[5] Zhi-Hua Zhou,et al. Analyzing Co-training Style Algorithms , 2007, ECML.

[6] Tong Zhang,et al. Linear prediction models with graph regularization for web-page categorization , 2006, KDD '06.

[7] Rayid Ghani,et al. Analyzing the effectiveness and applicability of co-training , 2000, CIKM '00.

[8] Ulrike von Luxburg,et al. Influence of graph construction on graph-based clustering measures , 2008, NIPS.

[9] Mark Herbster,et al. Combining Graph Laplacians for Semi-Supervised Learning , 2005, NIPS.

[10] R. Bharat Rao,et al. Bayesian Co-Training , 2007, J. Mach. Learn. Res..

[11] Mikhail Belkin,et al. A Co-Regularization Approach to Semi-supervised Learning with Multiple Views , 2005 .

[12] Xiaojin Zhu,et al. --1 CONTENTS , 2006 .

[13] Mark Steedman,et al. Bootstrapping statistical parsers from small datasets , 2003, EACL.

[14] Shai Ben-David,et al. Does Unlabeled Data Provably Help? Worst-case Analysis of the Sample Complexity of Semi-Supervised Learning , 2008, COLT.

[15] Maria-Florina Balcan,et al. Co-Training and Expansion: Towards Bridging Theory and Practice , 2004, NIPS.

[16] Craig A. Knoblock,et al. Active + Semi-supervised Learning = Robust Multi-View Learning , 2002, ICML.

[17] Anoop Sarkar,et al. Corrected Co-training for Statistical Parsers , 2003 .

[18] Avrim Blum,et al. The Bottleneck , 2021, Monopsony Capitalism.

[19] Zhi-Hua Zhou,et al. Tri-training: exploiting unlabeled data using three classifiers , 2005, IEEE Transactions on Knowledge and Data Engineering.

[20] Christopher J. C. Burges,et al. Spectral clustering and transductive learning with multiple views , 2007, ICML '07.

[21] Yan Zhou,et al. Enhancing Supervised Learning with Unlabeled Data , 2000, ICML.

[22] Shih-Fu Chang,et al. Graph construction and b-matching for semi-supervised learning , 2009, ICML '09.