论文信息 - Large-Scale Semi-Supervised Learning

Large-Scale Semi-Supervised Learning

Labeling data is expensive, whilst unlabeled data is often abundant and cheap to collect. Semi-supervised learning algorithms that use both types of data can perform significantly better than supervised algorithms that use labeled data alone. However, for such gains to be observed, the amount of unlabeled data trained on should be relatively large. Therefore, making semi-supervised algorithms scalable is paramount. In this work we review several recent techniques for semisupervised learning, and methods for improving the scalability of these algorithms.

Jason Weston | J. Weston

[1] H. J. Scudder,et al. Probability of error of some adaptive pattern-recognition machines , 1965, IEEE Trans. Inf. Theory.

[2] D. Hosmer. A Comparison of Iterative Maximum Likelihood Estimates of the Parameters of a Mixture of Two Normal Distributions Under Three Different Types of Sample , 1973 .

[3] Richard O. Duda,et al. Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[4] Jeff Shrager,et al. Observation of Phase Transitions in Spreading Activation Networks , 1987, Science.

[5] Avrim Blum,et al. The Bottleneck , 2021, Monopsony Capitalism.

[6] Vladimir Vapnik,et al. Statistical learning theory , 1998 .

[7] Thorsten Joachims,et al. Transductive Inference for Text Classification using Support Vector Machines , 1999, ICML.

[8] Tommi S. Jaakkola,et al. Partially labeled classification with Markov random walks , 2001, NIPS.

[9] Michael I. Jordan,et al. On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[10] Alan L. Yuille,et al. The Concave-Convex Procedure (CCCP) , 2001, NIPS.

[11] Zoubin Ghahramani,et al. Learning from labeled and unlabeled data with label propagation , 2002 .