论文信息 - Distribution Matching for Transduction

Distribution Matching for Transduction

Many transductive inference algorithms assume that distributions over training and test estimates should be related, e.g. by providing a large margin of separation on both sets. We use this idea to design a transduction algorithm which can be used without modification for classification, regression, and structured estimation. At its heart we exploit the fact that for a good learner the distributions over the outputs on training and test sets should match. This is a classical two-sample problem which can be solved efficiently in its most general form by using distance measures in Hilbert Space. It turns out that a number of existing heuristics can be viewed as special cases of our approach.

[1] [CRF]. , 1975, Horumon to rinsho. Clinical endocrinology.

[2] Alexander Gammerman,et al. Learning by Transduction , 1998, UAI.

[3] W. Gander,et al. A D.C. OPTIMIZATION ALGORITHM FOR SOLVING THE TRUST-REGION SUBPROBLEM∗ , 1998 .

[4] Thorsten Joachims,et al. Transductive Inference for Text Classification using Support Vector Machines , 1999, ICML.

[5] Sabine Buchholz,et al. Introduction to the CoNLL-2000 Shared Task Chunking , 2000, CoNLL/LLL.

[6] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[7] Wei Li,et al. Early results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons , 2003, CoNLL.

[8] Thomas Gärtner,et al. Large-Scale Multiclass Transduction , 2005, NIPS.

[9] S. Sathiya Keerthi,et al. Large scale semi-supervised linear SVMs , 2006, SIGIR.

[10] Thomas Gärtner,et al. Transductive Gaussian Process Regression with Automatic Model Selection , 2006, ECML.

[11] Bernhard Schölkopf,et al. A Kernel Method for the Two-Sample-Problem , 2006, NIPS.

[12] Bernhard Schölkopf,et al. Introduction to Semi-Supervised Learning , 2006, Semi-Supervised Learning.

[13] Ben Taskar,et al. Expectation Maximization and Posterior Constraints , 2007, NIPS.

[14] Alexander Zien,et al. Transductive support vector machines for structured variables , 2007, ICML '07.

[15] Gideon S. Mann,et al. Learning from labeled features using generalized expectation criteria , 2008, SIGIR '08.

[16] Yurii Nesterov,et al. Confidence level solutions for stochastic programming , 2000, Autom..