Distribution Matching for Transduction

Many transductive inference algorithms assume that distributions over training and test estimates should be related, e.g. by providing a large margin of separation on both sets. We use this idea to design a transduction algorithm which can be used without modification for classification, regression, and structured estimation. At its heart we exploit the fact that for a good learner the distributions over the outputs on training and test sets should match. This is a classical two-sample problem which can be solved efficiently in its most general form by using distance measures in Hilbert Space. It turns out that a number of existing heuristics can be viewed as special cases of our approach.

[1]  [CRF]. , 1975, Horumon to rinsho. Clinical endocrinology.

[2]  Alexander Gammerman,et al.  Learning by Transduction , 1998, UAI.

[3]  W. Gander,et al.  A D.C. OPTIMIZATION ALGORITHM FOR SOLVING THE TRUST-REGION SUBPROBLEM∗ , 1998 .

[4]  Thorsten Joachims,et al.  Transductive Inference for Text Classification using Support Vector Machines , 1999, ICML.

[5]  Sabine Buchholz,et al.  Introduction to the CoNLL-2000 Shared Task Chunking , 2000, CoNLL/LLL.

[6]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[7]  Wei Li,et al.  Early results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons , 2003, CoNLL.

[8]  Thomas Gärtner,et al.  Large-Scale Multiclass Transduction , 2005, NIPS.

[9]  S. Sathiya Keerthi,et al.  Large scale semi-supervised linear SVMs , 2006, SIGIR.

[10]  Thomas Gärtner,et al.  Transductive Gaussian Process Regression with Automatic Model Selection , 2006, ECML.

[11]  Bernhard Schölkopf,et al.  A Kernel Method for the Two-Sample-Problem , 2006, NIPS.

[12]  Bernhard Schölkopf,et al.  Introduction to Semi-Supervised Learning , 2006, Semi-Supervised Learning.

[13]  Ben Taskar,et al.  Expectation Maximization and Posterior Constraints , 2007, NIPS.

[14]  Alexander Zien,et al.  Transductive support vector machines for structured variables , 2007, ICML '07.

[15]  Gideon S. Mann,et al.  Learning from labeled features using generalized expectation criteria , 2008, SIGIR '08.

[16]  Yurii Nesterov,et al.  Confidence level solutions for stochastic programming , 2000, Autom..