Paper information - Analysis of Representations for Domain Adaptation

Analysis of Representations for Domain Adaptation

Discriminative learning methods for classification perform well when training and test data are drawn from the same distribution. In many situations, though, we have labeled training data for a source domain, and we wish to learn a classifier which performs well on a target domain with a different distribution. Under what conditions can we adapt a classifier trained on the source domain for use in the target domain? Intuitively, a good feature representation is a crucial factor in the success of domain adaptation. We formalize this intuition theoretically with a generalization bound for domain adaptation. Our theory illustrates the tradeoffs inherent in designing a representation for domain adaptation and gives a new justification for a recently proposed model. It also points toward a promising new model for domain adaptation: one which explicitly minimizes the difference between the source and target domains, while at the same time maximizing the margin of the training set.
