The Necessity of Combining Adaptation Methods

Problems stemming from domain differences continue to plague the statistical natural language processing community, and there has been ongoing work on general-purpose algorithms to alleviate them. In this paper we argue that existing general-purpose approaches typically address only one of two difficulties faced in adaptation: 1) differences in base feature statistics, or 2) task differences that can be detected with labeled data. We argue that these two classes of adaptation algorithms must be combined, supporting this claim with theoretical analysis and with experiments on simulated and real-world data. We find that the combined approach often outperforms either class of adaptation algorithm on its own. By combining simple approaches from each class, we achieve state-of-the-art results on both the Named Entity Recognition adaptation task and the Preposition Sense Disambiguation adaptation task. We also show that applying an adaptation algorithm that finds a shared representation between domains often affects the choice of adaptation algorithm that makes use of labeled target data.
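To make the second class concrete, one of the simplest methods that exploits labeled target data is Daumé's "frustratingly easy" feature augmentation, which copies each base feature into a shared version and a domain-specific version so a linear model can learn both domain-general and domain-specific weights. The sketch below is illustrative only (the function name and feature encoding are our own, not from the paper):

```python
def augment(features, domain):
    """Feature augmentation for domain adaptation: each base feature is
    duplicated into a 'shared' copy (active in every domain) and a
    domain-specific copy (active only in examples from this domain)."""
    augmented = {}
    for name, value in features.items():
        augmented[("shared", name)] = value   # domain-general weight
        augmented[(domain, name)] = value     # domain-specific weight
    return augmented

# Example: the same base feature fires in two domains, but the model can
# now assign it different weights in 'news' and 'web' while still sharing
# strength through the 'shared' copy.
src = augment({"word=bank": 1.0}, "news")
tgt = augment({"word=bank": 1.0}, "web")
```

Any standard linear learner trained on the augmented feature space then implicitly trades off shared and domain-specific evidence, which is what makes the method compose naturally with representation-learning approaches that address base feature statistics.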
