论文信息 - Vector Space Model for Adaptation in Statistical Machine Translation - 字舞流文

Vector Space Model for Adaptation in Statistical Machine Translation

This paper proposes a new approach to domain adaptation in statistical machine translation (SMT) based on a vector space model (VSM). The general idea is first to create a vector profile for the in-domain development (“dev”) set. This profile might, for instance, be a vector with a dimensionality equal to the number of training subcorpora; each entry in the vector reflects the contribution of a particular subcorpus to all the phrase pairs that can be extracted from the dev set. Then, for each phrase pair extracted from the training data, we create a vector with features defined in the same way, and calculate its similarity score with the vector representing the dev set. Thus, we obtain a decoding feature whose value represents the phrase pair’s closeness to the dev. This is a simple, computationally cheap form of instance weighting for phrase pairs. Experiments on large scale NIST evaluation data show improvements over strong baselines: +1.8 BLEU on Arabic to English and +1.4 BLEU on Chinese to English over a non-adapted baseline, and significant improvements in most circumstances over baselines with linear mixture model adaptation. An informal analysis suggests that VSM adaptation may help in making a good choice among words with the same meaning, on the basis of style and genre.

Roland Kuhn | George F. Foster | Boxing Chen | Boxing Chen | R. Kuhn

[1] Bing Xiang,et al. Feature-Rich Discriminative Phrase Rescoring for SMT , 2010, COLING.

[2] Aaron Phillips,et al. Training Machine Translation with a Second-Order Taylor Approximation of Weighted Translation Instances , 2011, MTSUMMIT.

[3] George F. Foster,et al. Batch Tuning Strategies for Statistical Machine Translation , 2012, NAACL.

[4] Haizhou Li,et al. Exploiting N-best Hypotheses for SMT Self-Enhancement , 2008, ACL.

[5] Philipp Koehn,et al. Statistical Significance Tests for Machine Translation Evaluation , 2004, EMNLP.

[6] Masaki Murata,et al. A Bayesian Method for Robust Estimation of Distributional Similarities , 2010, ACL.

[7] Jianfeng Gao,et al. Domain Adaptation via Pseudo In-Domain Data Selection , 2011, EMNLP.

[8] Roland Kuhn,et al. Discriminative Instance Weighting for Domain Adaptation in Statistical Machine Translation , 2010, EMNLP.

[9] Christopher D. Manning,et al. A Simple and Effective Hierarchical Phrase Reordering Model , 2008, EMNLP.

[10] Alex Waibel,et al. Adaptation of the translation model for statistical machine translation based on information retrieval , 2005, EAMT.

[11] Curt Burgess,et al. Producing high-dimensional semantic spaces from lexical co-occurrence , 1996 .

[12] William D. Lewis,et al. Intelligent Selection of Language Model Training Data , 2010, ACL.

[13] Philipp Koehn,et al. Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[14] Philipp Koehn,et al. Experiments in Domain Adaptation for Statistical Machine Translation , 2007, WMT@ACL.

[15] Sung-Hyuk Cha. Comprehensive Survey on Distance/Similarity Measures between Probability Density Functions , 2007 .

[16] Gholamreza Haffari,et al. Transductive learning for statistical machine translation , 2007, ACL.

[17] Roland Kuhn,et al. Mixture-Model Adaptation for SMT , 2007, WMT@ACL.

[18] Peter D. Turney. Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL , 2001, ECML.

[19] Stephan Vogel,et al. Language Model Adaptation for Statistical Machine Translation via Structured Query Models , 2004, COLING.

[20] Dekang Lin,et al. Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[21] Qun Liu,et al. Improving Statistical Machine Translation Performance by Training Data Selection and Optimization , 2007, EMNLP-CoNLL.

[22] Donald Hindle,et al. Noun Classification From Predicate-Argument Structures , 1990, ACL.

[23] Rico Sennrich,et al. Perplexity Minimization for Translation Model Domain Adaptation in Statistical Machine Translation , 2012, EACL.

[24] Spyridon Matsoukas,et al. Discriminative Corpus Weight Estimation for Machine Translation , 2009, EMNLP.

[25] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[26] Marcello Federico,et al. Domain Adaptation for Statistical Machine Translation with Monolingual Resources , 2009, WMT@EACL.

[27] George F. Foster,et al. Unpacking and Transforming Feature Functions: New Ways to Smooth Phrase Tables , 2011, MTSUMMIT.