论文信息 - Positive Diversity Tuning for Machine Translation System Combination

Positive Diversity Tuning for Machine Translation System Combination

We present Positive Diversity Tuning, a new method for tuning machine translation models specifically for improved performance during system combination. System combination gains are often limited by the fact that the translations produced by the different component systems are too similar to each other. We propose a method for reducing excess cross-system similarity by optimizing a joint objective that simultaneously rewards models for producing translations that are similar to reference translations, while also punishing them for translations that are too similar to those produced by other systems. The formulation of the Positive Diversity objective is easy to implement and allows for its quick integration with most machine translation tuning pipelines. We find that individual systems tuned on the same data to Positive Diversity can be even more diverse than systems built using different data sets, while still obtaining good BLEU scores. When these individual systems are used together for system combination, our approach allows for significant gains of 0.8 BLEU even when the combination is performed using a small number of otherwise identical individual systems.

Daniel Jurafsky | Christopher D. Manning | Daniel M. Cer

[1] Christopher D. Manning,et al. Fast and Adaptive Online Training of Feature-Rich Translation Models , 2013, ACL.

[2] John DeNero,et al. Model Combination for Machine Translation , 2010, HLT-NAACL.

[3] Jingbo Zhu,et al. Bagging and Boosting statistical machine translation systems , 2013, Artif. Intell..

[4] Alon Lavie,et al. Meteor 1.3: Automatic Metric for Reliable Optimization and Evaluation of Machine Translation Systems , 2011, WMT@EMNLP.

[5] Nitin Madnani,et al. Fluency, Adequacy, or HTER? Exploring Different Human Judgments with a Tunable MT Metric , 2009, WMT@EACL.

[6] Mark J. F. Gales,et al. Complementary System Generation using Directed Decision Trees , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[7] Slav Petrov,et al. Training Structured Prediction Models with Extrinsic Loss Functions , 2011 .

[8] Mark Hopkins,et al. Tuning as Ranking , 2011, EMNLP.

[9] Tadashi Nomoto. Multi-Engine Machine Translation with Voted Language Model , 2004, ACL.

[10] Franz Josef Och,et al. Minimum Error Rate Training in Statistical Machine Translation , 2003, ACL.

[11] Takako Aikawa,et al. Chained System: A Linear Combination of Different Types of Statistical Machine Translation Systems , 2009, MTSUMMIT.