论文信息 - Large-scale Expected BLEU Training of Phrase-based Reordering Models

Large-scale Expected BLEU Training of Phrase-based Reordering Models

Recent work by Cherry (2013) has shown that directly optimizing phrase-based reordering models towards BLEU can lead to significant gains. Their approach is limited to small training sets of a few thousand sentences and a similar number of sparse features. We show how the expected BLEU objective allows us to train a simple linear discriminative reordering model with millions of sparse features on hundreds of thousands of sentences resulting in significant improvements. A comparison to likelihood training demonstrates that expected BLEU is vastly more effective. Our best results improve a hierarchical lexicalized reordering baseline by up to 2.0 BLEU in a single-reference setting on a French-English WMT 2012 setup.

Jianfeng Gao | Michael Auli | Michel Galley

[1] Christoph Tillmann,et al. A Unigram Orientation Model for Statistical Machine Translation , 2004, NAACL.

[2] Mark Hopkins,et al. Tuning as Ranking , 2011, EMNLP.

[3] Richard M. Schwartz,et al. Expected BLEU Training for Graphs: BBN System Description for WMT11 System Combination Task , 2011, WMT@EMNLP.

[4] Jimmy J. Lin,et al. Mr. MIRA: Open-Source Large-Margin Structured Learning on MapReduce , 2013, ACL.

[5] Dekai Wu,et al. Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora , 1997, CL.

[6] Philipp Koehn,et al. Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[7] Robert L. Mercer,et al. Class-Based n-gram Models of Natural Language , 1992, CL.

[8] Daniel Marcu,et al. Scalable Inference and Training of Context-Rich Syntactic Translation Models , 2006, ACL.

[9] Haitao Mi,et al. Max-Violation Perceptron and Forced Decoding for Scalable MT Training , 2013, EMNLP.

[10] Richard M. Schwartz,et al. BBN System Description for WMT10 System Combination Task , 2010, WMT@ACL.

[11] Ben Taskar,et al. An End-to-End Discriminative Approach to Machine Translation , 2006, ACL.