Expected BLEU Training for Graphs: BBN System Description for WMT11 System Combination Task

BBN submitted system combination outputs for the Czech-English, German-English, Spanish-English, and French-English language pairs. All combinations were based on confusion network decoding. The confusion networks were built using an incremental hypothesis alignment algorithm with flexible matching. In addition to the usual decoder features, a novel bi-gram count feature was introduced that penalizes bi-grams not present in the input hypotheses for a given source sentence. The system combination weights were tuned using a graph-based expected BLEU as the objective function while incrementally expanding the networks to bi-gram and 5-gram contexts. The expected BLEU tuning described in this paper generalizes naturally to hypergraphs and can be used to optimize thousands of weights. The combinations gained about 0.5-4.0 BLEU points over the best individual systems on the official WMT11 language pairs. A 39-system multi-source combination achieved an 11.1 BLEU point gain.
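The expected BLEU objective can be illustrated with a simplified n-best approximation: a posterior distribution over candidate hypotheses is induced by a softmax over weighted feature scores, and the objective is the posterior-weighted average of a sentence-level BLEU. This is only a minimal sketch under stated assumptions — the function names, the softmax posterior, and the smoothed sentence-level BLEU below are illustrative choices, not the paper's implementation, which computes the expectation over the full confusion network rather than an explicit hypothesis list.

```python
import math
from collections import Counter

def ngram_counts(tokens, n):
    """Count n-grams of order n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def sentence_bleu(hyp, ref, max_n=4):
    """Add-one-smoothed sentence-level BLEU (an illustrative stand-in)."""
    hyp_t, ref_t = hyp.split(), ref.split()
    if not hyp_t:
        return 0.0
    log_prec = 0.0
    for n in range(1, max_n + 1):
        h, r = ngram_counts(hyp_t, n), ngram_counts(ref_t, n)
        match = sum(min(c, r[g]) for g, c in h.items())
        total = max(sum(h.values()), 1)
        # Add-one smoothing avoids log(0) for short or poor hypotheses.
        log_prec += math.log((match + 1) / (total + 1))
    brevity = min(1.0, math.exp(1 - len(ref_t) / len(hyp_t)))
    return brevity * math.exp(log_prec / max_n)

def expected_bleu(hyps, feats, weights, ref):
    """Posterior-weighted BLEU over a hypothesis list.

    hyps:    list of hypothesis strings
    feats:   per-hypothesis feature vectors
    weights: system combination weights being tuned
    """
    scores = [sum(w * f for w, f in zip(weights, fv)) for fv in feats]
    m = max(scores)                      # shift for numerical stability
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    posterior = [e / z for e in exps]
    return sum(p * sentence_bleu(h, ref) for p, h in zip(posterior, hyps))
```

Because the objective is a smooth function of the weights, it can be optimized with gradient-based methods (e.g. L-BFGS) over many thousands of features, unlike the piecewise-constant corpus BLEU targeted by minimum error rate training.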
