Paper Information - Regularization and Search for Minimum Error Rate Training

Regularization and Search for Minimum Error Rate Training

Minimum error rate training (MERT) is a widely used learning procedure for statistical machine translation models. We contrast three search strategies for MERT: Powell's method, the variant of coordinate descent found in the Moses MERT utility, and a novel stochastic method. It is shown that the stochastic method obtains test set gains of +0.98 BLEU on MT03 and +0.61 BLEU on MT05. We also present a method for regularizing the MERT objective that achieves statistically significant gains when combined with both Powell's method and coordinate descent.

Daniel Jurafsky | Christopher D. Manning | Daniel M. Cer

[1] David A. Smith, et al. Minimum Risk Annealing for Training Log-Linear Models, 2006, ACL.

[2] Hermann Ney, et al. A Systematic Comparison of Training Criteria for Statistical Machine Translation, 2007, EMNLP-CoNLL.

[3] Daniel Jurafsky, et al. A Conditional Random Field Word Segmenter for Sighan Bakeoff 2005, 2005, IJCNLP.

[4] Daniel Marcu, et al. Statistical Phrase-Based Translation, 2003, NAACL.

[5] Philipp Koehn, et al. Moses: Open Source Toolkit for Statistical Machine Translation, 2007, ACL.

[6] Christoph Tillmann, et al. A Unigram Orientation Model for Statistical Machine Translation, 2004, NAACL.

[7] Franz Josef Och, et al. Minimum Error Rate Training in Statistical Machine Translation, 2003, ACL.

[8] Salim Roukos, et al. Bleu: a Method for Automatic Evaluation of Machine Translation, 2002, ACL.

[9] Hermann Ney, et al. Discriminative Training and Maximum Entropy Models for Statistical Machine Translation, 2002, ACL.

[10] Stefan Riezler, et al. On Some Pitfalls in Automatic Evaluation and Significance Testing for MT, 2005, IEEvaluation@ACL.

[11] Andreas Stolcke, et al. SRILM - an extensible language modeling toolkit, 2002, INTERSPEECH.

[12] Hermann Ney, et al. A Systematic Comparison of Various Statistical Alignment Models, 2003, CL.

[13] William H. Press, et al. Numerical Recipes 3rd Edition: The Art of Scientific Computing, 2007.