论文信息 - Improving the IBM Alignment Models Using Variational Bayes

Improving the IBM Alignment Models Using Variational Bayes

Bayesian approaches have been shown to reduce the amount of overfitting that occurs when running the EM algorithm, by placing prior probabilities on the model parameters. We apply one such Bayesian technique, variational Bayes, to the IBM models of word alignment for statistical machine translation. We show that using variational Bayes improves the performance of the widely used GIZA++ software, as well as improving the overall performance of the Moses machine translation system in terms of BLEU score.

Daniel Gildea | Darcey Riley

[1] Robert C. Moore. Improving IBM Word Alignment Model 1 , 2004, ACL.

[2] Robert L. Mercer,et al. The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[3] John DeNero,et al. Sampling Alignment Structure under a Bayesian Translation Model , 2008, EMNLP.

[4] Matthew J. Beal. Variational algorithms for approximate Bayesian inference , 2003 .

[5] Hermann Ney,et al. HMM-Based Word Alignment in Statistical Translation , 1996, COLING.

[6] Mark Johnson,et al. Why Doesn’t EM Find Good HMM POS-Taggers? , 2007, EMNLP.

[7] Phil Blunsom,et al. Bayesian Synchronous Grammar Induction , 2008, NIPS.

[8] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[9] Daniel Marcu,et al. What’s in a translation rule? , 2004, NAACL.

[10] Hermann Ney,et al. Improved Statistical Alignment Models , 2000, ACL.

[11] David Chiang,et al. A Hierarchical Phrase-Based Model for Statistical Machine Translation , 2005, ACL.

[12] Murat Saraclar,et al. Bayesian Word Alignment for Statistical Machine Translation , 2011, ACL.

[13] Philipp Koehn,et al. Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.