论文信息 - Distributed Minimum Error Rate Training of SMT using Particle Swarm Optimization

Distributed Minimum Error Rate Training of SMT using Particle Swarm Optimization

The direct optimization of a translation metric is an integral part of building stateof-the-art SMT systems. Unfortunately, widely used translation metrics such as BLEU-score are non-smooth, non-convex, and non-trivial to optimize. Thus, standard optimizers such as minimum error rate training (MERT) can be extremely time-consuming, leading to a slow turnaround rate for SMT research and experimentation. We propose an alternative approach based on particle swarm optimization (PSO), which can easily exploit the fast growth of distributed computing to obtain solutions quickly. For example in our experiments on NIST 2008 Chineseto-English data with 512 cores, we demonstrate a speed increase of up to 15x and reduce the parameter tuning time from 10 hours to 40 minutes with no degradation in BLEU-score.

[1] Roland Kuhn,et al. Stabilizing Minimum Error Rate Training , 2009, WMT@EACL.

[2] Wolfgang Macherey,et al. Lattice-based Minimum Error Rate Training for Statistical Machine Translation , 2008, EMNLP.

[3] Yoram Singer,et al. Pegasos: primal estimated sub-gradient solver for SVM , 2011, Math. Program..

[4] Maria Leonor Pacheco,et al. of the Association for Computational Linguistics: , 2001 .

[5] Chris Quirk,et al. Random Restarts in Minimum Error Rate Training for Statistical Machine Translation , 2008, COLING.

[6] Riccardo Poli,et al. Particle swarm optimization , 1995, Swarm Intelligence.

[7] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[8] Franz Josef Och,et al. Minimum Error Rate Training in Statistical Machine Translation , 2003, ACL.

[9] A. Engelbrecht,et al. A new locally convergent particle swarm optimiser , 2002, IEEE International Conference on Systems, Man and Cybernetics.

[10] Barry Haddow,et al. Improved Minimum Error Rate Training in Moses , 2009, Prague Bull. Math. Linguistics.

[11] Philipp Koehn,et al. Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[12] Daniel Jurafsky,et al. Regularization and Search for Minimum Error Rate Training , 2008, WMT@ACL.