论文信息 - Optimization Strategies for Online Large-Margin Learning in Machine Translation

Optimization Strategies for Online Large-Margin Learning in Machine Translation

The introduction of large-margin based discriminative methods for optimizing statistical machine translation systems in recent years has allowed exploration into many new types of features for the translation process. By removing the limitation on the number of parameters which can be optimized, these methods have allowed integrating millions of sparse features. However, these methods have not yet met with wide-spread adoption. This may be partly due to the perceived complexity of implementation, and partly due to the lack of standard methodology for applying these methods to MT. This papers aims to shed light on large-margin learning for MT, explicitly presenting the simple passive-aggressive algorithm which underlies many previous approaches, with direct application to MT, and empirically comparing several widespread optimization strategies.

Vladimir Eidelman | Vladimir Eidelman

[1] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[2] Mark Hopkins,et al. Tuning as Ranking , 2011, EMNLP.

[3] Philipp Koehn,et al. Online learning methods for discriminative training of phrase based statistical machine translation , 2007, MTSUMMIT.

[4] David Chiang,et al. Hope and Fear for Discriminative Training of Statistical Translation Models , 2012, J. Mach. Learn. Res..

[5] Chin-Yew Lin,et al. ORANGE: a Method for Evaluating Automatic Evaluation Metrics for Machine Translation , 2004, COLING.

[6] Franz Josef Och,et al. Minimum Error Rate Training in Statistical Machine Translation , 2003, ACL.

[7] Eric P. Xing,et al. Learning Structured Classifiers with Dual Coordinate Ascent , 2010 .

[8] Hermann Ney,et al. A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[9] David A. Smith,et al. Minimum Risk Annealing for Training Log-Linear Models , 2006, ACL.

[10] Gideon S. Mann,et al. Distributed Training Strategies for the Structured Perceptron , 2010, NAACL.

[11] Maria Leonor Pacheco,et al. of the Association for Computational Linguistics: , 2001 .