论文信息 - Optimization of Log-linear Machine Translation Model Parameters Using SVMs

Optimization of Log-linear Machine Translation Model Parameters Using SVMs

The state-of-the art in statistical machine translation is based on a log-linear combination of different models. In this approach, the coefficients of the combination are computed by using the MERT algorithm with a validation data set. This algorithm presents high computational costs. As an alternative, we propose a novel technique based on Support Vector Machines to calculate these coefficients using a loss function to be minimized. We report the experiments on a Italian-English translation task showing encouraging results.

[1] Hermann Ney,et al. Discriminative Training and Maximum Entropy Models for Statistical Machine Translation , 2002, ACL.

[2] Thorsten Joachims,et al. Support Vector Training of Protein Alignment Models , 2007, RECOMB.

[3] Thomas Hofmann,et al. Large Margin Methods for Structured and Interdependent Output Variables , 2005, J. Mach. Learn. Res..

[4] Philipp Koehn,et al. Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[5] Vladimir N. Vapnik,et al. The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[6] F. Casacuberta,et al. Thot: a Toolkit To Train Phrase-based Statistical Translation Models , 2005, MTSUMMIT.

[7] Koby Crammer,et al. On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines , 2002, J. Mach. Learn. Res..

[8] Thomas Hofmann,et al. Support vector machine learning for interdependent and structured output spaces , 2004, ICML.

[9] Jason Weston,et al. Multi-Class Support Vector Machines , 1998 .

[10] Robert L. Mercer,et al. The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[11] Thorsten Joachims,et al. Making large scale SVM learning practical , 1998 .

[12] Corinna Cortes,et al. Support-Vector Networks , 1995, Machine Learning.

[13] Franz Josef Och,et al. Minimum Error Rate Training in Statistical Machine Translation , 2003, ACL.

[14] Matthew G. Snover,et al. A Study of Translation Edit Rate with Targeted Human Annotation , 2006, AMTA.

[15] Thorsten Joachims,et al. Training linear SVMs in linear time , 2006, KDD '06.

[16] Thorsten Joachims,et al. Learning to Align Sequences: A Maximum-Margin Approach , 2006 .

[17] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.