论文信息 - Tuning Syntactically Enhanced Word Alignment for Statistical Machine Translation

Tuning Syntactically Enhanced Word Alignment for Statistical Machine Translation

We introduce a syntactically enhanced word alignment model that is more flexible than state-of-the-art generative word alignment models and can be tuned according to different end tasks. First of all, this model takes the advantages of both unsupervised and supervised word alignment approaches by obtaining anchor alignments from unsupervised generative models and seeding the anchor alignments into a supervised discriminative model. Second, this model offers the flexibility of tuning the alignment according to different optimisation criteria. Our experiments show that using our word alignment in a Phrase-Based Statistical Machine Translation system yields a 5.38% relative increase on IWSLT 2007 task in terms of BLEU score.

Yanjun Ma | Andy Way | Patrik Lambert

[1] Franz Josef Och,et al. Minimum Error Rate Training in Statistical Machine Translation , 2003, ACL.

[2] Hermann Ney,et al. HMM-Based Word Alignment in Statistical Translation , 1996, COLING.

[3] Alexander M. Fraser,et al. Squibs and Discussions: Measuring Word Alignment Quality for Statistical Machine Translation , 2007, CL.

[4] I. Dan Melamed,et al. Models of translation equivalence among words , 2000, CL.

[5] Yang Liu,et al. Log-Linear Models for Word Alignment , 2005, ACL.

[6] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[7] Ted Dunning,et al. Accurate Methods for the Statistics of Surprise and Coincidence , 1993, CL.

[8] Joakim Nivre,et al. MaltParser: A Language-Independent System for Data-Driven Dependency Parsing , 2007, Natural Language Engineering.

[9] Ben Taskar,et al. Alignment by Agreement , 2006, NAACL.

[10] Philipp Koehn,et al. Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[11] Russell V. Lenth,et al. Computer Intensive Methods for Testing Hypotheses: An Introduction , 1990 .