论文信息 - A Constrained Viterbi Relaxation for Bidirectional Word Alignment

A Constrained Viterbi Relaxation for Bidirectional Word Alignment

Bidirectional models of word alignment are an appealing alternative to post-hoc combinations of directional word aligners. Unfortunately, most bidirectional formulations are NP-Hard to solve, and a previous attempt to use a relaxationbased decoder yielded few exact solutions (6%). We present a novel relaxation for decoding the bidirectional model of DeNero and Macherey (2011). The relaxation can be solved with a modified version of the Viterbi algorithm. To find optimal solutions on difficult instances, we alternate between incrementally adding constraints and applying optimality-preserving coarse-to-fine pruning. The algorithm finds provably exact solutions on 86% of sentence pairs and shows improvements over directional models.

[1] Sadao Kurohashi,et al. An Alignment Algorithm Using Belief Propagation and a Structure-Based Distortion Model , 2009, EACL.

[2] Robert C. Moore. A Discriminative Framework for Bilingual Word Alignment , 2005, HLT.

[3] Alexander M. Rush,et al. A Tutorial on Dual Decomposition and Lagrangian Relaxation for Inference in Natural Language Processing , 2012, J. Artif. Intell. Res..

[4] Hermann Ney,et al. Improved Alignment Models for Statistical Machine Translation , 1999, EMNLP.

[5] Necip Fazil Ayan,et al. Going Beyond AER: An Extensive Analysis of Word Alignments and Their Impact on MT , 2006, ACL.

[6] Ben Taskar,et al. Posterior Regularization for Structured Latent Variable Models , 2010, J. Mach. Learn. Res..

[7] John DeNero,et al. Better Word Alignments with Supervised ITG Models , 2009, ACL.

[8] Robert L. Mercer,et al. The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[9] Daniel Marcu,et al. Statistical Phrase-Based Translation , 2003, NAACL.

[10] John DeNero,et al. Model-Based Aligner Combination Using Dual Decomposition , 2011, ACL.

[11] Ben Taskar,et al. Expectation Maximization and Posterior Constraints , 2007, NIPS.

[12] Ben Taskar,et al. Alignment by Agreement , 2006, NAACL.

[13] Ben Taskar,et al. Word Alignment via Quadratic Assignment , 2006, NAACL.

[14] Ben Taskar,et al. Better Alignments = Better Translations? , 2008, ACL.

[15] John DeNero,et al. Discriminative Modeling of Extraction Sets for Machine Translation , 2010, ACL.

[16] Hermann Ney,et al. HMM-Based Word Alignment in Statistical Translation , 1996, COLING.