A Constrained Viterbi Relaxation for Bidirectional Word Alignment

Bidirectional models of word alignment are an appealing alternative to post-hoc combinations of directional word aligners. Unfortunately, most bidirectional formulations are NP-hard to solve, and a previous attempt to use a relaxation-based decoder yielded few exact solutions (6%). We present a novel relaxation for decoding the bidirectional model of DeNero and Macherey (2011). The relaxation can be solved with a modified version of the Viterbi algorithm. To find optimal solutions on difficult instances, we alternate between incrementally adding constraints and applying optimality-preserving coarse-to-fine pruning. The algorithm finds provably exact solutions on 86% of sentence pairs and shows improvements over directional models.
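For orientation, the standard (unmodified) Viterbi recursion for a single directional HMM-style aligner can be sketched as follows. This is not the constrained relaxation described in the abstract, only the textbook dynamic program that such a relaxation would build on. The function name `viterbi_align` and the score tables `emit`, `trans`, and `init` are hypothetical and assumed to hold log-scores.

```python
def viterbi_align(emit, trans, init):
    """Textbook Viterbi for a directional HMM-style aligner (illustrative only).

    emit[j][i]  : log-score of target word j aligning to source position i
    trans[p][i] : log-score of moving from source position p to position i
    init[i]     : log-score of starting at source position i
    Returns (best_score, path), where path[j] is the source position for word j.
    """
    n_tgt = len(emit)
    n_src = len(init)

    # Best score of any partial alignment ending at each source position.
    score = [init[i] + emit[0][i] for i in range(n_src)]
    back = []  # back[j-1][i] = best predecessor of position i at step j

    for j in range(1, n_tgt):
        new_score = []
        ptr = []
        for i in range(n_src):
            best_p = max(range(n_src), key=lambda p: score[p] + trans[p][i])
            new_score.append(score[best_p] + trans[best_p][i] + emit[j][i])
            ptr.append(best_p)
        score = new_score
        back.append(ptr)

    # Recover the highest-scoring path by following back-pointers.
    last = max(range(n_src), key=lambda i: score[i])
    best_score = score[last]
    path = [last]
    for ptr in reversed(back):
        path.append(ptr[path[-1]])
    path.reverse()
    return best_score, path


# Toy example with two target words and two source positions (made-up scores).
emit = [[0.0, -5.0], [-5.0, 0.0]]
trans = [[-1.0, -0.5], [-0.5, -1.0]]
init = [0.0, 0.0]
best_score, path = viterbi_align(emit, trans, init)
```

The recursion runs in time quadratic in the number of source positions per target word; a bidirectional decoder would additionally have to enforce agreement between the two directions, which this sketch does not attempt.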

[1] Sadao Kurohashi, et al. An Alignment Algorithm Using Belief Propagation and a Structure-Based Distortion Model, 2009, EACL.

[2] Robert C. Moore. A Discriminative Framework for Bilingual Word Alignment, 2005, HLT.

[3] Alexander M. Rush, et al. A Tutorial on Dual Decomposition and Lagrangian Relaxation for Inference in Natural Language Processing, 2012, J. Artif. Intell. Res.

[4] Hermann Ney, et al. Improved Alignment Models for Statistical Machine Translation, 1999, EMNLP.

[5] Necip Fazil Ayan, et al. Going Beyond AER: An Extensive Analysis of Word Alignments and Their Impact on MT, 2006, ACL.

[6] Ben Taskar, et al. Posterior Regularization for Structured Latent Variable Models, 2010, J. Mach. Learn. Res.

[7] John DeNero, et al. Better Word Alignments with Supervised ITG Models, 2009, ACL.

[8] Robert L. Mercer, et al. The Mathematics of Statistical Machine Translation: Parameter Estimation, 1993, CL.

[9] Daniel Marcu, et al. Statistical Phrase-Based Translation, 2003, NAACL.

[10] John DeNero, et al. Model-Based Aligner Combination Using Dual Decomposition, 2011, ACL.

[11] Ben Taskar, et al. Expectation Maximization and Posterior Constraints, 2007, NIPS.

[12] Ben Taskar, et al. Alignment by Agreement, 2006, NAACL.

[13] Ben Taskar, et al. Word Alignment via Quadratic Assignment, 2006, NAACL.

[14] Ben Taskar, et al. Better Alignments = Better Translations?, 2008, ACL.

[15] John DeNero, et al. Discriminative Modeling of Extraction Sets for Machine Translation, 2010, ACL.

[16] Hermann Ney, et al. HMM-Based Word Alignment in Statistical Translation, 1996, COLING.