论文信息 - Word Alignment via Quadratic Assignment

Word Alignment via Quadratic Assignment

Recently, discriminative word alignment methods have achieved state-of-the-art accuracies by extending the range of information sources that can be easily incorporated into aligners. The chief advantage of a discriminative framework is the ability to score alignments based on arbitrary features of the matching word tokens, including orthographic form, predictions of other models, lexical context and so on. However, the proposed bipartite matching model of Taskar et al. (2005), despite being tractable and effective, has two important limitations. First, it is limited by the restriction that words have fertility of at most one. More importantly, first order correlations between consecutive words cannot be directly captured by the model. In this work, we address these limitations by enriching the model form. We give estimation and inference algorithms for these enhancements. Our best model achieves a relative AER reduction of 25% over the basic matching formulation, outperforming intersected IBM Model 4 without using any overly compute-intensive features. By including predictions of other models as features, we achieve AER of 3.8 on the standard Hansards dataset.

[1] John Cocke,et al. A Statistical Approach to Machine Translation , 1990, CL.

[2] Hermann Ney,et al. HMM-Based Word Alignment in Statistical Translation , 1996, COLING.

[3] Michael Collins,et al. Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms , 2002, EMNLP.

[4] Ted Pedersen,et al. An Evaluation Exercise for Word Alignment , 2003, ParallelTexts@NAACL-HLT.

[5] Daniel Marcu,et al. Statistical Phrase-Based Translation , 2003, NAACL.

[6] Hermann Ney,et al. A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[7] Alexander Schrijver,et al. Combinatorial optimization. Polyhedra and efficiency. , 2003 .

[8] Ben Taskar,et al. A Discriminative Matching Approach to Word Alignment , 2005, HLT.

[9] Ben Taskar,et al. Learning structured prediction models: a large margin approach , 2005, ICML.

[10] Robert C. Moore. A Discriminative Framework for Bilingual Word Alignment , 2005, HLT.

[11] Ben Taskar,et al. Alignment by Agreement , 2006, NAACL.