Phrase-Based Alignment Models for Statistical Machine Translation

The first pattern recognition approaches to machine translation were based on single-word models. However, these models present an important deficiency; they do not take contextual information into account for the translation decision. The phrase-based approach consists in translating a multiword source sequence into a multiword target sequence, instead of a single source word into a single target word. We present different methods to train the parameters of this kind of model. In the evaluation phase of this approach, we obtained interesting results in comparison with other statistical models.

[1]  Franz Josef Och,et al.  Statistical machine translation: from single word models to alignment templates , 2002 .

[2]  Gerhard Lakemeyer,et al.  KI 2002: Advances in Artificial Intelligence , 2002, Lecture Notes in Computer Science.

[3]  Francisco Casacuberta,et al.  Binary Feature Classification for Word Disambiguation in Statistical Machine Translation , 2002, PRIS.

[4]  Francisco Casacuberta Inference of Finite-State Transducers by Using Regular Grammars and Morphisms , 2000, ICGI.

[5]  Hermann Ney,et al.  Phrase-Based Statistical Machine Translation , 2002, KI.

[6]  Francisco Casacuberta,et al.  Combining Phrase-Based and Template-Based Alignment Models in Statistical Translation , 2003, IbPRIA.

[7]  Robert L. Mercer,et al.  The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[8]  Daniel Marcu,et al.  Statistical Phrase-Based Translation , 2003, NAACL.

[9]  Francisco Casacuberta,et al.  Statistical Machine Translation Decoding Using Target Word Reordering , 2004, SSPR/SPR.

[10]  Andrew Lysley,et al.  INFORMATION SOCIETY TECHNOLOGIES (IST) PROGRAMME , 2004 .

[11]  Hermann Ney,et al.  Improvements in Phrase-Based Statistical Machine Translation , 2004, NAACL.

[12]  Hermann Ney,et al.  HMM-Based Word Alignment in Statistical Translation , 1996, COLING.

[13]  Hermann Ney,et al.  A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[14]  Marti A. Hearst,et al.  HLT-NAACL 2003 : Human Language Technology conference of the North American Chapter of the Association for Computational Linguistics : proceedings of the main conference : May 27 to June 1, 2003, Edmonton, Alberta, Canada , 2003 .

[15]  Francisco Casacuberta,et al.  MONOTONE STATISTICAL TRANSLATION USING WORD GROUPS , 2001 .