In this paper, we describe improved alignment models for statistical machine translation. The statistical translation approach uses two types of information: a translation model and a language model. The language model used is a bigram or general m-gram model. The translation model is decomposed into a lexical and an alignment model. We describe two different approaches for statistical translation and present experimental results. The first approach is based on dependencies between single words, the second approach explicitly takes shallow phrase structures into account, using two different alignment levels: a phrase level alignment between phrases and a word level alignment between single words. We present results using the Verbmobil task (German-English, 6000word vocabulary) which is a limited-domain spoken-language task. The experimental tests were performed on both the text transcription and the speech recognizer output. 1 S t a t i s t i c a l M a c h i n e T r a n s l a t i o n The goal of machine translation is the translation of a text given in some source language into a target language. We are given a source string f / = fl...fj...fJ, which is to be translated into a target string e{ = el...ei...ex. Among all possible target strings, we will choose the string with the highest probability: = argmax {Pr(ezIlflJ)}
[1]
Robert L. Mercer,et al.
The Mathematics of Statistical Machine Translation: Parameter Estimation
,
1993,
CL.
[2]
Hermann Ney,et al.
HMM-Based Word Alignment in Statistical Translation
,
1996,
COLING.
[3]
Hermann Ney,et al.
A DP based Search Using Monotone Alignments in Statistical Translation
,
1997,
ACL.
[4]
Srinivas Bangalore,et al.
Automatic Acquisition of Hierarchical Transduction Models for Machine Translation
,
1998,
COLING-ACL.
[5]
Franz Josef Och,et al.
Improving Statistical Natural Language Translation with Categories and Rules
,
1998,
ACL.
[6]
Hermann Ney,et al.
A DP based Search Algorithm for Statistical Machine Translation
,
1998,
ACL.
[7]
Hermann Ney,et al.
Speech translation: coupling of recognition and translation
,
1999,
1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).