论文信息 - The CMU statistical machine translation system

The CMU statistical machine translation system

In this paper we describe the components of our statistical machine translation system. This system combines phrase-to-phrase translations extracted from a bilingual corpus using different alignment approaches. Special methods to extract and align named entities are used. We show how a manual lexicon can be incorporated into the statistical system in an optimized way. Experiments on Chinese-to-English and Arabic-to-English translation tasks are presented.

[1] Robert L. Mercer,et al. The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[2] Hermann Ney,et al. HMM-Based Word Alignment in Statistical Translation , 1996, COLING.

[3] Dekai Wu,et al. Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora , 1997, CL.

[4] Alexander H. Waibel,et al. Fast decoding for statistical machine translation , 1998, ICSLP.

[5] Hermann Ney,et al. Translation with Cascaded Finite State Transducers , 2000, ACL.

[6] Hermann Ney,et al. Improved Statistical Alignment Models , 2000, ACL.

[7] Kevin Knight,et al. A Syntax-based Statistical Translation Model , 2001, ACL.

[8] Daniel Marcu,et al. A Phrase-Based,Joint Probability Model for Statistical Machine Translation , 2002, EMNLP.

[9] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[10] Stephan Vogel,et al. Word Alignment Based on Bilingual Bracketing , 2003, ParallelTexts@NAACL-HLT.

[11] S. Vogel,et al. Overlapping phrase-level translation rules in an SMT engine , 2003, International Conference on Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003.

[12] Alexander H. Waibel,et al. Effective Phrase Translation Extraction from Alignment Models , 2003, ACL.

[13] Alexander H. Waibel,et al. Automatic Extraction of Named Entity Translingual Equivalence Based on Multi-Feature Cost Minimization , 2003, NER@ACL.