论文信息 - Bilingual Structured Language Models for Statistical Machine Translation - 字舞流文

Bilingual Structured Language Models for Statistical Machine Translation

This paper describes a novel target-side syntactic language model for phrase-based statistical machine translation, bilingual structured language model. Our approach represents a new way to adapt structured language models (Chelba and Jelinek, 2000) to statistical machine translation, and a first attempt to adapt them to phrasebased statistical machine translation. We propose a number of variations of the bilingual structured language model and evaluate them in a series of rescoring experiments. Rescoring of 1000-best translation lists produces statistically significant improvements of up to 0.7 BLEU over a strong baseline for Chinese-English, but does not yield improvements for ArabicEnglish.

Christof Monz | Ekaterina Garmash | Christof Monz | Ekaterina Garmash

[1] Colin Cherry,et al. Cohesive Phrase-Based Decoding for Statistical Machine Translation , 2008, ACL.

[2] Daniel Marcu,et al. Statistical Phrase-Based Translation , 2003, NAACL.

[3] Chris Quirk,et al. Dependency treelet translation: the convergence of statistical and example-based machine-translation? , 2006, MTSUMMIT.

[4] Joakim Nivre,et al. Mildly Non-Projective Dependency Structures , 2006, ACL.

[5] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[6] Christof Monz,et al. Dependency-Based Bilingual Language Models for Reordering in Statistical Machine Translation , 2014, EMNLP.

[7] Jinxi Xu,et al. A New String-to-Dependency Machine Translation Algorithm with a Target Dependency Language Model , 2008, ACL.

[8] Frederick Jelinek,et al. Structured language modeling , 2000, Comput. Speech Lang..

[9] Mark Hopkins,et al. Tuning as Ranking , 2011, EMNLP.

[10] Stephan Vogel,et al. Cohesive Constraints in A Beam Search Phrase-based Decoder , 2009, NAACL.

[11] H. Ney,et al. A Source-side Decoding Sequence Model for Statistical Machine Translation , 2010, AMTA.

[12] Philip Resnik,et al. Evaluating Translational Correspondence using Annotation Projection , 2002, ACL.

[13] Slav Petrov,et al. Source-Side Classifier Preordering for Machine Translation , 2013, EMNLP.

[14] Jirí Havelka. Beyond Projectivity: Multilingual Evaluation of Constraints and Measures on Non-Projective Structures , 2007, ACL.

[15] Daniel Gildea. Parsers as language models for statistical machine translation , 2008 .

[16] Alexandra Birch,et al. Metrics for MT evaluation: evaluating reordering , 2010, Machine Translation.

[17] Christopher D. Manning,et al. Better Arabic Parsing: Baselines, Evaluations, and Analysis , 2010, COLING.

[18] Dan Klein,et al. Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[19] Kevin Knight,et al. A Syntax-based Statistical Translation Model , 2001, ACL.

[20] K. J. Evans,et al. Computer Intensive Methods for Testing Hypotheses: An Introduction , 1990 .

[21] Regina Barzilay,et al. Selective Sharing for Multilingual Dependency Parsing , 2012, ACL.

[22] Timothy Osborne,et al. Major constituents and two dependency grammar constraints on sharing in coordination , 2008 .

[23] Niyu Ge. A Direct Syntax-Driven Reordering Model for Phrase-Based Machine Translation , 2010, HLT-NAACL.

[24] Stefan Riezler,et al. On Some Pitfalls in Automatic Evaluation and Significance Testing for MT , 2005, IEEvaluation@ACL.

[25] Michael Collins,et al. Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.

[26] S. T. Buckland,et al. Computer-Intensive Methods for Testing Hypotheses. , 1990 .

[27] Qun Liu,et al. A Structured Language Model for Incremental Tree-to-String Translation , 2014, COLING.

[28] Nizar Habash,et al. Improving Arabic-to-English Statistical Machine Translation by Reordering Post-Verbal Subjects for Alignment , 2010, ACL.

[29] Jan Niehues,et al. Wider Context by Using Bilingual Language Models in Machine Translation , 2011, WMT@EMNLP.

[30] Dekai Wu,et al. Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora , 1997, CL.

[31] Philipp Koehn,et al. Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[32] Andreas Stolcke,et al. SRILM at Sixteen: Update and Outlook , 2011 .

[33] Rico Sennrich,et al. Modelling and Optimizing on Syntactic N-Grams for Statistical Machine Translation , 2015, TACL.

[34] Daniel Jurafsky,et al. Discriminative Reordering with Chinese Grammatical Relations Features , 2009, SSST@HLT-NAACL.

[35] Heidi Fox,et al. Phrasal Cohesion and Statistical Machine Translation , 2002, EMNLP.

[36] Hermann Ney,et al. A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[37] Bing Xiang,et al. Improving Reordering for Statistical Machine Translation with Smoothed Priors and Syntactic Features , 2011, SSST@ACL.

[38] François Yvon,et al. Improving Reordering with Linguistically Informed Bilingual n-grams , 2010, COLING.

[39] Andreas Zollmann,et al. Syntax Augmented Machine Translation via Chart Parsing , 2006, WMT@HLT-NAACL.