Bilingual Structured Language Models for Statistical Machine Translation

This paper describes a novel target-side syntactic language model for phrase-based statistical machine translation, bilingual structured language model. Our approach represents a new way to adapt structured language models (Chelba and Jelinek, 2000) to statistical machine translation, and a first attempt to adapt them to phrasebased statistical machine translation. We propose a number of variations of the bilingual structured language model and evaluate them in a series of rescoring experiments. Rescoring of 1000-best translation lists produces statistically significant improvements of up to 0.7 BLEU over a strong baseline for Chinese-English, but does not yield improvements for ArabicEnglish.

[1]  Colin Cherry,et al.  Cohesive Phrase-Based Decoding for Statistical Machine Translation , 2008, ACL.

[2]  Daniel Marcu,et al.  Statistical Phrase-Based Translation , 2003, NAACL.

[3]  Chris Quirk,et al.  Dependency treelet translation: the convergence of statistical and example-based machine-translation? , 2006, MTSUMMIT.

[4]  Joakim Nivre,et al.  Mildly Non-Projective Dependency Structures , 2006, ACL.

[5]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[6]  Christof Monz,et al.  Dependency-Based Bilingual Language Models for Reordering in Statistical Machine Translation , 2014, EMNLP.

[7]  Jinxi Xu,et al.  A New String-to-Dependency Machine Translation Algorithm with a Target Dependency Language Model , 2008, ACL.

[8]  Frederick Jelinek,et al.  Structured language modeling , 2000, Comput. Speech Lang..

[9]  Mark Hopkins,et al.  Tuning as Ranking , 2011, EMNLP.

[10]  Stephan Vogel,et al.  Cohesive Constraints in A Beam Search Phrase-based Decoder , 2009, NAACL.

[11]  H. Ney,et al.  A Source-side Decoding Sequence Model for Statistical Machine Translation , 2010, AMTA.

[12]  Philip Resnik,et al.  Evaluating Translational Correspondence using Annotation Projection , 2002, ACL.

[13]  Slav Petrov,et al.  Source-Side Classifier Preordering for Machine Translation , 2013, EMNLP.

[14]  Jirí Havelka Beyond Projectivity: Multilingual Evaluation of Constraints and Measures on Non-Projective Structures , 2007, ACL.

[15]  Daniel Gildea Parsers as language models for statistical machine translation , 2008 .

[16]  Alexandra Birch,et al.  Metrics for MT evaluation: evaluating reordering , 2010, Machine Translation.

[17]  Christopher D. Manning,et al.  Better Arabic Parsing: Baselines, Evaluations, and Analysis , 2010, COLING.

[18]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[19]  Kevin Knight,et al.  A Syntax-based Statistical Translation Model , 2001, ACL.

[20]  K. J. Evans,et al.  Computer Intensive Methods for Testing Hypotheses: An Introduction , 1990 .

[21]  Regina Barzilay,et al.  Selective Sharing for Multilingual Dependency Parsing , 2012, ACL.

[22]  Timothy Osborne,et al.  Major constituents and two dependency grammar constraints on sharing in coordination , 2008 .

[23]  Niyu Ge A Direct Syntax-Driven Reordering Model for Phrase-Based Machine Translation , 2010, HLT-NAACL.

[24]  Stefan Riezler,et al.  On Some Pitfalls in Automatic Evaluation and Significance Testing for MT , 2005, IEEvaluation@ACL.

[25]  Michael Collins,et al.  Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.

[26]  S. T. Buckland,et al.  Computer-Intensive Methods for Testing Hypotheses. , 1990 .

[27]  Qun Liu,et al.  A Structured Language Model for Incremental Tree-to-String Translation , 2014, COLING.

[28]  Nizar Habash,et al.  Improving Arabic-to-English Statistical Machine Translation by Reordering Post-Verbal Subjects for Alignment , 2010, ACL.

[29]  Jan Niehues,et al.  Wider Context by Using Bilingual Language Models in Machine Translation , 2011, WMT@EMNLP.

[30]  Dekai Wu,et al.  Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora , 1997, CL.

[31]  Philipp Koehn,et al.  Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[32]  Andreas Stolcke,et al.  SRILM at Sixteen: Update and Outlook , 2011 .

[33]  Rico Sennrich,et al.  Modelling and Optimizing on Syntactic N-Grams for Statistical Machine Translation , 2015, TACL.

[34]  Daniel Jurafsky,et al.  Discriminative Reordering with Chinese Grammatical Relations Features , 2009, SSST@HLT-NAACL.

[35]  Heidi Fox,et al.  Phrasal Cohesion and Statistical Machine Translation , 2002, EMNLP.

[36]  Hermann Ney,et al.  A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[37]  Bing Xiang,et al.  Improving Reordering for Statistical Machine Translation with Smoothed Priors and Syntactic Features , 2011, SSST@ACL.

[38]  François Yvon,et al.  Improving Reordering with Linguistically Informed Bilingual n-grams , 2010, COLING.

[39]  Andreas Zollmann,et al.  Syntax Augmented Machine Translation via Chart Parsing , 2006, WMT@HLT-NAACL.