Shift-Reduce Word Reordering for Machine Translation

This paper presents a novel word reordering model that employs a shift-reduce parser for inversion transduction grammars. Our model uses rich syntax parsing features for word reordering and runs in linear time. We apply it to postordering of phrase-based machine translation (PBMT) for Japanese-to-English patent tasks. Our experimental results show that our method achieves a significant improvement of +3.1 BLEU scores against 30.15 BLEU scores of the baseline PBMT system.

[1]  Philipp Koehn,et al.  Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[2]  Kenji Sagae,et al.  Dynamic Programming for Linear-Time Incremental Parsing , 2010, ACL.

[3]  John DeNero,et al.  Inducing Sentence Structure from Parallel Corpora for Reordering , 2011, EMNLP.

[4]  Kevin Knight,et al.  Automated Postediting of Documents , 1994, AAAI.

[5]  Kevin Duh,et al.  Post-ordering in Statistical Machine Translation , 2011, MTSUMMIT.

[6]  Andreas Stolcke,et al.  SRILM at Sixteen: Update and Outlook , 2011 .

[7]  Daniel Marcu,et al.  Scalable Inference and Training of Context-Rich Syntactic Translation Models , 2006, ACL.

[8]  Kevin Knight,et al.  Training Tree Transducers , 2004, NAACL.

[9]  Philipp Koehn,et al.  Clause Restructuring for Statistical Machine Translation , 2005, ACL.

[10]  Dekai Wu,et al.  Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora , 1997, CL.

[11]  David Chiang,et al.  A Hierarchical Phrase-Based Model for Statistical Machine Translation , 2005, ACL.

[12]  Masao Utiyama,et al.  Post-ordering by Parsing for Japanese-English Statistical Machine Translation , 2012, ACL.

[13]  DuhKevin,et al.  HPSG-Based Preprocessing for English-to-Japanese Translation , 2012 .

[14]  Kevin Duh,et al.  HPSG-Based Preprocessing for English-to-Japanese Translation , 2012, TALIP.

[15]  Dan Klein,et al.  Improved Inference for Unlexicalized Parsing , 2007, NAACL.

[16]  Jun'ichi Tsujii,et al.  Feature Forest Models for Probabilistic HPSG Parsing , 2008, CL.

[17]  Brian Roark,et al.  Incremental Parsing with the Perceptron Algorithm , 2004, ACL.