Reordering adverbial chunks in Chinese-English patent machine translation

Adverbial chunks are widely distributed in Chinese patent documents. This paper proposes a rule-based method for reordering long-distance adverbial chunks in a Chinese-English patent MT system. After analyzing the common types of adverbials, different reordering strategies and formal rules are designed for each type according to its semantic and expressive features, as well as the systematic word-order differences between Chinese and English. BLEU scores improved over the baseline, indicating that the reordering method helps improve the final translation results.
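To make the idea of a reordering rule concrete, here is a minimal sketch of one such rule. It is not the paper's rule set: the chunk labels (`NP`, `ADV`, `VP`, `PUNCT`), the `(label, text)` representation, and the function name `reorder_adverbials` are assumptions chosen for illustration. The sketch assumes the sentence has already been chunked and glossed, and it moves a pre-verbal adverbial chunk (the usual Chinese position) to the end of its clause (a common English position).

```python
from typing import List, Tuple

# Hypothetical representation: each chunk is (label, English gloss).
Chunk = Tuple[str, str]


def reorder_adverbials(chunks: List[Chunk], movable=("ADV",)) -> List[Chunk]:
    """Move every movable adverbial chunk to the end of its clause.

    A clause is assumed to end at a PUNCT chunk or at the end of input.
    """
    result: List[Chunk] = []
    pending: List[Chunk] = []  # adverbials waiting for the clause to end
    for label, text in chunks:
        if label in movable:
            pending.append((label, text))
        elif label == "PUNCT":
            # Clause boundary: emit held adverbials before the punctuation.
            result.extend(pending)
            pending = []
            result.append((label, text))
        else:
            result.append((label, text))
    result.extend(pending)  # flush if the input has no final punctuation
    return result


# Chinese-order gloss: subject, adverbial, verb, object.
source_order = [
    ("NP", "the device"),
    ("ADV", "under the control of the controller"),
    ("VP", "adjusts"),
    ("NP", "the valve"),
    ("PUNCT", "."),
]
reordered = reorder_adverbials(source_order)
print(" ".join(text for _, text in reordered))
```

A real system would need separate rules per adverbial type, since some adverbials (for example, short manner or frequency adverbs) stay before the verb in English; the `movable` parameter is a placeholder for that distinction.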
