NTT-NII Statistical Machine Translation for NTCIR-10 PatentMT

This paper describes the NTT-NII system for the NTCIR-10 PatentMT task. The system extends the NTT-UT system from NTCIR-9 with three components: a new English dependency parser (for the EJ task), syntactic rule-based pre-ordering (for the JE task), and syntax-based post-ordering (for the JE task). Our system ranked first in the EJ subtask in both automatic and subjective evaluation, and was the best SMT system in the JE subtask.