论文信息 - Rule Based Katakana to Myanmar Transliteration for Post-editing Machine Translation

Rule Based Katakana to Myanmar Transliteration for Post-editing Machine Translation

Phrase based statistical machine translation (PBSMT) is a current state-of-the-art approach to machine translation, however its outputs often contain various types of errors such as lexical errors, and syntax errors (Koehn et al., 2003)(Bojar et al., 2013)(Bojar, 2011b). Incorporating deep linguistic knowledge directly into PBSMT is not easy and rarely leads to improvements in translation performance (Bojar, 2011a). One of the possible solution is to make automatic corrections on translated output in a post-editing process. This paper presents a rule based post-editing scheme for fixing translation errors based on out of vocabulary (OOV) Katakana words produced by Japanese to Myanmar PBSMT. Our experiments indicate that applying rule based Katakana to Myanmar transliteration leads to substantial improvements of translation quality both in terms of BLEU scores and OOV coverage.

[1] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[2] Eiichiro Sumita,et al. Creating corpora for speech-to-speech translation , 2003, INTERSPEECH.

[3] Daniel Marcu,et al. Statistical Phrase-Based Translation , 2003, NAACL.

[4] Philipp Koehn,et al. Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[5] Michel Simard,et al. Statistical Phrase-Based Post-Editing , 2007, NAACL.

[6] Kemal Oflazer,et al. Exploring Different Representational Units in English-to-Turkish Statistical Machine Translation , 2007, WMT@ACL.

[7] Satoshi Nakamura,et al. A Bayesian Model of Transliteration and Its Human Evaluation When Integrated into a Machine Translation System , 2011, IEICE Trans. Inf. Syst..

[8] Ondrej Bojar,et al. Analyzing Error Types in English-Czech Machine Translation , 2011, Prague Bull. Math. Linguistics.

[9] Philipp Koehn,et al. Findings of the 2013 Workshop on Statistical Machine Translation , 2013, WMT@ACL.