Statistically Motivated Example-based Machine Translation using Translation Memory

In this paper we present a novel way of integrating Translation Memory (TM) into an Example-based Machine Translation (EBMT) system to deal with the problem of scarce resources. We use a dialogue of 380 sentences as the example-base for our system. The translation units in the translation memories are extracted automatically from the aligned phrases and words of a statistical machine translation (SMT) system. We apply the approach to translation from English to Bangla, since many SMT systems have difficulty with such small amounts of training data. We find that the approach improves over a baseline SMT system.
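
As a rough illustration of the two ingredients named above, the following minimal sketch builds a word-level translation memory from alignment links and retrieves the closest example for a new input. It is not the system described in this paper: the function names, the toy data, the placeholder target tokens (`T_tea` and so on, which are not real Bangla), and the choice of word-level edit distance as the matching criterion are all assumptions made for illustration.

```python
from collections import Counter, defaultdict


def build_word_tm(aligned_corpus):
    """Build a word-level translation memory from aligned sentence pairs.

    aligned_corpus: list of (src_tokens, tgt_tokens, links), where links is a
    list of (src_index, tgt_index) pairs, e.g. as produced by a word aligner.
    Returns a dict mapping each source word to its most frequent aligned target.
    """
    counts = defaultdict(Counter)
    for src, tgt, links in aligned_corpus:
        for i, j in links:
            counts[src[i]][tgt[j]] += 1
    return {s: c.most_common(1)[0][0] for s, c in counts.items()}


def edit_distance(a, b):
    """Word-level Levenshtein distance between two token lists."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cur.append(min(prev[j] + 1,              # deletion
                           cur[j - 1] + 1,           # insertion
                           prev[j - 1] + (x != y)))  # substitution or match
        prev = cur
    return prev[-1]


def closest_example(query, example_base):
    """Return the example whose source side is nearest to the query."""
    return min(example_base, key=lambda ex: edit_distance(query, ex[0]))


# Toy example-base (hypothetical data; target tokens are placeholders).
corpus = [
    (["i", "want", "tea"], ["T_i", "T_tea", "T_want"], [(0, 0), (1, 2), (2, 1)]),
    (["i", "want", "coffee"], ["T_i", "T_coffee", "T_want"], [(0, 0), (1, 2), (2, 1)]),
]
word_tm = build_word_tm(corpus)
best = closest_example(["you", "want", "tea"], corpus)
```

In a sketch like this, the retrieved example would supply the target-side skeleton, and `word_tm` would supply replacements for the words that differ between the input and the matched source sentence.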
