OpenMaTrEx: A Free/Open-Source Marker-Driven Example-Based Machine Translation System

We describe OpenMaTrEx, a free/open-source example-based machine translation (EBMT) system based on the marker hypothesis. It comprises a marker-driven chunker, a collection of chunk aligners, and two engines: one based on a simple proof-of-concept monotone EBMT recombinator and one using a Moses-based statistical decoder. OpenMaTrEx is a free/open-source release of the basic components of MaTrEx, the Dublin City University machine translation system.
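As a rough illustration of what marker-driven chunking means, the sketch below splits a tokenized sentence at closed-class "marker" words, requiring each chunk to contain at least one non-marker word. It is not the OpenMaTrEx implementation: the function name `marker_chunk` and the tiny English marker list `MARKERS` are hypothetical, chosen only for this example, and a real system would use a much fuller, language-specific marker inventory.

```python
# Hypothetical, deliberately tiny marker-word list (illustration only).
MARKERS = {"the", "a", "an", "in", "on", "of", "to", "with",
           "and", "or", "but", "he", "she", "it", "they"}


def marker_chunk(tokens, markers=MARKERS):
    """Split tokens into chunks that each begin at a marker word.

    A new chunk is opened at a marker word only if the current chunk
    already holds at least one non-marker (content) word, so that no
    chunk consists of marker words alone.
    """
    chunks = []
    current = []
    has_content = False
    for tok in tokens:
        is_marker = tok.lower() in markers
        if is_marker and has_content:
            # Close the current chunk and start a new one at this marker.
            chunks.append(current)
            current = []
            has_content = False
        current.append(tok)
        if not is_marker:
            has_content = True
    if current:
        if chunks and not has_content:
            # Trailing markers with no content word join the previous chunk.
            chunks[-1].extend(current)
        else:
            chunks.append(current)
    return chunks


if __name__ == "__main__":
    sentence = "the dog sat on the mat in the garden".split()
    for chunk in marker_chunk(sentence):
        print(" ".join(chunk))
```

Under these assumptions, consecutive markers such as "on the" stay in the same chunk, because a chunk is closed only after it has gathered a content word.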
