Touch-Based Pre-Post-Editing of Machine Translation Output

We introduce pre-post-editing, possibly the most basic form of interactive translation, as a touch-based interaction with iteratively improved translation hypotheses prior to classical post-editing. We report simulated experiments that yield very large improvements on classical evaluation metrics (up to 21 BLEU) as well as on a parameterized variant of the TER metric that takes into account the cost of matching/touching tokens, confirming the promising prospects of the novel translation scenarios offered by our approach.
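The abstract does not spell out how the TER variant is parameterized, so the following is only an illustrative sketch of the general idea: a word-level edit rate in which tokens that match the reference (and so would only need to be touched or validated) can carry a nonzero cost. The function name `touch_aware_edit_rate` and the parameters `edit_cost` and `match_cost` are assumptions made for this example, and the block shifts that TER normally includes are left out for brevity.

```python
from typing import Sequence


def touch_aware_edit_rate(
    hyp: Sequence[str],
    ref: Sequence[str],
    edit_cost: float = 1.0,
    match_cost: float = 0.0,
) -> float:
    """Word-level edit rate where matched tokens may also carry a cost.

    Hypothetical illustration only: this is not the paper's metric.
    Insertions, deletions and substitutions each cost `edit_cost`;
    a token that already matches the reference costs `match_cost`.
    TER-style block shifts are not modelled.
    """
    n, m = len(hyp), len(ref)
    # dp[i][j] = minimum cost of turning hyp[:i] into ref[:j]
    dp = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dp[i][0] = i * edit_cost  # delete every hypothesis token
    for j in range(1, m + 1):
        dp[0][j] = j * edit_cost  # insert every reference token
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            same = hyp[i - 1] == ref[j - 1]
            diagonal = match_cost if same else edit_cost
            dp[i][j] = min(
                dp[i - 1][j - 1] + diagonal,  # match or substitution
                dp[i - 1][j] + edit_cost,     # deletion
                dp[i][j - 1] + edit_cost,     # insertion
            )
    # Normalize by reference length, as TER does
    return dp[n][m] / max(m, 1)


if __name__ == "__main__":
    hyp = "the cat sat".split()
    ref = "the cat sat down".split()
    print(touch_aware_edit_rate(hyp, ref))                  # matches are free
    print(touch_aware_edit_rate(hyp, ref, match_cost=0.5))  # matches cost half an edit
```

With `match_cost=0.0` this reduces to a shift-free word edit rate; raising `match_cost` makes the score reflect the effort of touching correct tokens as well as fixing wrong ones.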
