Beyond Prefix-Based Interactive Translation Prediction

Current automatic machine translation systems require heavy human proofreading to produce high-quality translations. We present a new interactive machine translation approach aimed at providing a natural collaboration between humans and translation systems. As such, we grant the user complete freedom to validate and correct any part of the translations suggested by the system. Our approach is then designed according to the requirements placed by this unrestricted proofreading protocol. In particular, the ability of the system to suggest new translations coherent with the set of potentially disjoint translation segments validated by the user. We evaluate our approach in a usersimulated setting where reference translations are considered the output desired by a human expert. Results show important reductions in the number of edits in comparison to decoupled post-editing and conventional prefix-based interactive translation prediction. Additionally, we provide evidence that it can also reduce the cognitive overload reported for interactive translation systems in previous user studies.

[1]  Alfons Juan-Císcar,et al.  Interactive handwriting recognition with limited user effort , 2014, International Journal on Document Analysis and Recognition (IJDAR).

[2]  Franz Josef Och,et al.  Minimum Error Rate Training in Statistical Machine Translation , 2003, ACL.

[3]  Matthew G. Snover,et al.  A Study of Translation Edit Rate with Targeted Human Annotation , 2006, AMTA.

[4]  Alejandro Héctor Toselli,et al.  Computer Assisted Transcription of Handwritten Text Images , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[5]  Francisco Casacuberta,et al.  Interactive Machine Translation using Hierarchical Translation Models , 2013, EMNLP.

[6]  Philippe Langlais,et al.  Trans Type: Development-Evaluation Cycles to Boost Translator's Productivity , 2002, Machine Translation.

[7]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[8]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[9]  Francisco Casacuberta,et al.  Computer Assisted Transcription of Speech , 2007, IbPRIA.

[10]  Ralph Weischedel,et al.  A STUDY OF TRANSLATION ERROR RATE WITH TARGETED HUMAN ANNOTATION , 2005 .

[11]  Ecnica De Valencia,et al.  Advances in Fully-Automatic and Interactive Phrase-Based Statistical Machine Translation , 2011 .

[12]  Germán Sanchis-Trilles,et al.  User Evaluation of Advanced Interaction Features for a Computer-Assisted Translation Workbench , 2013, MTSUMMIT.

[13]  Andreas Zollmann,et al.  Syntax Augmented Machine Translation via Chart Parsing , 2006, WMT@HLT-NAACL.

[14]  Francisco Casacuberta,et al.  Statistical Phrase-Based Models for Interactive Computer-Assisted Translation , 2006, ACL.

[15]  Philipp Koehn,et al.  Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[16]  Daniel Ortiz,et al.  Advances in fully-automatic and interactive phrase-based statistical machine translation , 2011 .

[17]  David Chiang,et al.  A Hierarchical Phrase-Based Model for Statistical Machine Translation , 2005, ACL.

[18]  Giorgio Gallo,et al.  Directed Hypergraphs and Applications , 1993, Discret. Appl. Math..

[19]  Hermann Ney,et al.  Generation of Word Graphs in Statistical Machine Translation , 2002, EMNLP.

[20]  Philipp Koehn,et al.  Refinements to Interactive Translation Prediction Based on Search Graphs , 2014, ACL.

[21]  Germán Sanchis-Trilles,et al.  Evaluating the effects of interactivity in a post-editing workbench , 2014, LREC.

[22]  Daniel Marcu,et al.  Statistical Phrase-Based Translation , 2003, NAACL.

[23]  Germán Sanchis-Trilles,et al.  Interactive translation prediction versus conventional post-editing in practice: a study with the CasMaCat  workbench , 2014, Machine Translation.

[24]  Hermann Ney,et al.  Statistical Approaches to Computer-Assisted Translation , 2009, CL.

[25]  Hermann Ney,et al.  Discriminative Training and Maximum Entropy Models for Statistical Machine Translation , 2002, ACL.

[26]  Jeffrey Heer,et al.  Human Effort and Machine Learnability in Computer Aided Translation , 2014, EMNLP.

[27]  Germán Sanchis-Trilles,et al.  Improving Interactive Machine Translation via Mouse Actions , 2008, EMNLP.

[28]  Francisco Casacuberta,et al.  Balancing User Effort and Translation Error in Interactive Machine Translation via Confidence Measures , 2010, ACL.

[29]  Ying Zhang,et al.  Measuring Confidence Intervals for the Machine Translation Evaluation Metrics , 2004 .