An Open-Source Web-Based Tool for Resource-Agnostic Interactive Translation Prediction

We present a web-based open-source tool for interactive translation prediction (ITP) and describe its underlying architecture. ITP systems assist human translators by making context-based computer-generated suggestions as they type. Most of the ITP systems in literature are strongly coupled with a statistical machine translation system that is conveniently adapted to provide the suggestions. Our system, however, follows a resource-agnostic approach and suggestions are obtained from any unmodified black-box bilingual resource. This paper reviews our ITP method and describes the architecture of Forecat, a web tool, partly based on the recent technology of web components, that eases the use of our ITP approach in any web application requiring this kind of translation assistance. We also evaluate the performance of our method when using an unmodified Moses-based statistical machine translation system as the bilingual resource.

[1]  Francisco Casacuberta,et al.  Multimodal interactive machine translation , 2010, ICMI-MLMI '10.

[2]  Philipp Koehn,et al.  Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[3]  Mikel L. Forcada,et al.  Black-box integration of heterogeneous bilingual resources into an interactive translation system , 2014, HaCaT@EACL.

[4]  Hermann Ney,et al.  Statistical Approaches to Computer-Assisted Translation , 2009, CL.

[5]  Francis M. Tyers,et al.  Apertium: a free/open-source platform for rule-based machine translation , 2011, Machine Translation.

[6]  Philipp Koehn,et al.  A Web-Based Interactive Computer Aided Translation Tool , 2009, ACL.

[7]  Pierre Isabelle,et al.  Target-Text Mediated Interactive Machine Translation , 2004, Machine Translation.

[8]  Alex Kulesza,et al.  Confidence Estimation for Machine Translation , 2004, COLING.

[9]  Germán Sanchis-Trilles,et al.  CASMACAT: An Open Source Workbench for Advanced Computer Aided Translation , 2013, Prague Bull. Math. Linguistics.

[10]  Sergei Nirenburg,et al.  Toward Multi-Engine Machine Translation , 1994, HLT.

[11]  Mikel L. Forcada,et al.  Using external sources of bilingual information for on-the-fly word alignment , 2012, ArXiv.

[12]  Daniel Ortiz,et al.  Advances in fully-automatic and interactive phrase-based statistical machine translation , 2011 .

[13]  Yaser Al-Onaizan,et al.  Distortion Models for Statistical Machine Translation , 2006, ACL.

[14]  Marcin Junczys-Dowmunt,et al.  Phrasal Rank-Encoding: Exploiting Phrase Redundancy and Translational Relations for Phrase Table Compression , 2012, Prague Bull. Math. Linguistics.

[15]  Ecnica De Valencia,et al.  Advances in Fully-Automatic and Interactive Phrase-Based Statistical Machine Translation , 2011 .

[16]  Felipe Sánchez-Martínez,et al.  Choosing the best machine translation system to translate a sentence by using only source-language information , 2011, EAMT.

[17]  Philipp Koehn,et al.  Analysing the Effect of Out-of-Domain Data on SMT Systems , 2012, WMT@NAACL-HLT.

[18]  Elliott Macklovitch TransType2 : The Last Word , 2006, LREC.

[19]  George F. Foster,et al.  Evaluation of TRANSTYPE, a Computer-aided Translation Typing System: A Comparison of a Theoretical- and a User-oriented Evaluation Procedures , 2000, LREC.

[20]  Chris Callison-Burch,et al.  Open Source Toolkit for Statistical Machine Translation: Factored Translation Models and Lattice Decoding , 2006 .