Leveraging Online User Feedback to Improve Statistical Machine Translation

In this article we present a three-step methodology for dynamically improving a statistical machine translation (SMT) system by incorporating human feedback in the form of free edits on the system translations. We target at feedback provided by casual users, which is typically error-prone. Thus, we first propose a filtering step to automatically identify the better user-edited translations and discard the useless ones. A second step produces a pivot-based alignment between source and user-edited sentences, focusing on the errors made by the system. Finally, a third step produces a new translation model and combines it linearly with the one from the original system. We perform a thorough evaluation on a real-world dataset collected from the Reverso.net translation service and show that every step in our methodology contributes significantly to improve a general purpose SMT system. Interestingly, the quality improvement is not only due to the increase of lexical coverage, but to a better lexical selection, reordering, and morphology. Finally, we show the robustness of the methodology by applying it to a different scenario, in which the new examples come from an automatically Web-crawled parallel corpus. Using exactly the same architecture and models provides again a significant improvement of the translation quality of a general purpose baseline SMT system.

[1]  F. Casacuberta,et al.  D4.2: Progress Report on Adaptive Translation Models , 2013 .

[2]  O. Cappé,et al.  On‐line expectation–maximization algorithm for latent data models , 2009 .

[3]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[4]  Roland Kuhn,et al.  Mixture-Model Adaptation for SMT , 2007, WMT@ACL.

[5]  James Mayfield,et al.  Character N-Gram Tokenization for European Language Text Retrieval , 2004, Information Retrieval.

[6]  Roland Kuhn,et al.  Discriminative Instance Weighting for Domain Adaptation in Statistical Machine Translation , 2010, EMNLP.

[7]  José B. Mariño,et al.  Guidelines for Word Alignment Evaluation and Manual Alignment , 2005, Lang. Resour. Evaluation.

[8]  Chris Callison-Burch,et al.  Scaling Phrase-Based Statistical Machine Translation to Larger Corpora and Longer Phrases , 2005, ACL.

[9]  Jaime G. Carbonell,et al.  Active Learning and Crowd-Sourcing for Machine Translation , 2010, LREC.

[10]  Rafael E. Banchs,et al.  Improving statistical machine translation through adaptation and learning , 2014 .

[11]  Rico Sennrich,et al.  Mixture-Modeling with Unsupervised Clusters for Domain Adaptation in Statistical Machine Translation , 2012, EAMT.

[12]  Mauro Cettolo,et al.  Cache-based Online Adaptation for Machine Translation Enhanced Computer Assisted Translation , 2013, MTSUMMIT.

[13]  Philipp Koehn,et al.  Dirt Cheap Web-Scale Parallel Text from the Common Crawl , 2013, ACL.

[14]  Thorsten Joachims,et al.  Making large-scale support vector machine learning practical , 1999 .

[15]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[16]  Mauro Cettolo,et al.  Online Learning Approaches in Computer Assisted Translation , 2013, WMT@ACL.

[17]  Jorge Nocedal,et al.  A Limited Memory Algorithm for Bound Constrained Optimization , 1995, SIAM J. Sci. Comput..

[18]  Rafael E. Banchs,et al.  Deriving translation units using small additional corpora , 2011, EAMT.

[19]  Philipp Koehn,et al.  Europarl: A Parallel Corpus for Statistical Machine Translation , 2005, MTSUMMIT.

[20]  Adam Lopez Tera-Scale Translation Models via Pattern Matching , 2008, COLING.

[21]  Stephan Vogel,et al.  Parallel Implementations of Word Alignment Tool , 2008, SETQALNLP.

[22]  Alon Lavie,et al.  Meteor 1.3: Automatic Metric for Reliable Optimization and Evaluation of Machine Translation Systems , 2011, WMT@EMNLP.

[23]  Geoffrey E. Hinton,et al.  A View of the Em Algorithm that Justifies Incremental, Sparse, and other Variants , 1998, Learning in Graphical Models.

[24]  Adam Lopez,et al.  Proceedings of the Seventh Workshop on Statistical Machine Translation , 2012 .

[25]  Holger Schwenk,et al.  Incremental adaptation using translation information and post-editing analysis , 2012, IWSLT.

[26]  José B. Mariño,et al.  The TALP-UPC phrase-based translation systems for WMT12: Morphology simplification and domain adaptation , 2012, WMT@NAACL-HLT.

[27]  Lluís Màrquez i Villodre,et al.  An Analysis (and an Annotated Corpus) of User Responses to Machine Translation Output , 2012, LREC.

[28]  Michel Simard,et al.  Using cognates to align sentences in bilingual corpora , 1993, TMI.

[29]  Philipp Koehn,et al.  Findings of the 2012 Workshop on Statistical Machine Translation , 2012, WMT@NAACL-HLT.

[30]  Bruno Pouliquen,et al.  Automatic Identification of Document Translations in Large Multilingual Document Collections , 2006, ArXiv.

[31]  Philipp Koehn,et al.  Large and Diverse Language Models for Statistical Machine Translation , 2008, IJCNLP.

[32]  Arianna Bisazza,et al.  Fill-up versus interpolation methods for phrase-based SMT adaptation , 2011, IWSLT.

[33]  Samuel Reese,et al.  FreeLing 2.1: Five Years of Open-source Language Processing Tools , 2010, LREC.

[34]  Philipp Koehn,et al.  Factored Translation Models , 2007, EMNLP.

[35]  George R. Doddington,et al.  Automatic Evaluation of Machine Translation Quality Using N-gram Co-Occurrence Statistics , 2002 .

[36]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[37]  Nitin Madnani,et al.  TER-Plus: paraphrase, semantic, and alignment enhancements to Translation Edit Rate , 2009, Machine Translation.

[38]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[39]  D. Hardt,et al.  Incremental Re-training for Post-editing SMT , 2010, AMTA.

[40]  Kenneth Ward Church,et al.  A Program for Aligning Sentences in Bilingual Corpora , 1993, CL.

[41]  Michel Simard,et al.  Statistical Phrase-Based Post-Editing , 2007, NAACL.

[42]  Ian Witten,et al.  Data Mining , 2000 .

[43]  Mikel L. Forcada,et al.  FAUST - Feedback Analysis for User adaptive Statistical Translation , 2012, EAMT.

[44]  Chris Callison-Burch,et al.  Stream-based Translation Models for Statistical Machine Translation , 2010, NAACL.

[45]  Pascual Martínez-Gómez,et al.  Online adaptation strategies for statistical machine translation in post-editing scenarios , 2012, Pattern Recognit..

[46]  Jianfeng Gao,et al.  Domain Adaptation via Pseudo In-Domain Data Selection , 2011, EMNLP.

[47]  Bohn Stafleu van Loghum Google translate , 2017 .

[48]  Alberto Barrón-Cedeño,et al.  Identifying Useful Human Correction Feedback from an On-Line Machine Translation Service , 2013, IJCAI.

[49]  Stefan Riezler,et al.  On Some Pitfalls in Automatic Evaluation and Significance Testing for MT , 2005, IEEvaluation@ACL.

[50]  Francisco Casacuberta,et al.  Online Learning for Interactive Statistical Machine Translation , 2010, NAACL.

[51]  Lluís Màrquez i Villodre,et al.  Asiya: An Open Toolkit for Automatic Machine Translation (Meta-)Evaluation , 2010, Prague Bull. Math. Linguistics.

[52]  Holger Schwenk,et al.  Issues in incremental adaptation of statistical MT from human post-edits , 2013, MTSUMMIT.

[53]  Hermann Ney,et al.  A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.