Agent and User-Generated Content and its Impact on Customer Support MT

This paper illustrates a new evaluation framework developed at Unbabel for measuring the quality of source language text and its effect on both Machine Translation (MT) and Human Post-Edition (PE) performed by non-professional post-editors. We examine both agent and user-generated content from the Customer Support domain and propose that differentiating the two is crucial to obtaining high quality translation output. Furthermore, we present results of initial experimentation with a new evaluation typology based on the Multidimensional Quality Metrics (MQM) Framework Lommel et al., 2014), specifically tailored toward the evaluation of source language text. We show how the MQM Framework Lommel et al., 2014) can be adapted to assess errors of monolingual source texts and demonstrate how very specific source errors propagate to the MT and PE targets. Finally, we illustrate how MT systems are not robust enough to handle very specific source noise in the context of Customer Support data.

[1]  Milan Straka,et al.  Understanding Model Robustness to User-generated Noisy Texts , 2021, WNUT.

[2]  Rouhullah Nemati Parsa Trends in e-tools and resources for translators and interpreters , 2021, The International Journal of Translation and Interpreting Research.

[3]  Graham Neubig,et al.  Improving Robustness of Machine Translation with Synthetic Noise , 2019, NAACL.

[4]  André F. T. Martins,et al.  Marian: Fast Neural Machine Translation in C++ , 2018, ACL.

[5]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[6]  Frédéric Béchet,et al.  Syntactic parsing of chat language in contact center conversation corpus , 2016, SIGDIAL Conference.

[7]  Eric Ras,et al.  Challenges of error annotation in native/non-native speaker chat , 2016, KONVENS.

[8]  Veronique Hoste,et al.  SCATE Taxonomy and Corpus of Machine Translation Errors , 2016 .

[9]  A. Burchardt,et al.  Multidimensional Quality Metrics (MQM): A Framework for Declaring and Describing Translation Quality Metrics , 2014 .

[10]  Adam Lind,et al.  Chat language : In the continuum of speech and writing , 2012 .

[11]  Johann Roturier,et al.  Evaluation of MT Systems to Translate User Generated Content , 2011, MTSUMMIT.

[12]  Dan Roth,et al.  Annotating ESL Errors: Challenges and Rewards , 2010 .

[13]  Ambalika Sinha,et al.  Interference of first language in the acquisition of second language , 2009 .

[14]  Stephanie Seneff,et al.  An analysis of grammatical errors in non-native speech in english , 2008, 2008 IEEE Spoken Language Technology Workshop.