Reinforced NMT for Sentiment and Content Preservation in Low-resource Scenario

Preserving domain knowledge from source to target is crucial in any translation workflow. Translation service providers that use machine translation (MT) in production can therefore reasonably expect the translation process to transfer both the underlying pragmatics and the semantics of the source-side sentences into the target language. However, recent studies suggest that MT systems often fail to preserve such crucial information (e.g., sentiment, emotion, gender traits) embedded in the source text. Moreover, raw automatic translations are often fed directly to other natural language processing (NLP) applications (e.g., a sentiment classifier) in a cross-lingual platform, so the loss of such information during translation can degrade the performance of downstream NLP tasks that rely heavily on the output of the MT system. In our current research, we balance both sides (i.e., sentiment and semantics) during translation by controlling a global-attention-based neural MT (NMT) system so that it generates translations that encode the underlying sentiment of a source sentence while preserving its non-opinionated semantic content. To this end, we use a state-of-the-art reinforcement learning method, namely actor-critic, with a novel reward combination module to fine-tune the NMT system so that it learns to generate translations that are best suited for a downstream task, viz. sentiment classification, while keeping the source-side semantics intact. Experimental results for the Hindi–English language pair show that our proposed method significantly improves the performance of the sentiment classifier and also yields an improved NMT system.
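
The abstract does not say how the reward combination module merges the two signals, so the following is only a minimal sketch under stated assumptions, not the authors' method. It assumes a content reward in [0, 1] (here a simple unigram F1 against a reference translation, standing in for a sentence-level metric such as smoothed BLEU), a sentiment reward in [0, 1] (for example, the probability a classifier assigns to the gold sentiment label), and a weighted sum controlled by a hypothetical parameter `alpha`. The names `unigram_f1` and `combined_reward` are illustrative.

```python
from collections import Counter


def unigram_f1(hypothesis: str, reference: str) -> float:
    """Content-preservation proxy: unigram F1 between two whitespace-tokenized
    sentences. This is an assumed stand-in for a sentence-level MT metric."""
    hyp, ref = hypothesis.split(), reference.split()
    if not hyp or not ref:
        return 0.0
    # Clipped token overlap: each reference token can be matched at most once.
    overlap = sum((Counter(hyp) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(hyp)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


def combined_reward(content_reward: float,
                    sentiment_reward: float,
                    alpha: float = 0.5) -> float:
    """Assumed combination rule: a convex mix of the two rewards.
    alpha = 1.0 uses only the content reward, alpha = 0.0 only the sentiment reward."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return alpha * content_reward + (1.0 - alpha) * sentiment_reward


if __name__ == "__main__":
    content = unigram_f1("the food was great", "the food was really great")
    # 0.9 is a made-up classifier probability, used purely for illustration.
    print(combined_reward(content, sentiment_reward=0.9, alpha=0.5))
```

In an actor-critic setup, a scalar of this kind would serve as the reward for a sampled translation; the choice of `alpha` trades off sentiment preservation against content preservation.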
