DICTDIS: Dictionary Constrained Disambiguation for Improved NMT

Domain-specific neural machine translation (NMT) systems ( e.g. , in educational applications) are socially significant with the potential to help make information accessible to a di- verse set of users in multilingual societies. It is desirable that such NMT systems be lexically constrained and draw from domain-specific dictionaries. Dictionaries could present multiple candidate translations for a source words/phrases on ac- count of the polysemous nature of words. The onus is then on the NMT model to choose the contextually most appro- priate candidate. Prior work has largely ignored this problem and focused on the single candidate setting where the target word or phrase is replaced by a single constraint. In this work we present D ICT D IS , a lexically constrained NMT system that disambiguates between multiple candidate translations derived from dictionaries. We achieve this by augmenting training data with multiple dictionary candidates to actively encourage disambiguation during training. We demonstrate the utility of D ICT D IS via extensive experiments on English-Hindi sentences in a variety of domains including news, fi- nance, medicine and engineering. We obtain superior disambiguation performance on all domains with improved fluency in some domains of up to 4 BLEU points, when compared with existing approaches for lexically constrained and uncon- strained NMT.

[1]  Ganesh Ramakrishnan,et al.  UDAAN - Machine Learning based Post-Editing tool for Document Translation , 2022, COMAD/CODS.

[2]  Sunita Sarawagi,et al.  Accurate Online Posterior Alignments for Principled Lexically-Constrained Decoding , 2022, Annual Meeting of the Association for Computational Linguistics.

[3]  Pratyush Kumar,et al.  Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages , 2021, TACL.

[4]  Niccolò Campolungo,et al.  DiBiMT: A Novel Benchmark for Measuring Word Sense Disambiguation Biases in Machine Translation , 2022, ACL.

[5]  Yoshimasa Tsuruoka,et al.  Modeling Target-side Inflection in Placeholder Translation , 2021, MTSUMMIT.

[6]  Markus Freitag,et al.  Experts, Errors, and Context: A Large-Scale Study of Human Evaluation for Machine Translation , 2021, Transactions of the Association for Computational Linguistics.

[7]  Jean Senellart,et al.  Integrating Domain Terminology into Neural Machine Translation , 2020, COLING.

[8]  Yong Wang,et al.  Lexical-Constraint-Aware Neural Machine Translation via Data Augmentation , 2020, IJCAI.

[9]  Kun Wang,et al.  Alignment-Enhanced Transformer for Constraining NMT with Pre-Specified Translations , 2020, AAAI.

[10]  Shamil Chollampatt,et al.  Lexically Constrained Neural Machine Translation with Levenshtein Transformer , 2020, ACL.

[11]  Marjan Ghazvininejad,et al.  Multilingual Denoising Pre-training for Neural Machine Translation , 2020, Transactions of the Association for Computational Linguistics.

[12]  Marta R. Costa-jussà,et al.  Findings of the 2019 Conference on Machine Translation (WMT19) , 2019, WMT.

[13]  Yaser Al-Onaizan,et al.  Training Neural Machine Translation to Apply Terminology Constraints , 2019, ACL.

[14]  Yue Zhang,et al.  Code-Switching for Enhancing NMT with Pre-Specified Translation , 2019, NAACL.

[15]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[16]  Hermann Ney,et al.  On The Alignment Problem In Multi-Head Attention-Based Neural Machine Translation , 2018, WMT.

[17]  Matt Post,et al.  Fast Lexically Constrained Decoding with Dynamic Beam Allocation for Neural Machine Translation , 2018, NAACL.

[18]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[19]  Qun Liu,et al.  Lexically Constrained Decoding for Sequence Generation Using Grid Beam Search , 2017, ACL.

[20]  Geoffrey E. Hinton,et al.  Layer Normalization , 2016, ArXiv.

[21]  Bowen Zhou,et al.  Pointing the Unknown Words , 2016, ACL.

[22]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[23]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.