论文信息 - A Multi-Task Architecture on Relevance-based Neural Query Translation

A Multi-Task Architecture on Relevance-based Neural Query Translation

We describe a multi-task learning approach to train a Neural Machine Translation (NMT) model with a Relevance-based Auxiliary Task (RAT) for search query translation. The translation process for Cross-lingual Information Retrieval (CLIR) task is usually treated as a black box and it is performed as an independent step. However, an NMT model trained on sentence-level parallel data is not aware of the vocabulary distribution of the retrieval corpus. We address this problem with our multi-task learning architecture that achieves 16% improvement over a strong NMT baseline on Italian-English query-document dataset. We show using both quantitative and qualitative analysis that our model generates balanced and precise translations with the regularization effect it achieves from multi-task learning paradigm.

James Allan | Sheikh Muhammad Sarwar | Hamed Bonab | James Allan | Hamed Bonab

[1] Stefan Riezler,et al. Learning to translate queries for CLIR , 2014, SIGIR.

[2] Dong Zhou,et al. Translation techniques in cross-language information retrieval , 2012, CSUR.

[3] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[4] Steven Bird,et al. NLTK: The Natural Language Toolkit , 2002, ACL.

[5] Stefan Riezler,et al. Boosting Cross-Language Retrieval by Learning Bilingual Phrase Associations from Relevance Rankings , 2013, EMNLP.

[6] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.

[7] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[8] CHENGXIANG ZHAI,et al. A study of smoothing methods for language models applied to information retrieval , 2004, TOIS.

[9] Marie-Francine Moens,et al. Monolingual and Cross-Lingual Information Retrieval Models Based on (Bilingual) Word Embeddings , 2015, SIGIR.

[10] Philipp Koehn,et al. Europarl: A Parallel Corpus for Statistical Machine Translation , 2005, MTSUMMIT.

[11] Satoshi Nakamura,et al. Training Neural Machine Translation using Word Embedding-based Loss , 2018, ArXiv.

[12] Sebastian Ruder,et al. An Overview of Multi-Task Learning in Deep Neural Networks , 2017, ArXiv.

[13] Gregory Grefenstette,et al. Cross-Language Information Retrieval , 1998, The Springer International Series on Information Retrieval.

[14] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.