STACL: Simultaneous Translation with Implicit Anticipation and Controllable Latency using Prefix-to-Prefix Framework

Simultaneous translation, which translates sentences before they are finished, is useful in many scenarios but is notoriously difficult due to word-order differences between languages. While the conventional sequence-to-sequence framework is only suitable for full-sentence translation, we propose a novel prefix-to-prefix framework for simultaneous translation that implicitly learns to anticipate within a single translation model. Within this framework, we present a very simple yet surprisingly effective wait-k policy, trained to generate the target sentence concurrently with the source sentence but always k words behind. Experiments show our strategy achieves low latency and reasonable quality (compared to full-sentence translation) on four directions: zh↔en and de↔en.
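The wait-k policy described above fixes the read/write schedule in advance: read the first k source words, then alternate one write with one read until the source is exhausted, after which the remaining target words are written. A minimal sketch of that schedule (the function name and action labels here are illustrative, not from the paper):

```python
def wait_k_schedule(src_len: int, tgt_len: int, k: int):
    """Yield ("READ", i) or ("WRITE", j) actions for a wait-k policy.

    When writing target word t (1-indexed), the model has read
    min(k + t - 1, src_len) source words, i.e. it lags k words behind
    until the source sentence ends.
    """
    reads = writes = 0
    while writes < tgt_len:
        # Source prefix required before emitting the next target word.
        needed = min(k + writes, src_len)
        if reads < needed:
            yield ("READ", reads)   # consume the next source word
            reads += 1
        else:
            yield ("WRITE", writes)  # emit the next target word
            writes += 1
```

For example, with k=2, a 3-word source, and a 4-word target, the schedule is READ, READ, WRITE, READ, WRITE, WRITE, WRITE: translation starts after only two source words, and the tail of the target is generated after the full source has been seen.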
