[1] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[2] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res.
[3] Håkan Ringbom,et al. Language transfer. Cross-linguistic influence in language learning , 1990 .
[4] Rico Sennrich,et al. The AMU-UEDIN Submission to the WMT16 News Translation Task: Attention-based NMT Models as Feature Functions in Phrase-based SMT , 2016, WMT.
[5] Terence Odlin,et al. Language Transfer: Cross-Linguistic Influence in Language Learning , 1989 .
[6] Mauro Cettolo,et al. Bootstrapping Arabic-Italian SMT through Comparable Texts and Pivot Translation , 2011, EAMT.
[7] Karin M. Verspoor,et al. Findings of the 2016 Conference on Machine Translation , 2016, WMT.
[8] Josef van Genabith,et al. Statistical Post-Editing for a Statistical MT System , 2011, MTSUMMIT.
[9] Arianna Bisazza,et al. Neural versus phrase-based MT quality: An in-depth analysis on English-German and English-French , 2018, Comput. Speech Lang.
[10] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[11] Hava T. Siegelmann,et al. On the Computational Power of Neural Nets , 1995, J. Comput. Syst. Sci.
[12] Marcello Federico,et al. Domain Adaptation for Statistical Machine Translation with Monolingual Resources , 2009, WMT@EACL.
[13] Ashish Vaswani,et al. Self-Attention with Relative Position Representations , 2018, NAACL.
[14] Rico Sennrich,et al. Improving Neural Machine Translation Models with Monolingual Data , 2015, ACL.
[15] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[16] Marcello Federico,et al. Improving Zero-Shot Translation of Low-Resource Languages , 2018, IWSLT.
[17] Quoc V. Le,et al. Multi-task Sequence to Sequence Learning , 2015, ICLR.
[18] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[19] Yoshua Bengio,et al. On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.
[20] Alon Lavie,et al. Better Hypothesis Testing for Statistical Machine Translation: Controlling for Optimizer Instability , 2011, ACL.
[21] Philipp Koehn,et al. Six Challenges for Neural Machine Translation , 2017, NMT@ACL.
[22] Kemal Oflazer,et al. Exploring Different Representational Units in English-to-Turkish Statistical Machine Translation , 2007, WMT@ACL.
[23] Sebastian Stüker,et al. Overview of the IWSLT 2010 evaluation campaign , 2010, IWSLT.
[24] Rico Sennrich,et al. Nematus: a Toolkit for Neural Machine Translation , 2017, EACL.
[25] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.
[26] Marcello Federico,et al. Deep Neural Machine Translation with Weakly-Recurrent Units , 2018, EAMT.
[27] Yoram Singer,et al. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res.
[28] Rico Sennrich,et al. Neural Machine Translation of Rare Words with Subword Units , 2015, ACL.
[29] Mauro Cettolo,et al. The IWSLT 2016 Evaluation Campaign , 2016, IWSLT.
[30] Kevin Knight,et al. Multi-Source Neural Translation , 2016, NAACL.
[31] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.
[32] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.
[33] Dianhai Yu,et al. Multi-Task Learning for Multiple Language Translation , 2015, ACL.
[34] Jan Niehues,et al. Toward Multilingual Neural Machine Translation with Universal Encoder and Decoder , 2016, IWSLT.
[35] Yaser Al-Onaizan,et al. Zero-Resource Translation with Multi-Lingual Neural Machine Translation , 2016, EMNLP.
[36] Tie-Yan Liu,et al. Dual Learning for Machine Translation , 2016, NIPS.
[37] Deniz Yuret,et al. Transfer Learning for Low-Resource Neural Machine Translation , 2016, EMNLP.
[38] Martin Wattenberg,et al. Google’s Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation , 2016, TACL.
[39] Mauro Cettolo,et al. WIT3: Web Inventory of Transcribed and Translated Talks , 2012, EAMT.
[40] George Kurian,et al. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation , 2016, ArXiv.
[41] Jason Lee,et al. Fully Character-Level Neural Machine Translation without Explicit Segmentation , 2016, TACL.
[42] Alexander M. Rush,et al. OpenNMT: Open-Source Toolkit for Neural Machine Translation , 2017, ACL.
[43] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[44] Daniel Marcu,et al. Scalable Inference and Training of Context-Rich Syntactic Translation Models , 2006, ACL.
[45] Yoshua Bengio,et al. Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism , 2016, NAACL.
[46] Rico Sennrich,et al. The University of Edinburgh’s Neural MT Systems for WMT17 , 2017, WMT.
[47] Gavriel Salomon,et al. Transfer of Learning , 1992 .