How Much Attention Do You Need? A Granular Analysis of Neural Machine Translation Architectures
暂无分享,去创建一个
[1] Richard Socher,et al. A Flexible Approach to Automated RNN Architecture Generation , 2017, ICLR.
[2] Philipp Koehn,et al. Six Challenges for Neural Machine Translation , 2017, NMT@ACL.
[3] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[4] Phil Blunsom,et al. Recurrent Continuous Translation Models , 2013, EMNLP.
[5] Quoc V. Le,et al. Neural Architecture Search with Reinforcement Learning , 2016, ICLR.
[6] Oriol Vinyals,et al. Hierarchical Representations for Efficient Architecture Search , 2017, ICLR.
[7] Alon Lavie,et al. Better Hypothesis Testing for Statistical Machine Translation: Controlling for Optimizer Instability , 2011, ACL.
[8] Geoffrey E. Hinton,et al. Layer Normalization , 2016, ArXiv.
[9] Quoc V. Le,et al. Massive Exploration of Neural Machine Translation Architectures , 2017, EMNLP.
[10] Yann Dauphin,et al. Language Modeling with Gated Convolutional Networks , 2016, ICML.
[11] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[12] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..
[13] Alex Graves,et al. Neural Machine Translation in Linear Time , 2016, ArXiv.
[14] Matt Post,et al. We start by defining the recurrent architecture as implemented in S OCKEYE , following , 2018 .
[15] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[16] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.
[17] George Kurian,et al. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation , 2016, ArXiv.
[18] Yann Dauphin,et al. Convolutional Sequence to Sequence Learning , 2017, ICML.
[19] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[20] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.
[21] Geoffrey J. Gordon,et al. DeepArchitect: Automatically Designing and Training Deep Architectures , 2017, ArXiv.
[22] Sergey Ioffe,et al. Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[23] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[24] Rico Sennrich,et al. Neural Machine Translation of Rare Words with Subword Units , 2015, ACL.
[25] Alon Lavie,et al. The Meteor metric for automatic evaluation of machine translation , 2009, Machine Translation.