暂无分享,去创建一个
George Kurian | Wei Wang | Quoc V. Le | Yuan Cao | Lukasz Kaiser | Yonghui Wu | Taku Kudo | Oriol Vinyals | Melvin Johnson | Wolfgang Macherey | Mohammad Norouzi | Zhifeng Chen | Yoshikiyo Kato | Jeffrey Dean | Qin Gao | Keith Stevens | Klaus Macherey | Gregory S. Corrado | Cliff Young | Nishant Patil | Jason Smith | Apurva Shah | Xiaobing Liu | Jeff Klingner | Jason Riesa | Mike Schuster | Maxim Krikun | Hideto Kazawa | Alex Rudnick | Macduff Hughes | Stephan Gouws | Lukasz Kaiser | Oriol Vinyals | J. Dean | G. Corrado | Z. Chen | Mohammad Norouzi | J. Klingner | M. Schuster | Yonghui Wu | Melvin Johnson | M. Krikun | Yuan Cao | Wolfgang Macherey | Xiaobing Liu | C. Young | Nishant Patil | Stephan Gouws | Taku Kudo | Macduff Hughes | Jason R. Smith | Qin Gao | H. Kazawa | Wei Wang | K. Stevens | Klaus Macherey | Jason Riesa | Y. Kato | George Kurian | Alex Rudnick | Apurva Shah
[1] Razvan Pascanu,et al. Understanding the exploding gradient problem , 2012, ArXiv.
[2] Rico Sennrich,et al. Neural Machine Translation of Rare Words with Subword Units , 2015, ACL.
[3] Quoc V. Le,et al. Multi-task Sequence to Sequence Learning , 2015, ICLR.
[4] Phil Blunsom,et al. Recurrent Continuous Translation Models , 2013, EMNLP.
[5] Kenneth Heafield,et al. N-gram Counts and Language Models from the Common Crawl , 2014, LREC.
[6] John Cocke,et al. A Statistical Approach to Language Translation , 1988, COLING.
[7] Marc'Aurelio Ranzato,et al. Sequence Level Training with Recurrent Neural Networks , 2015, ICLR.
[8] John Cocke,et al. A Statistical Approach to Machine Translation , 1990, CL.
[9] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[10] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[11] Jürgen Schmidhuber,et al. Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.
[12] Yoshua Bengio,et al. On Using Very Large Target Vocabulary for Neural Machine Translation , 2014, ACL.
[13] Christian Lebiere,et al. The Cascade-Correlation Learning Architecture , 1989, NIPS.
[14] Mike Schuster,et al. Japanese and Korean voice search , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[15] José A. R. Fonollosa,et al. Character-based Neural Machine Translation , 2016, ACL.
[16] Jian Cheng,et al. Quantized Convolutional Neural Networks for Mobile Devices , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[17] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[18] Yuan Yu,et al. TensorFlow: A system for large-scale machine learning , 2016, OSDI.
[19] Lonnie Chrisman,et al. Learning Recursive Distributed Representations for Holistic Computation , 1991 .
[20] Bin Liu,et al. Ternary Weight Networks , 2016, ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[21] Robert L. Mercer,et al. The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.
[22] Kuldip K. Paliwal,et al. Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..
[23] Christopher D. Manning,et al. Achieving Open Vocabulary Neural Machine Translation with Hybrid Word-Character Models , 2016, ACL.
[24] Wei Xu,et al. Deep Recurrent Models with Fast-Forward Connections for Neural Machine Translation , 2016, TACL.
[25] Nadir Durrani,et al. Edinburgh’s Phrase-based Machine Translation Systems for WMT-14 , 2014, WMT@ACL.
[26] Richard M. Schwartz,et al. Fast and Robust Neural Network Joint Models for Statistical Machine Translation , 2014, ACL.
[27] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[28] Yoshua Bengio,et al. A Character-level Decoder without Explicit Segmentation for Neural Machine Translation , 2016, ACL.
[29] Song Han,et al. Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding , 2015, ICLR.
[30] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.
[31] Dianhai Yu,et al. Multi-Task Learning for Multiple Language Translation , 2015, ACL.
[32] Marc'Aurelio Ranzato,et al. Large Scale Distributed Deep Networks , 2012, NIPS.
[33] Yang Liu,et al. Coverage-based Neural Machine Translation , 2016, ArXiv.
[34] Pritish Narayanan,et al. Deep Learning with Limited Numerical Precision , 2015, ICML.
[35] Dale Schuurmans,et al. Reward Augmented Maximum Likelihood for Neural Structured Prediction , 2016, NIPS.
[36] Yang Liu,et al. Minimum Risk Training for Neural Machine Translation , 2015, ACL.
[37] Quoc V. Le,et al. Addressing the Rare Word Problem in Neural Machine Translation , 2014, ACL.
[38] Daniel Marcu,et al. Statistical Phrase-Based Translation , 2003, NAACL.
[39] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.
[40] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[41] Yoshua Bengio,et al. Gradient Flow in Recurrent Nets: the Difficulty of Learning Long-Term Dependencies , 2001 .