暂无分享,去创建一个
Bowen Zhou | Ramesh Nallapati | Yoshua Bengio | Sungjin Ahn | Çaglar Gülçehre | Yoshua Bengio | Çaglar Gülçehre | Bowen Zhou | Sungjin Ahn | Ramesh Nallapati
[1] Kuldip K. Paliwal,et al. Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..
[2] Yoshua Bengio,et al. Hierarchical Probabilistic Neural Network Language Model , 2005, AISTATS.
[3] M. Tomasello,et al. A new look at infant pointing. , 2007, Child development.
[4] Yoshua Bengio,et al. Adaptive Importance Sampling to Accelerate Training of a Neural Probabilistic Language Model , 2008, IEEE Transactions on Neural Networks.
[5] Aapo Hyvärinen,et al. Noise-Contrastive Estimation of Unnormalized Statistical Models, with Applications to Natural Image Statistics , 2012, J. Mach. Learn. Res..
[6] M. Tomasello,et al. Origins of the Human Pointing Gesture: a Training Study , 2022 .
[7] Razvan Pascanu,et al. Theano: new features and speed improvements , 2012, ArXiv.
[8] Matthew D. Zeiler. ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.
[9] Koray Kavukcuoglu,et al. Learning word embeddings efficiently with noise-contrastive estimation , 2013, NIPS.
[10] Razvan Pascanu,et al. On the difficulty of training recurrent neural networks , 2012, ICML.
[11] Alex Graves,et al. Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.
[12] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[13] Razvan Pascanu,et al. How to Construct Deep Recurrent Neural Networks , 2013, ICLR.
[14] Yoshua Bengio,et al. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.
[15] Yoshua Bengio,et al. On Using Very Large Target Vocabulary for Neural Machine Translation , 2014, ACL.
[16] Quoc V. Le,et al. Addressing the Rare Word Problem in Neural Machine Translation , 2014, ACL.
[17] Jason Weston,et al. A Neural Attention Model for Abstractive Sentence Summarization , 2015, EMNLP.
[18] Jason Weston,et al. Large-scale Simple Question Answering with Memory Networks , 2015, ArXiv.
[19] Navdeep Jaitly,et al. Pointer Networks , 2015, NIPS.
[20] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[21] Phil Blunsom,et al. Teaching Machines to Read and Comprehend , 2015, NIPS.
[22] Peter Kulchyski. and , 2015 .
[23] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[24] Rico Sennrich,et al. Neural Machine Translation of Rare Words with Subword Units , 2015, ACL.
[25] Mirella Lapata,et al. Neural Summarization by Extracting Sentences and Words , 2016, ACL.
[26] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[27] Oriol Vinyals,et al. Multilingual Language Processing From Bytes , 2015, NAACL.
[28] Misha Denil,et al. Noisy Activation Functions , 2016, ICML.
[29] John Salvatier,et al. Theano: A Python framework for fast computation of mathematical expressions , 2016, ArXiv.
[30] Hang Li,et al. “ Tony ” DNN Embedding for “ Tony ” Selective Read for “ Tony ” ( a ) Attention-based Encoder-Decoder ( RNNSearch ) ( c ) State Update s 4 SourceVocabulary Softmax Prob , 2016 .