暂无分享,去创建一个
Vladlen Koltun | J. Zico Kolter | Shaojie Bai | J. Z. Kolter | V. Koltun | Shaojie Bai | Sequence Modeling | Sequence Modeling
[1] Lukás Burget,et al. Recurrent neural network based language model , 2010, INTERSPEECH.
[2] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[3] Thomas S. Huang,et al. Dilated Recurrent Neural Networks , 2017, NIPS.
[4] John Miller,et al. When Recurrent Models Don't Need To Be Recurrent , 2018, ArXiv.
[5] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[6] Geoffrey E. Hinton,et al. Generating Text with Recurrent Neural Networks , 2011, ICML.
[7] Quoc V. Le,et al. Learning Longer-term Dependencies in RNNs with Auxiliary Losses , 2018, ICML.
[8] Tim Salimans,et al. Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks , 2016, NIPS.
[9] Ruslan Salakhutdinov,et al. Breaking the Softmax Bottleneck: A High-Rank RNN Language Model , 2017, ICLR.
[10] Ankur Bapna,et al. The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation , 2018, ACL.
[11] Tara N. Sainath,et al. Convolutional, Long Short-Term Memory, fully connected Deep Neural Networks , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[12] Yoshua Bengio,et al. On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.
[13] Yiming Yang,et al. Transformer-XL: Language Modeling with Longer-Term Dependency , 2018 .
[14] Fei-Fei Li,et al. Deep visual-semantic alignments for generating image descriptions , 2015, CVPR.
[15] Lawrence D. Jackel,et al. Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.
[16] Jeffrey L. Elman,et al. Finding Structure in Time , 1990, Cogn. Sci..
[17] Richard Socher,et al. Quasi-Recurrent Neural Networks , 2016, ICLR.
[18] Moustapha Cissé,et al. Efficient softmax approximation for GPUs , 2016, ICML.
[19] Alex Graves,et al. Supervised Sequence Labelling with Recurrent Neural Networks , 2012, Studies in Computational Intelligence.
[20] Trevor Darrell,et al. Sequence to Sequence -- Video to Text , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[21] Vladlen Koltun,et al. An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling , 2018, ArXiv.
[22] Chris Dyer,et al. On the State of the Art of Evaluation in Neural Language Models , 2017, ICLR.
[23] Angelika Steger,et al. Fast-Slow Recurrent Neural Networks , 2017, NIPS.
[24] Quoc V. Le,et al. Neural Architecture Search with Reinforcement Learning , 2016, ICLR.
[25] Samy Bengio,et al. Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[26] Zoubin Ghahramani,et al. A Theoretically Grounded Application of Dropout in Recurrent Neural Networks , 2015, NIPS.
[27] Vladlen Koltun,et al. Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.
[28] Yiming Yang,et al. DARTS: Differentiable Architecture Search , 2018, ICLR.
[29] Beatrice Santorini,et al. Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.
[30] Thomas Pock,et al. A Primal Dual Network for Low-Level Vision Problems , 2017, GCPR.
[31] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[32] Jürgen Schmidhuber,et al. Recurrent Highway Networks , 2016, ICML.
[33] Dit-Yan Yeung,et al. Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting , 2015, NIPS.
[34] Heiga Zen,et al. WaveNet: A Generative Model for Raw Audio , 2016, SSW.
[35] Paul J. Werbos,et al. Backpropagation Through Time: What It Does and How to Do It , 1990, Proc. IEEE.
[36] Geoffrey E. Hinton,et al. Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..
[37] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[38] Yann Dauphin,et al. Language Modeling with Gated Convolutional Networks , 2016, ICML.
[39] Jason Weston,et al. Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..
[40] Alex Graves,et al. Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.
[41] Richard Socher,et al. Pointer Sentinel Mixture Models , 2016, ICLR.
[42] Yann Dauphin,et al. Convolutional Sequence to Sequence Learning , 2017, ICML.
[43] Razvan Pascanu,et al. Relational recurrent neural networks , 2018, NeurIPS.
[44] Richard Socher,et al. Regularizing and Optimizing LSTM Language Models , 2017, ICLR.
[45] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009 .
[46] Shuai Li,et al. Independently Recurrent Neural Network (IndRNN): Building A Longer and Deeper RNN , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[47] Alex Graves,et al. Neural Machine Translation in Linear Time , 2016, ArXiv.
[48] Quoc V. Le,et al. Efficient Neural Architecture Search via Parameter Sharing , 2018, ICML.
[49] Nicolas Usunier,et al. Improving Neural Language Models with a Continuous Cache , 2016, ICLR.
[50] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.
[51] Richard Socher,et al. An Analysis of Neural Language Modeling at Multiple Scales , 2018, ArXiv.
[52] Jürgen Schmidhuber,et al. LSTM: A Search Space Odyssey , 2015, IEEE Transactions on Neural Networks and Learning Systems.
[53] Yiming Yang,et al. Transformer-XL: Attentive Language Models beyond a Fixed-Length Context , 2019, ACL.
[54] Wojciech Zaremba,et al. An Empirical Exploration of Recurrent Network Architectures , 2015, ICML.
[55] Kyunghyun Cho,et al. Gated Word-Character Recurrent Language Model , 2016, EMNLP.
[56] Daniel Jurafsky,et al. Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context , 2018, ACL.