Fast-Slow Recurrent Neural Networks
暂无分享,去创建一个
[1] Tomasz Kornuta,et al. Surprisal-Driven Zoneout , 2016, 1610.07675.
[2] Ilya Sutskever,et al. Learning Recurrent Neural Networks with Hessian-Free Optimization , 2011, ICML.
[3] Herbert Jaeger,et al. Discovering multiscale dynamical features with hierarchical Echo State Networks , 2008 .
[4] Jürgen Schmidhuber,et al. Learning Complex, Extended Sequences Using the Principle of History Compression , 1992, Neural Computation.
[5] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[6] Razvan Pascanu,et al. How to Construct Deep Recurrent Neural Networks , 2013, ICLR.
[7] Wojciech Zaremba,et al. Recurrent Neural Network Regularization , 2014, ArXiv.
[8] Yoshua. Bengio,et al. Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..
[9] Yoshua Bengio,et al. End-to-end attention-based large vocabulary speech recognition , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[10] Peter Tiño,et al. Learning long-term dependencies in NARX recurrent neural networks , 1996, IEEE Trans. Neural Networks.
[11] Jürgen Schmidhuber,et al. Recurrent Highway Networks , 2016, ICML.
[12] Beatrice Santorini,et al. Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.
[13] Yoshua Bengio,et al. Gated Feedback Recurrent Neural Networks , 2015, ICML.
[14] Alex Graves,et al. Neural Turing Machines , 2014, ArXiv.
[15] Alex Graves,et al. Neural Machine Translation in Linear Time , 2016, ArXiv.
[16] Jeffrey L. Elman,et al. Finding Structure in Time , 1990, Cogn. Sci..
[17] Quoc V. Le,et al. Neural Architecture Search with Reinforcement Learning , 2016, ICLR.
[18] Geoffrey E. Hinton,et al. Learning representations by back-propagating errors , 1986, Nature.
[19] Yoshua Bengio,et al. Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.
[20] Yoshua Bengio,et al. Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations , 2016, ICLR.
[21] Jakob Grue Simonsen,et al. A Hierarchical Recurrent Encoder-Decoder for Generative Context-Aware Query Suggestion , 2015, CIKM.
[22] Yoshua Bengio,et al. Hierarchical Multiscale Recurrent Neural Networks , 2016, ICLR.
[23] Jürgen Schmidhuber,et al. A Clockwork RNN , 2014, ICML.
[24] Yoshua Bengio,et al. Hierarchical Recurrent Neural Networks for Long-Term Dependencies , 1995, NIPS.
[25] Yann LeCun,et al. Tunable Efficient Unitary Neural Networks (EUNN) and their application to RNNs , 2016, ICML.
[26] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[27] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..
[28] Alex Graves,et al. Adaptive Computation Time for Recurrent Neural Networks , 2016, ArXiv.
[29] Ilya Sutskever,et al. SUBWORD LANGUAGE MODELING WITH NEURAL NETWORKS , 2011 .
[30] PAUL J. WERBOS,et al. Generalization of backpropagation with application to a recurrent gas market model , 1988, Neural Networks.
[31] Sepp Hochreiter,et al. The Vanishing Gradient Problem During Learning Recurrent Neural Nets and Problem Solutions , 1998, Int. J. Uncertain. Fuzziness Knowl. Based Syst..
[32] Alex Graves,et al. Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.
[33] Nassir Navab,et al. Revisiting NARX Recurrent Neural Networks for Long-Term Dependencies , 2017, ArXiv.
[34] Sergio Gomez Colmenarejo,et al. Hybrid computing using a neural network with dynamic external memory , 2016, Nature.