暂无分享,去创建一个
Erich Elsen | Alex Graves | Simon Osindero | Karen Simonyan | Jacob Menick | Utku Evci | Simon Osindero | K. Simonyan | Jacob Menick | Erich Elsen | Utku Evci | Alex Graves
[1] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[2] Ronald J. Williams,et al. A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.
[3] Alex Graves,et al. Memory-Efficient Backpropagation Through Time , 2016, NIPS.
[4] Erich Elsen,et al. Rigging the Lottery: Making All Tickets Winners , 2020, ICML.
[5] Yann Ollivier,et al. Unbiased Online Recurrent Optimization , 2017, ICLR.
[6] Andreas Griewank,et al. Algorithm 799: revolve: an implementation of checkpointing for the reverse or adjoint mode of computational differentiation , 2000, TOMS.
[7] Yiming Yang,et al. Transformer-XL: Attentive Language Models beyond a Fixed-Length Context , 2019, ACL.
[8] Alex Graves,et al. Scaling Memory-Augmented Neural Networks with Sparse Reads and Writes , 2016, NIPS.
[9] Erich Elsen,et al. Efficient Neural Audio Synthesis , 2018, ICML.
[10] Kevin Gimpel,et al. ALBERT: A Lite BERT for Self-supervised Learning of Language Representations , 2019, ICLR.
[11] Erich Elsen,et al. Fast Sparse ConvNets , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[12] Suyog Gupta,et al. To prune, or not to prune: exploring the efficacy of pruning for model compression , 2017, ICLR.
[13] Richard Socher,et al. Pointer Sentinel Mixture Models , 2016, ICLR.
[14] James Martens,et al. On the Variance of Unbiased Online Recurrent Optimization , 2019, ArXiv.
[15] Shane Legg,et al. IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures , 2018, ICML.
[16] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[17] Jing Peng,et al. An Efficient Gradient-Based Algorithm for On-Line Training of Recurrent Network Trajectories , 1990, Neural Computation.
[18] Sergio Gomez Colmenarejo,et al. Hybrid computing using a neural network with dynamic external memory , 2016, Nature.
[19] Angelika Steger,et al. Approximating Real-Time Recurrent Learning with Random Kronecker Factors , 2018, NeurIPS.
[20] Michael Carbin,et al. The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks , 2018, ICLR.
[21] Ankur Bapna,et al. The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation , 2018, ACL.
[22] Angelika Steger,et al. Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning , 2019, ICML.
[23] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[24] Chong Wang,et al. Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin , 2015, ICML.
[25] Erich Elsen,et al. Exploring Sparsity in Recurrent Neural Networks , 2017, ICLR.
[26] Nikko Ström,et al. Sparse connection and pruning in large dynamic artificial neural networks , 1997, EUROSPEECH.
[27] James M Murray,et al. Local online learning in recurrent networks with random feedback , 2018, bioRxiv.