Approximating Real-Time Recurrent Learning with Random Kronecker Factors
暂无分享,去创建一个
[1] Chris Dyer,et al. On the State of the Art of Evaluation in Neural Language Models , 2017, ICLR.
[2] Guillaume Charpiat,et al. Training recurrent networks online without backtracking , 2015, ArXiv.
[3] Alex Graves,et al. Decoupled Neural Interfaces using Synthetic Gradients , 2016, ICML.
[4] Ilya Sutskever,et al. SUBWORD LANGUAGE MODELING WITH NEURAL NETWORKS , 2011 .
[5] Yann Ollivier,et al. Unbiasing Truncated Backpropagation Through Time , 2017, ArXiv.
[6] P. Cochat,et al. Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.
[7] Yuan Yu,et al. TensorFlow: A system for large-scale machine learning , 2016, OSDI.
[8] Yoshua Bengio,et al. Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.
[9] Yann Ollivier,et al. Unbiased Online Recurrent Optimization , 2017, ICLR.
[10] Jürgen Schmidhuber,et al. Recurrent Highway Networks , 2016, ICML.
[11] Thierry Catfolis,et al. A method for improving the real-time recurrent learning algorithm , 1993, Neural Networks.
[12] Yoshua Bengio,et al. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.
[13] Herbert Jaeger,et al. Reservoir computing approaches to recurrent neural network training , 2009, Comput. Sci. Rev..
[14] Richard Socher,et al. An Analysis of Neural Language Modeling at Multiple Scales , 2018, ArXiv.
[15] Geoffrey E. Hinton,et al. Learning representations by back-propagating errors , 1986, Nature.
[16] Ronald J. Williams,et al. A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.
[17] Herbert Jaeger,et al. The''echo state''approach to analysing and training recurrent neural networks , 2001 .
[18] Beatrice Santorini,et al. Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.
[19] Jing Peng,et al. An Efficient Gradient-Based Algorithm for On-Line Training of Recurrent Network Trajectories , 1990, Neural Computation.
[20] Henry Markram,et al. Real-Time Computing Without Stable States: A New Framework for Neural Computation Based on Perturbations , 2002, Neural Computation.
[21] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[22] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.