Gated Orthogonal Recurrent Units: On Learning to Forget
暂无分享,去创建一个
Yoshua Bengio | Max Tegmark | Çaglar Gülçehre | Li Jing | Marin Soljacic | Yichen Shen | John Peurifoy | Yoshua Bengio | Çaglar Gülçehre | Li Jing | Yichen Shen | J. Peurifoy | Max Tegmark | M. Soljačić
[1] Gunnar Rätsch,et al. Learning Unitary Operators with Help From u(n) , 2016, AAAI.
[2] Les E. Atlas,et al. Full-Capacity Unitary Recurrent Neural Networks , 2016, NIPS.
[3] Jason Weston,et al. Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks , 2015, ICLR.
[4] Yoshua Bengio,et al. Attention-Based Models for Speech Recognition , 2015, NIPS.
[5] Yoshua Bengio,et al. Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.
[6] Razvan Pascanu,et al. On the difficulty of training recurrent neural networks , 2012, ICML.
[7] Fei-Fei Li,et al. Visualizing and Understanding Recurrent Networks , 2015, ArXiv.
[8] Jascha Sohl-Dickstein,et al. Intelligible Language Modeling with Input Switched Affine Networks , 2016, ArXiv.
[9] Geoffrey E. Hinton,et al. A Simple Way to Initialize Recurrent Networks of Rectified Linear Units , 2015, ArXiv.
[10] Aaron C. Courville,et al. Recurrent Batch Normalization , 2016, ICLR.
[11] F. Gers,et al. Long short-term memory in recurrent neural networks , 2001 .
[12] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[13] Quoc V. Le,et al. Listen, Attend and Spell , 2015, ArXiv.
[14] Yoshua Bengio,et al. On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.
[15] Beatrice Santorini,et al. Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.
[16] Yoshua Bengio,et al. Unitary Evolution Recurrent Neural Networks , 2015, ICML.
[17] James Bailey,et al. Efficient Orthogonal Parametrisation of Recurrent Neural Networks Using Householder Reflections , 2016, ICML.
[18] Jonathan G. Fiscus,et al. DARPA TIMIT:: acoustic-phonetic continuous speech corpus CD-ROM, NIST speech disc 1-1.1 , 1993 .
[19] Jürgen Schmidhuber,et al. Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.
[20] Yann LeCun,et al. Tunable Efficient Unitary Neural Networks (EUNN) and their application to RNNs , 2016, ICML.
[21] Sepp Hochreiter,et al. Untersuchungen zu dynamischen neuronalen Netzen , 1991 .
[22] Michael I. Jordan. Serial Order: A Parallel Distributed Processing Approach , 1997 .
[23] Yann LeCun,et al. Recurrent Orthogonal Networks and Long-Memory Tasks , 2016, ICML.
[24] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[25] Jascha Sohl-Dickstein,et al. Input Switched Affine Networks: An RNN Architecture Designed for Interpretability , 2016, ICML.
[26] Alex Graves,et al. Neural Turing Machines , 2014, ArXiv.
[27] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[28] Alex Graves,et al. Associative Long Short-Term Memory , 2016, ICML.
[29] Richard S. Zemel,et al. Gated Graph Sequence Neural Networks , 2015, ICLR.
[30] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.