[2] Jürgen Schmidhuber. Neural Sequence Chunkers , 1991 .
[3] Michael C. Mozer,et al. Induction of Multiscale Temporal Structure , 1991, NIPS.
[4] Jürgen Schmidhuber,et al. Learning Complex, Extended Sequences Using the Principle of History Compression , 1992, Neural Computation.
[5] Beatrice Santorini,et al. Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.
[6] C. Bishop. Mixture density networks , 1994 .
[7] Yoshua Bengio,et al. Hierarchical Recurrent Neural Networks for Long-Term Dependencies , 1995, NIPS.
[8] Peter Tiño,et al. Learning long-term dependencies in NARX recurrent neural networks , 1996, IEEE Trans. Neural Networks.
[9] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[10] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[11] Jon M. Kleinberg,et al. Bursty and Hierarchical Structure in Streams , 2002, Data Mining and Knowledge Discovery.
[12] Marcus Liwicki,et al. IAM-OnDB - an on-line English sentence database acquired from handwritten text on a whiteboard , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).
[13] Matthew V. Mahoney,et al. Adaptive weighing of context models for lossless data compression , 2005 .
[14] Jürgen Schmidhuber,et al. Sequence Labelling in Structured Domains with Hierarchical Recurrent Neural Networks , 2007, IJCAI.
[15] Jürgen Schmidhuber,et al. Unconstrained On-line Handwriting Recognition with Recurrent Neural Networks , 2007, NIPS.
[16] Yoshua Bengio,et al. Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..
[17] Lukás Burget,et al. Recurrent neural network based language model , 2010, INTERSPEECH.
[18] Geoffrey E. Hinton,et al. Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.
[19] Ilya Sutskever,et al. Subword Language Modeling with Neural Networks , 2011 .
[20] Geoffrey E. Hinton,et al. Generating Text with Recurrent Neural Networks , 2011, ICML.
[21] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[22] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[23] Maneesh Sahani,et al. Regularization and nonlinearities for neural language models: when are they needed? , 2013, ArXiv.
[24] Yoshua Bengio,et al. Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation , 2013, ArXiv.
[25] Razvan Pascanu,et al. On the difficulty of training recurrent neural networks , 2012, ICML.
[26] Alex Graves,et al. Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.
[27] Karol Gregor,et al. Neural Variational Inference and Learning in Belief Networks , 2014, ICML.
[28] Yoshua Bengio,et al. How transferable are features in deep neural networks? , 2014, NIPS.
[29] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[30] Yoon Kim,et al. Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.
[31] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..
[32] Jürgen Schmidhuber,et al. A Clockwork RNN , 2014, ICML.
[33] Parul Parashar,et al. Neural Networks in Machine Learning , 2014 .
[34] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[35] Trevor Darrell,et al. One-Shot Adaptation of Supervised Deep Convolutional Models , 2013, ICLR.
[36] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.
[37] Yoshua Bengio,et al. A Recurrent Latent Variable Model for Sequential Data , 2015, NIPS.
[38] Jürgen Schmidhuber,et al. Deep learning in neural networks: An overview , 2014, Neural Networks.
[39] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[40] Yoshua Bengio,et al. BinaryConnect: Training Deep Neural Networks with binary weights during propagations , 2015, NIPS.
[41] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[42] Jakob Grue Simonsen,et al. A Hierarchical Recurrent Encoder-Decoder for Generative Context-Aware Query Suggestion , 2015, CIKM.
[43] Yoshua Bengio,et al. Gated Feedback Recurrent Neural Networks , 2015, ICML.
[44] Samy Bengio,et al. Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[45] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[46] Ying Zhang,et al. On Multiplicative Integration with Recurrent Neural Networks , 2016, NIPS.
[47] Quoc V. Le,et al. Listen, attend and spell: A neural network for large vocabulary conversational speech recognition , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[48] Alex Graves,et al. Strategic Attentive Writer for Learning Macro-Actions , 2016, NIPS.
[49] José A. R. Fonollosa,et al. Character-based Neural Machine Translation , 2016, ACL.
[50] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..
[51] Kilian Q. Weinberger,et al. Deep Networks with Stochastic Depth , 2016, ECCV.
[52] Alex Graves,et al. Grid Long Short-Term Memory , 2015, ICLR.
[53] John Salvatier,et al. Theano: A Python framework for fast computation of mathematical expressions , 2016, ArXiv.
[54] Noah A. Smith,et al. Segmental Recurrent Neural Networks , 2015, ICLR.
[55] Ran El-Yaniv,et al. Binarized Neural Networks , 2016, NIPS.
[56] Kamil M Rocki,et al. Recurrent Memory Array Structures , 2016, ArXiv.
[57] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[58] Yoshua Bengio,et al. End-to-end attention-based large vocabulary speech recognition , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[59] Alexander M. Rush,et al. Character-Aware Neural Language Models , 2015, AAAI.
[60] Geoffrey E. Hinton,et al. Layer Normalization , 2016, ArXiv.
[61] Yoshua Bengio,et al. A Character-level Decoder without Explicit Segmentation for Neural Machine Translation , 2016, ACL.
[62] Daan Wierstra,et al. One-shot Learning with Memory-Augmented Neural Networks , 2016, ArXiv.
[63] Kamil Rocki,et al. Surprisal-Driven Feedback in Recurrent Networks , 2016, ArXiv.
[64] Yoshua Bengio,et al. Architectural Complexity Measures of Recurrent Neural Networks , 2016, NIPS.
[65] Roland Memisevic,et al. Regularizing RNNs by Stabilizing Activations , 2015, ICLR.
[66] Jürgen Schmidhuber,et al. Recurrent Highway Networks , 2016, ICML.
[67] Aaron C. Courville,et al. Recurrent Batch Normalization , 2016, ICLR.
[68] Yoshua Bengio,et al. Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations , 2016, ICLR.
[69] Omer Levy,et al. Simulating Action Dynamics with Neural Process Networks , 2018, ICLR.