暂无分享,去创建一个
Yoshua Bengio | Hugo Larochelle | Amartya Sanyal | Bart van Merrienboer | Yoshua Bengio | H. Larochelle | Amartya Sanyal | B. V. Merrienboer
[1] J. Greenberg. A Quantitative Approach to the Morphological Typology of Language , 1960, International Journal of American Linguistics.
[2] Yoshua Bengio,et al. Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.
[3] Matthew V. Mahoney,et al. Text Compression as a Test for Artificial Intelligence , 1999, AAAI/IAAI.
[4] Matthew V. Mahoney,et al. Fast Text Compression with Neural Networks , 2000, FLAIRS Conference.
[5] Jitendra Malik,et al. Learning a classification model for segmentation , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.
[6] Sang Joon Kim,et al. A Mathematical Theory of Communication , 2006 .
[7] Jürgen Schmidhuber,et al. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks , 2006, ICML.
[8] Mirella Lapata,et al. Vector-based Models of Semantic Composition , 2008, ACL.
[9] Naonori Ueda,et al. Bayesian Unsupervised Word Segmentation with Nested Pitman-Yor Language Modeling , 2009, ACL.
[10] Lukás Burget,et al. Recurrent neural network based language model , 2010, INTERSPEECH.
[11] Ilya Sutskever,et al. SUBWORD LANGUAGE MODELING WITH NEURAL NETWORKS , 2011 .
[12] Geoffrey E. Hinton,et al. Generating Text with Recurrent Neural Networks , 2011, ICML.
[13] Vysoké Učení,et al. Statistical Language Models Based on Neural Networks , 2012 .
[14] Klaus-Robert Müller,et al. Efficient BackProp , 2012, Neural Networks: Tricks of the Trade.
[15] Razvan Pascanu,et al. On the difficulty of training recurrent neural networks , 2012, ICML.
[16] Alex Graves,et al. Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.
[17] Tomas Mikolov,et al. Alternative structures for character-level RNNs , 2015, ArXiv.
[18] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[19] Ying Zhang,et al. On Multiplicative Integration with Recurrent Neural Networks , 2016, NIPS.
[20] Rico Sennrich,et al. Neural Machine Translation of Rare Words with Subword Units , 2015, ACL.
[21] Alexander M. Rush,et al. Character-Aware Neural Language Models , 2015, AAAI.
[22] Geoffrey E. Hinton,et al. Layer Normalization , 2016, ArXiv.
[23] Liang Lu,et al. Top-down Tree Long Short-Term Memory Networks , 2015, NAACL.
[24] Yu Zhang,et al. Latent Sequence Decompositions , 2016, ICLR.
[25] Yoshua Bengio,et al. Hierarchical Multiscale Recurrent Neural Networks , 2016, ICLR.
[26] Xiangang Li,et al. Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling , 2017, ICML.
[27] Tomas Mikolov,et al. Enriching Word Vectors with Subword Information , 2016, TACL.