Training Very Deep Networks
暂无分享,去创建一个
Jürgen Schmidhuber | Klaus Greff | Rupesh Kumar Srivastava | J. Schmidhuber | R. Srivastava | Klaus Greff
[1] J. Håstad. Computational limitations of small-depth circuits , 1987 .
[2] Sepp Hochreiter,et al. Untersuchungen zu dynamischen neuronalen Netzen , 1991 .
[3] Jürgen Schmidhuber,et al. Learning Complex, Extended Sequences Using the Principle of History Compression , 1992, Neural Computation.
[4] Corso Elvezia. Bridging Long Time Lags by Weight Guessing and \long Short Term Memory" , 1996 .
[5] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[6] Jürgen Schmidhuber,et al. Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.
[7] Johan Håstad,et al. On the power of small-depth threshold circuits , 1991, computational complexity.
[8] Yee Whye Teh,et al. A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.
[9] Yoshua Bengio,et al. Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.
[10] Luca Maria Gambardella,et al. Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence Flexible, High Performance Convolutional Neural Networks for Image Classification , 2022 .
[11] Jürgen Schmidhuber,et al. Multi-column deep neural networks for image classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[12] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[13] Tapani Raiko,et al. Deep Learning Made Easier by Linear Transformations in Perceptrons , 2012, AISTATS.
[14] Ilya Sutskever,et al. Training Deep and Recurrent Networks with Hessian-Free Optimization , 2012, Neural Networks: Tricks of the Trade.
[15] Jinyu Li,et al. Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks. , 2013, ICLR 2013.
[16] Alex Graves,et al. Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.
[17] Geoffrey E. Hinton,et al. On the importance of initialization and momentum in deep learning , 2013, ICML.
[18] Yoshua Bengio,et al. Maxout Networks , 2013, ICML.
[19] Jürgen Schmidhuber,et al. Compete to Compute , 2013, NIPS.
[20] Franco Scarselli,et al. On the Complexity of Neural Network Classifiers: A Comparison Between Shallow and Deep Architectures , 2014, IEEE Transactions on Neural Networks and Learning Systems.
[21] James Martens,et al. On the Expressive Efficiency of Sum Product Networks , 2014, ArXiv.
[22] Qiang Chen,et al. Network In Network , 2013, ICLR.
[23] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.
[24] Jürgen Schmidhuber,et al. Deep Networks with Internal Selective Attention through Feedback Connections , 2014, NIPS.
[25] Surya Ganguli,et al. Identifying and attacking the saddle point problem in high-dimensional non-convex optimization , 2014, NIPS.
[26] Surya Ganguli,et al. Exact solutions to the nonlinear dynamics of learning in deep linear neural networks , 2013, ICLR.
[27] Razvan Pascanu,et al. On the Number of Linear Regions of Deep Neural Networks , 2014, NIPS.
[28] Benjamin Graham,et al. Spatially-sparse convolutional neural networks , 2014, ArXiv.
[29] L. F. Abbott,et al. Random Walk Initialization for Training Very Deep Feedforward Networks , 2014, 1412.6558.
[30] Thomas Brox,et al. Striving for Simplicity: The All Convolutional Net , 2014, ICLR.
[31] Jürgen Schmidhuber,et al. Binding via Reconstruction Clustering , 2015, ArXiv.
[32] Jürgen Schmidhuber,et al. Understanding Locally Competitive Networks , 2014, ICLR.
[33] Yoshua Bengio,et al. FitNets: Hints for Thin Deep Nets , 2014, ICLR.
[34] Jian Sun,et al. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[35] Jürgen Schmidhuber,et al. Highway Networks , 2015, ArXiv.
[36] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[37] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[38] Zhuowen Tu,et al. Deeply-Supervised Nets , 2014, AISTATS.
[39] Alex Graves,et al. Grid Long Short-Term Memory , 2015, ICLR.