暂无分享,去创建一个
[1] D. Widder. The heat equation , 1975 .
[2] D. Ackley. A connectionist machine for genetic hillclimbing , 1987 .
[3] L. Vese. A method to convexify functions via curve evolution , 1999 .
[4] Yoshua Bengio,et al. Greedy Layer-Wise Training of Deep Networks , 2006, NIPS.
[5] Jason Weston,et al. Curriculum learning , 2009, ICML '09.
[6] Ilya Sutskever,et al. Learning Recurrent Neural Networks with Hessian-Free Optimization , 2011, ICML.
[7] Nitish Srivastava,et al. Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.
[8] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.
[9] Martin J. Wainwright,et al. Randomized Smoothing for Stochastic Optimization , 2011, SIAM J. Optim..
[10] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[11] Hossein Mobahi,et al. Optimization by Gaussian smoothing with application to geometric alignment , 2012 .
[12] Geoffrey E. Hinton,et al. On the importance of initialization and momentum in deep learning , 2013, ICML.
[13] Philip Bachman,et al. Learning with Pseudo-Ensembles , 2014, NIPS.
[14] Hossein Mobahi,et al. On the Link between Gaussian Homotopy Continuation and Convex Envelopes , 2015, EMMCVPR.
[15] Surya Ganguli,et al. On the saddle point problem for non-convex optimization , 2014, ArXiv.
[16] Surya Ganguli,et al. Identifying and attacking the saddle point problem in high-dimensional non-convex optimization , 2014, NIPS.
[17] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.
[18] Yoshua Bengio,et al. Marginalized Denoising Auto-encoders for Nonlinear Representations , 2014, ICML.
[19] Hui Jiang,et al. Annealed Gradient Descent for Deep Learning , 2015, UAI.
[20] Surya Ganguli,et al. Deep Unsupervised Learning using Nonequilibrium Thermodynamics , 2015, ICML.
[21] Quoc V. Le,et al. Adding Gradient Noise Improves Learning for Very Deep Networks , 2015, ArXiv.
[22] Anima Anandkumar,et al. Generalization Bounds for Neural Networks through Tensor Factorization , 2015, ArXiv.
[23] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[24] Yuchen Zhang,et al. L1-regularized Neural Networks are Improperly Learnable in Polynomial Time , 2015, ICML.
[25] Yoram Singer,et al. Train faster, generalize better: Stability of stochastic gradient descent , 2015, ICML.
[26] Ohad Shamir,et al. On the Quality of the Initial Basin in Overspecified Neural Networks , 2015, ICML.
[27] H. Mobahi. Closed Form for Some Gaussian Convolutions , 2016, 1602.05610.
[28] Shai Shalev-Shwartz,et al. On Graduated Optimization for Stochastic Non-Convex Problems , 2015, ICML.