暂无分享,去创建一个
Surya Ganguli | Razvan Pascanu | Yoshua Bengio | Yann Dauphin | Yoshua Bengio | Razvan Pascanu | Yann Dauphin | S. Ganguli | Y. Dauphin
[1] E. Wigner. On the Distribution of the Roots of Certain Symmetric Matrices , 1958 .
[2] Kurt Hornik,et al. Neural networks and principal component analysis: Learning from examples without local minima , 1989, Neural Networks.
[3] Saad,et al. On-line learning in soft committee machines. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.
[4] Magnus Rattray,et al. Natural gradient descent for on-line learning , 1998 .
[5] D K Smith,et al. Numerical Optimization , 2001, J. Oper. Res. Soc..
[6] Hyeyoung Park,et al. On-Line Learning Theory of Soft Committee Machines with Correlated Hidden Units : Steepest Gradient Descent and Natural Gradient Descent , 2002, cond-mat/0212006.
[7] Christopher K. I. Williams,et al. Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning) , 2005 .
[8] Yan V Fyodorov,et al. Replica Symmetry Breaking Condition Exposed by Random Matrix Calculation of Landscape Complexity , 2007, cond-mat/0702601.
[9] A. Bray,et al. Statistics of critical points of Gaussian fields on large-dimensional spaces. , 2006, Physical review letters.
[10] Nicolas Le Roux,et al. Topmoumoute Online Natural Gradient Algorithm , 2007, NIPS.
[11] Eiji Mizutani,et al. An analysis on negative curvature induced by singularity in multi-layer neural-network learning , 2010, NIPS.
[12] James Martens,et al. Deep learning via Hessian-free optimization , 2010, ICML.
[13] J. Callahan. Advanced Calculus: A Geometric View , 2010 .
[14] W. Murray. Newton‐Type Methods , 2011 .
[15] Yoshua Bengio,et al. Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..
[16] Razvan Pascanu,et al. Theano: new features and speed improvements , 2012, ArXiv.
[17] James L. McClelland,et al. Learning hierarchical category structure in deep neural networks , 2013 .
[18] Surya Ganguli,et al. Exact solutions to the nonlinear dynamics of learning in deep linear neural networks , 2013, ICLR.
[19] Razvan Pascanu,et al. Revisiting Natural Gradient for Deep Networks , 2013, ICLR.