[1] Pierre Baldi,et al. Linear Learning: Landscapes and Algorithms , 1988, NIPS.
[2] Surya Ganguli,et al. Exact solutions to the nonlinear dynamics of learning in deep linear neural networks , 2013, ICLR.
[3] Ohad Shamir,et al. Distribution-Specific Hardness of Learning Neural Networks , 2016, J. Mach. Learn. Res.
[4] Kenji Kawaguchi,et al. Deep Learning without Poor Local Minima , 2016, NIPS.
[5] Yu Maruyama,et al. Global Continuous Optimization with Error Bound and Fast Convergence , 2016, J. Artif. Intell. Res.
[6] Tomaso Poggio,et al. Learning Functions: When Is Deep Better Than Shallow , 2016, arXiv:1603.00988.
[7] Razvan Pascanu,et al. Local minima in training of deep networks , 2017, ArXiv.
[8] Raman Arora,et al. Understanding Deep Neural Networks with Rectified Linear Units , 2016, Electron. Colloquium Comput. Complex.
[9] Tomaso A. Poggio,et al. Learning Real and Boolean Functions: When Is Deep Better Than Shallow , 2016, ArXiv.
[10] Yann LeCun,et al. The Loss Surfaces of Multilayer Networks , 2014, AISTATS.
[11] Ronald L. Rivest,et al. Training a 3-node neural network is NP-complete , 1988, COLT '88.
[12] Leslie Pack Kaelbling,et al. Bayesian Optimization with Exponential Convergence , 2015, NIPS.
[13] P. Wedin. Perturbation bounds in connection with singular value decomposition , 1972.
[14] Roi Livni,et al. On the Computational Efficiency of Training Neural Networks , 2014, NIPS.
[15] Paul C. Kainen,et al. Functionally Equivalent Feedforward Neural Networks , 1994, Neural Computation.
[16] René Vidal,et al. Global Optimality in Tensor Factorization, Deep Learning, and Beyond , 2015, ArXiv.
[17] Tengyu Ma,et al. Matrix Completion has No Spurious Local Minimum , 2016, NIPS.
[18] Lorenzo Rosasco,et al. Why and when can deep-but not shallow-networks avoid the curse of dimensionality: A review , 2016, International Journal of Automation and Computing.
[19] Razvan Pascanu,et al. Local minima in training of neural networks , 2016, arXiv:1611.06310.
[20] Bolei Zhou,et al. Optimization as Estimation with Gaussian Processes in Bandit Settings , 2015, AISTATS.
[21] Joan Bruna,et al. Topology and Geometry of Half-Rectified Network Optimization , 2016, ICLR.
[22] Elad Hoffer,et al. Exponentially vanishing sub-optimal local minima in multilayer neural networks , 2017, ICLR.
[23] Surya Ganguli,et al. Identifying and attacking the saddle point problem in high-dimensional non-convex optimization , 2014, NIPS.