Theory IIIb: Generalization in Deep Networks
暂无分享,去创建一个
T. Poggio | X. Boix | B. Miranda | Q. Liao | J. Hidary | Andrzej Banburski
[1] Tomaso A. Poggio,et al. Fisher-Rao Metric, Geometry, and Complexity of Neural Networks , 2017, AISTATS.
[2] Nathan Srebro,et al. Implicit Bias of Gradient Descent on Linear Convolutional Networks , 2018, NeurIPS.
[3] Mikhail Belkin,et al. To understand deep learning we need to understand kernel learning , 2018, ICML.
[4] Tomaso A. Poggio,et al. Theory of Deep Learning IIb: Optimization Properties of SGD , 2018, ArXiv.
[5] Lorenzo Rosasco,et al. Theory of Deep Learning III: explaining the non-overfitting puzzle , 2017, ArXiv.
[6] Nathan Srebro,et al. The Implicit Bias of Gradient Descent on Separable Data , 2017, J. Mach. Learn. Res..
[7] Guillermo Sapiro,et al. Robust Large Margin Deep Neural Networks , 2017, IEEE Transactions on Signal Processing.
[8] Nathan Srebro,et al. Exploring Generalization in Deep Learning , 2017, NIPS.
[9] Matus Telgarsky,et al. Spectrally-normalized margin bounds for neural networks , 2017, NIPS.
[10] Noah Golowich,et al. Musings on Deep Learning: Properties of SGD , 2017 .
[11] Tomaso A. Poggio,et al. Theory II: Landscape of the Empirical Risk in Deep Learning , 2017, ArXiv.
[12] Samy Bengio,et al. Understanding deep learning requires rethinking generalization , 2016, ICLR.
[13] Lorenzo Rosasco,et al. Why and when can deep-but not shallow-networks avoid the curse of dimensionality: A review , 2016, International Journal of Automation and Computing.
[14] T. Poggio,et al. Deep vs. shallow networks : An approximation theory perspective , 2016, ArXiv.
[15] Tim Salimans,et al. Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks , 2016, NIPS.
[16] Yoram Singer,et al. Train faster, generalize better: Stability of stochastic gradient descent , 2015, ICML.
[17] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009 .
[18] Y. Yao,et al. On Early Stopping in Gradient Descent Learning , 2007 .
[19] 김희라. Waiting for Godot에 나타난 희망의 구조 , 2003 .
[20] B. Aulbach,et al. The Hartman-Grobman theorem for Carathéodory-type differential equations in Banach spaces , 2000 .
[21] Peter L. Bartlett,et al. Neural Network Learning - Theoretical Foundations , 1999 .
[22] Aleksej F. Filippov,et al. Differential Equations with Discontinuous Righthand Sides , 1988, Mathematics and Its Applications.
[23] G. P. Szegö,et al. Stability theory of dynamical systems , 1970 .