Generalization Properties and Implicit Regularization for Multiple Passes SGM
暂无分享,去创建一个
[1] Dimitri P. Bertsekas,et al. Nonlinear Programming , 1997 .
[2] Peter L. Bartlett,et al. Rademacher and Gaussian Complexities: Risk Bounds and Structural Results , 2003, J. Mach. Learn. Res..
[3] Ron Meir,et al. Generalization Error Bounds for Bayesian Mixture Algorithms , 2003, J. Mach. Learn. Res..
[4] P. Bartlett,et al. Local Rademacher complexities , 2005, math/0508275.
[5] Stephen P. Boyd,et al. Stochastic Subgradient Methods , 2007 .
[6] Léon Bottou,et al. The Tradeoffs of Large Scale Learning , 2007, NIPS.
[7] Stephen P. Boyd,et al. Subgradient Methods , 2007 .
[8] Felipe Cucker,et al. Learning Theory: An Approximation Theory Viewpoint (Cambridge Monographs on Applied & Computational Mathematics) , 2007 .
[9] Andreas Christmann,et al. Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.
[10] Massimiliano Pontil,et al. Online Gradient Descent Learning Algorithms , 2008, Found. Comput. Math..
[11] Alexander Shapiro,et al. Stochastic Approximation approach to Stochastic Programming , 2013 .
[12] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.
[13] Ohad Shamir,et al. Stochastic Gradient Descent for Non-smooth Optimization: Convergence Results and Optimal Averaging Schemes , 2012, ICML.
[14] Francesco Orabona,et al. Simultaneous Model Selection and Optimization through Parameter-free Stochastic Learning , 2014, NIPS.
[15] Yuan Yao,et al. Online Learning as Stochastic Approximation of Regularization Paths: Optimality and Almost-Sure Convergence , 2011, IEEE Transactions on Information Theory.
[16] F. Bach,et al. Non-parametric Stochastic Approximation with Large Step sizes , 2014, 1408.0361.
[17] Lorenzo Rosasco,et al. Learning with Incremental Iterative Regularization , 2014, NIPS.
[18] Dimitri P. Bertsekas,et al. Incremental Gradient, Subgradient, and Proximal Methods for Convex Optimization: A Survey , 2015, ArXiv.
[19] Yoram Singer,et al. Train faster, generalize better: Stability of stochastic gradient descent , 2015, ICML.
[20] Lorenzo Rosasco,et al. Iterative Regularization for Learning with Convex Loss Functions , 2015, J. Mach. Learn. Res..
[21] Mark W. Schmidt,et al. Minimizing finite sums with the stochastic average gradient , 2013, Mathematical Programming.