Variance Reduced Stochastic Gradient Descent with Neighbors
Aurélien Lucchi | Thomas Hofmann | Brian McWilliams | Simon Lacoste-Julien
[1] Mark W. Schmidt, et al. Minimizing finite sums with the stochastic average gradient, 2013, Mathematical Programming.
[2] Sanjoy Dasgupta, et al. Randomized Partition Trees for Nearest Neighbor Search, 2014, Algorithmica.
[3] Yoram Singer, et al. Pegasos: primal estimated sub-gradient solver for SVM, 2011, Math. Program.
[4] H. Robbins. A Stochastic Approximation Method, 1951.
[5] Léon Bottou, et al. Large-Scale Machine Learning with Stochastic Gradient Descent, 2010, COMPSTAT.
[6] Francis Bach, et al. SAGA: A Fast Incremental Gradient Method With Support for Non-Strongly Convex Composite Objectives, 2014, NIPS.
[7] Mark W. Schmidt. Convergence rate of stochastic gradient with constant step size, 2014.
[8] Alexandr Andoni, et al. Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions, 2006, 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).
[9] Tong Zhang, et al. Accelerating Stochastic Gradient Descent using Predictive Variance Reduction, 2013, NIPS.
[10] Andreas Krause, et al. Scalable Training of Mixture Models via Coresets, 2011, NIPS.
[11] Peter Richtárik, et al. Semi-Stochastic Gradient Descent Methods, 2013, Front. Appl. Math. Stat.
[12] Shai Shalev-Shwartz, et al. Stochastic dual coordinate ascent methods for regularized loss minimization, 2012, J. Mach. Learn. Res.
[13] Alexandr Andoni, et al. Optimal Data-Dependent Hashing for Approximate Near Neighbors, 2015, STOC.