FLAG n' FLARE: Fast Linearly-Coupled Adaptive Gradient Methods
暂无分享,去创建一个
Peter L. Bartlett | Michael W. Mahoney | Fred Roosta | Xiang Cheng | Stefan Palombo | P. Bartlett | Xiang Cheng | Fred Roosta | Stefan Palombo
[1] Dimitri P. Bertsekas,et al. Convex Optimization Algorithms , 2015 .
[2] Marc'Aurelio Ranzato,et al. Large Scale Distributed Deep Networks , 2012, NIPS.
[3] R. Tibshirani,et al. Sparse inverse covariance estimation with the graphical lasso. , 2008, Biostatistics.
[4] Elad Hazan,et al. Introduction to Online Convex Optimization , 2016, Found. Trends Optim..
[5] Yurii Nesterov,et al. Smooth minimization of non-smooth functions , 2005, Math. Program..
[6] Davide Anguita,et al. A Public Domain Dataset for Human Activity Recognition using Smartphones , 2013, ESANN.
[7] Y. Nesterov. A method for solving the convex programming problem with convergence rate O(1/k^2) , 1983 .
[8] John Darzentas,et al. Problem Complexity and Method Efficiency in Optimization , 1983 .
[9] José M. Bioucas-Dias,et al. A New TwIST: Two-Step Iterative Shrinkage/Thresholding Algorithms for Image Restoration , 2007, IEEE Transactions on Image Processing.
[10] Y. Nesterov. Gradient methods for minimizing composite objective function , 2007 .
[11] Yurii Nesterov,et al. Rounding of convex sets and efficient gradient methods for linear programming problems , 2004, Optim. Methods Softw..
[12] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.
[13] Yonina C. Eldar,et al. Convex Optimization in Signal Processing and Communications , 2009 .
[14] Yoshua Bengio,et al. Equilibrated adaptive learning rates for non-convex optimization , 2015, NIPS.
[15] Stephen P. Boyd,et al. A Differential Equation for Modeling Nesterov's Accelerated Gradient Method: Theory and Insights , 2014, J. Mach. Learn. Res..
[16] Uri M. Ascher,et al. Stochastic Algorithms for Inverse Problems Involving PDEs and many Measurements , 2014, SIAM J. Sci. Comput..
[17] Yurii Nesterov,et al. Lectures on Convex Optimization , 2018 .
[18] Trevor Hastie,et al. Statistical Learning with Sparsity: The Lasso and Generalizations , 2015 .
[19] Michael Elad,et al. Coordinate and subspace optimization methods for linear least squares with non-quadratic regularization , 2007 .
[20] Uri M. Ascher,et al. Assessing stochastic algorithms for large scale nonlinear least squares problems using extremal probabilities of linear combinations of gamma random variables , 2014, SIAM/ASA J. Uncertain. Quantification.
[21] Matthew D. Zeiler. ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.
[22] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[23] Zeyuan Allen Zhu,et al. Linear Coupling: An Ultimate Unification of Gradient and Mirror Descent , 2014, ITCS.
[24] Marc Teboulle,et al. A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems , 2009, SIAM J. Imaging Sci..
[25] Andre Wibisono,et al. A variational perspective on accelerated methods in optimization , 2016, Proceedings of the National Academy of Sciences.
[26] K. Bredies,et al. Linear Convergence of Iterative Soft-Thresholding , 2007, 0709.1598.
[27] Adil M. Bagirov,et al. Introduction to Nonsmooth Optimization: Theory, Practice and Software , 2014 .
[28] Jorge Nocedal,et al. Optimization Methods for Large-Scale Machine Learning , 2016, SIAM Rev..
[29] Steve R. Gunn,et al. Result Analysis of the NIPS 2003 Feature Selection Challenge , 2004, NIPS.
[30] Dinesh Kumar,et al. Comment Volume Prediction Using Neural Networks and Decision Trees , 2015 .
[31] R. Tibshirani. Regression Shrinkage and Selection via the Lasso , 1996 .
[32] Yoram Singer,et al. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..
[33] Peter Richtárik,et al. Smooth minimization of nonsmooth functions with parallel coordinate descent methods , 2013, Modeling and Optimization: Theory and Applications.
[34] Brian Kulis,et al. Metric Learning: A Survey , 2013, Found. Trends Mach. Learn..
[35] Yurii Nesterov,et al. Interior-point polynomial algorithms in convex programming , 1994, Siam studies in applied mathematics.
[36] Jean-Yves Audibert. Optimization for Machine Learning , 1995 .
[37] Benjamin Recht,et al. Analysis and Design of Optimization Algorithms via Integral Quadratic Constraints , 2014, SIAM J. Optim..
[38] Ken Lang,et al. NewsWeeder: Learning to Filter Netnews , 1995, ICML.
[39] Krisztian Buza,et al. Feedback Prediction for Blogs , 2012, GfKl.
[40] I. Daubechies,et al. An iterative thresholding algorithm for linear inverse problems with a sparsity constraint , 2003, math/0307152.
[41] Patrick L. Combettes,et al. Signal Recovery by Proximal Forward-Backward Splitting , 2005, Multiscale Model. Simul..
[42] Sébastien Bubeck,et al. Convex Optimization: Algorithms and Complexity , 2014, Found. Trends Mach. Learn..
[43] Francis R. Bach,et al. From Averaging to Acceleration, There is Only a Step-size , 2015, COLT.
[44] Alexandre M. Bayen,et al. Accelerated Mirror Descent in Continuous and Discrete Time , 2015, NIPS.
[45] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.
[46] Mohit Singh,et al. A geometric alternative to Nesterov's accelerated gradient descent , 2015, ArXiv.