Parallel Stochastic Newton Method

We propose a parallel stochastic Newton method (PSN) for minimizing unconstrained smooth convex functions. We analyze the method in the strongly convex case, and give conditions under which acceleration can be expected when compared to its serial counterpart. We show how PSN can be applied to the empirical risk minimization problem, and demonstrate the practical efficiency of the method through numerical experiments and models of simple matrix classes.
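
As a rough illustration of the kind of update such a method performs, the following minimal Python sketch applies a block-wise stochastic Newton step to L2-regularized logistic regression, a standard instance of empirical risk minimization. This is not the paper's exact PSN algorithm: the choice of loss, the coordinate-block sampling scheme, the averaging of block updates, and all names (psn_like_step, tau, block_size) are illustrative assumptions, and the tau "workers" are simulated sequentially rather than run in parallel.

    import numpy as np

    def grad_and_hess(w, A, y, lam):
        # Gradient and Hessian of
        # f(w) = (1/n) sum_i log(1 + exp(-y_i a_i^T w)) + (lam/2) ||w||^2.
        n = len(y)
        p = 1.0 / (1.0 + np.exp(-y * (A @ w)))            # sigma(y_i a_i^T w)
        g = A.T @ (-(1.0 - p) * y) / n + lam * w
        H = (A.T * (p * (1.0 - p))) @ A / n + lam * np.eye(A.shape[1])
        return g, H

    def psn_like_step(w, A, y, lam, tau=4, block_size=5, rng=None):
        # One iteration: tau simulated workers each take a Newton step
        # restricted to a random block of coordinates, using the
        # corresponding block of the Hessian; the updates are averaged.
        rng = np.random.default_rng() if rng is None else rng
        g, H = grad_and_hess(w, A, y, lam)
        step = np.zeros_like(w)
        for _ in range(tau):
            S = rng.choice(len(w), size=block_size, replace=False)
            step[S] += np.linalg.solve(H[np.ix_(S, S)], g[S])  # block Newton direction
        return w - step / tau

    # Toy usage on synthetic data (hypothetical problem sizes).
    rng = np.random.default_rng(0)
    A = rng.standard_normal((200, 20))
    y = np.sign(A @ rng.standard_normal(20) + 0.1 * rng.standard_normal(200))
    w = np.zeros(20)
    for _ in range(50):
        w = psn_like_step(w, A, y, lam=0.1, rng=rng)

Averaging the tau block updates is a conservative choice made here only so the toy iteration converges; the paper's analysis determines how aggressively parallel updates can actually be combined.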
