A proof that Anderson acceleration increases the convergence rate in linearly converging fixed point methods (but not in quadratically converging ones)

This paper provides the first proof that Anderson acceleration (AA) increases the convergence rate of general fixed point iterations. AA has been used for decades to speed up nonlinear solvers in many applications, however a rigorous mathematical justification of the improved convergence rate remained lacking. The key ideas of the proof are relating errors with residuals, using results arising from the optimization, and explicitly defining the gain in the optimization stage to be the ratio of improvement over a step of the unaccelerated fixed point iteration. The main result we prove is that AA improves a the convergence rate of a fixed point iteration to first order by a factor of the gain at each step. In addition to improving the convergence rate, our results also show that AA increases the radius of convergence (even beyond a set where the fixed point operator is contractive). Lastly, our estimate shows that while the linear convergence rate is improved, additional quadratic terms arise in the estimate, which shows why AA does not typically improve convergence in quadratically converging fixed point iterations. Results of several numerical tests are given which illustrate the theory.

[1]  C. T. Kelley,et al.  Convergence Analysis for Anderson Acceleration , 2015, SIAM J. Numer. Anal..

[2]  Claude Brezinski,et al.  Convergence acceleration during the 20th century , 2000 .

[3]  Matthieu Geist,et al.  Anderson Acceleration for Reinforcement Learning , 2018, EWRL 2018.

[4]  Nicola Parolini,et al.  Numerical investigation on the stability of singular driven cavity flow , 2002 .

[5]  Donald G. M. Anderson Iterative Procedures for Nonlinear Integral Equations , 1965, JACM.

[6]  Carl Tim Kelley,et al.  Numerical methods for nonlinear equations , 2018, Acta Numerica.

[7]  William Layton,et al.  Introduction to the Numerical Analysis of Incompressible Viscous Flows , 2008 .

[8]  Carol S. Woodward,et al.  Considerations on the implementation and use of Anderson acceleration on distributed memory and GPU-based parallel computers , 2016 .

[9]  Bailin Deng,et al.  Anderson acceleration for geometry optimization and physics simulation , 2018, ACM Trans. Graph..

[10]  Homer F. Walker,et al.  Anderson Acceleration for Fixed-Point Iterations , 2011, SIAM J. Numer. Anal..

[11]  Leo G. Rebholz,et al.  Anderson-Accelerated Convergence of Picard Iterations for Incompressible Navier-Stokes Equations , 2018, SIAM J. Numer. Anal..

[12]  M. Matsen,et al.  Efficiency of pseudo-spectral algorithms with Anderson mixing for the SCFT of periodic block-copolymer phases , 2011, The European physical journal. E, Soft matter.

[13]  Yousef Saad,et al.  Two classes of multisecant methods for nonlinear acceleration , 2009, Numer. Linear Algebra Appl..

[14]  Roger P. Pawlowski,et al.  Analysis of Anderson Acceleration on a Simplified Neutronics/Thermal Hydraulics System , 2015 .

[15]  Nicholas J. Higham,et al.  Anderson acceleration of the alternating projections method for computing the nearest correlation matrix , 2016, Numerical Algorithms.

[16]  Homer F. Walker,et al.  An accelerated Picard method for nonlinear systems related to variably saturated flow , 2012 .

[17]  Homer F. Walker,et al.  Anderson acceleration and application to the three-temperature energy equations , 2017, J. Comput. Phys..