Second order quantitative bounds for unadjusted generalized Hamiltonian Monte Carlo

This paper provides a convergence analysis for generalized Hamiltonian Monte Carlo samplers, a family of Markov Chain Monte Carlo methods based on leapfrog integration of Hamiltonian dynamics and kinetic Langevin diffusion, that encompasses the unadjusted Hamiltonian Monte Carlo method. Assuming that the target distribution $\pi$ satisfies a log-Sobolev inequality and mild conditions on the corresponding potential function, we establish quantitative bounds on the relative entropy of the iterates defined by the algorithm, with respect to $\pi$. Our approach is based on a perturbative and discrete version of the modified entropy method developed to establish hypocoercivity for the continuous-time kinetic Langevin process. As a corollary of our main result, we are able to derive complexity bounds for the class of algorithms at hand. In particular, we show that the total number of iterations to achieve a target accuracy $\varepsilon>0$ is of order $d/\varepsilon^{1/4}$, where $d$ is the dimension of the problem. This result can be further improved in the case of weakly interacting mean field potentials, for which we find a total number of iterations of order $(d/\varepsilon)^{1/4}$.

[1]  B. Leimkuhler,et al.  Contraction and Convergence Rates for Discretized Kinetic Langevin Dynamics , 2023, ArXiv.

[2]  Murat A. Erdogdu,et al.  Improved Discretization Analysis for Underdamped Langevin Monte Carlo , 2023, COLT.

[3]  Pierre Monmarch'e An entropic approach for Hamiltonian Monte Carlo: the idealized case , 2022, 2209.13405.

[4]  A. Guillin,et al.  Uniform Poincaré and logarithmic Sobolev inequalities for mean field particle systems , 2022, The Annals of Applied Probability.

[5]  Pierre Le Bris,et al.  HMC and underdamped Langevin united in the unadjusted convex smooth case , 2022, 2202.00977.

[6]  Oren Mangoubi,et al.  Mixing of Hamiltonian Monte Carlo on strongly log-concave distributions: Continuous dynamics , 2021, The Annals of Applied Probability.

[7]  Alain Durmus,et al.  Asymptotic bias of inexact Markov Chain Monte Carlo methods in high dimension , 2021, The Annals of Applied Probability.

[8]  É. Moulines,et al.  Uniform minorization condition and convergence bounds for discretizations of kinetic Langevin dynamics , 2021, ArXiv.

[9]  A. Diez,et al.  Propagation of chaos: A review of models, methods and applications. Ⅱ. Applications , 2021, Kinetic and Related Models.

[10]  A. Eberle,et al.  Mixing Time Guarantees for Unadjusted Hamiltonian Monte Carlo , 2021, Bernoulli.

[11]  N. Bou-Rabee,et al.  Convergence of unadjusted Hamiltonian Monte Carlo for mean-field models , 2020, Electronic Journal of Probability.

[12]  Pierre Monmarch'e High-dimensional MCMC with a standard splitting scheme for the underdamped Langevin diffusion. , 2020, Electronic Journal of Statistics.

[13]  Yu Cao,et al.  On explicit $L^2$-convergence rate estimate for underdamped Langevin dynamics , 2019, 1908.04746.

[14]  Martin J. Wainwright,et al.  Fast mixing of Metropolized Hamiltonian Monte Carlo: Benefits of multi-step gradients , 2019, J. Mach. Learn. Res..

[15]  Santosh S. Vempala,et al.  Optimal Convergence Rate of Hamiltonian Monte Carlo for Strongly Logconcave Distributions , 2019, APPROX-RANDOM.

[16]  Santosh S. Vempala,et al.  Rapid Convergence of the Unadjusted Langevin Algorithm: Isoperimetry Suffices , 2019, NeurIPS.

[17]  Michael I. Jordan,et al.  Is There an Analog of Nesterov Acceleration for MCMC? , 2019, ArXiv.

[18]  Michael I. Jordan,et al.  Sampling can be faster than optimization , 2018, Proceedings of the National Academy of Sciences.

[19]  Christophe Andrieu,et al.  Hypocoercivity of piecewise deterministic Markov process-Monte Carlo , 2018, The Annals of Applied Probability.

[20]  A. Doucet,et al.  Randomized Hamiltonian Monte Carlo as scaling limit of the bouncy particle sampler and dimension-free convergence rates , 2018, The Annals of Applied Probability.

[21]  Arnak S. Dalalyan,et al.  On sampling from a log-concave density using kinetic Langevin diffusions , 2018, Bernoulli.

[22]  A. Eberle,et al.  Coupling and convergence for Hamiltonian Monte Carlo , 2018, The Annals of Applied Probability.

[23]  Michael I. Jordan,et al.  Underdamped Langevin MCMC: A non-asymptotic analysis , 2017, COLT.

[24]  A. Eberle,et al.  Couplings and quantitative contraction rates for Langevin dynamics , 2017, The Annals of Probability.

[25]  Patrick L. Combettes,et al.  Perspective Functions: Properties, Constructions, and Examples , 2016, Set-Valued and Variational Analysis.

[26]  Gabriel Stoltz,et al.  Partial differential equations and stochastic methods in molecular dynamics* , 2016, Acta Numerica.

[27]  É. Moulines,et al.  Non-asymptotic convergence analysis for the Unadjusted Langevin Algorithm , 2015, 1507.05021.

[28]  Assyr Abdulle,et al.  Long Time Accuracy of Lie-Trotter Splitting Methods for Langevin Dynamics , 2015, SIAM J. Numer. Anal..

[29]  A. Dalalyan Theoretical guarantees for approximate sampling from smooth and log‐concave densities , 2014, 1412.7392.

[30]  Susan P. Holmes,et al.  Positive Curvature and Hamiltonian Monte Carlo , 2014, NIPS.

[31]  Yurii Nesterov,et al.  Introductory Lectures on Convex Optimization - A Basic Course , 2014, Applied Optimization.

[32]  M. Ledoux,et al.  Analysis and Geometry of Markov Diffusion Operators , 2013 .

[33]  B. Leimkuhler,et al.  Rational Construction of Stochastic Numerical Methods for Molecular Sampling , 2012, 1203.5428.

[34]  A. Eberle Reflection coupling and Wasserstein contractivity without convexity , 2011 .

[35]  Yee Whye Teh,et al.  Bayesian Learning via Stochastic Gradient Langevin Dynamics , 2011, ICML.

[36]  Radford M. Neal MCMC Using Hamiltonian Dynamics , 2011, 1206.1901.

[37]  John K Kruschke,et al.  Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[38]  M. Parrinello,et al.  Accurate sampling using Langevin dynamics. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[39]  F. Hérau Short and long time behavior of the Fokker-Planck equation in a confining potential and applications , 2005, math/0501363.

[40]  Jesús A. Izaguirre,et al.  An impulse integrator for Langevin dynamics , 2002 .

[41]  Jonathan C. Mattingly,et al.  Ergodicity for SDEs and approximations: locally Lipschitz vector fields and degenerate noise , 2002 .

[42]  R. Tweedie,et al.  Exponential convergence of Langevin distributions and their discrete approximations , 1996 .

[43]  Michael I. Miller,et al.  REPRESENTATIONS OF KNOWLEDGE IN COMPLEX SYSTEMS , 1994 .

[44]  D. Ermak,et al.  Numerical integration of the Langevin equation: Monte Carlo simulation , 1980 .

[45]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[46]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[47]  Nando de Freitas,et al.  An Introduction to MCMC for Machine Learning , 2004, Machine Learning.

[48]  D. Talay Stochastic Hamiltonian Systems : Exponential Convergence to the Invariant Measure , and Discretization by the Implicit Euler Scheme , 2002 .

[49]  J. D. Doll,et al.  Brownian dynamics as smart Monte Carlo simulation , 1978 .

[50]  S. Chandrasekhar Stochastic problems in Physics and Astronomy , 1943 .