Quantitative contraction rates for Markov chains on general state spaces

We investigate the problem of quantifying contraction coefficients of Markov transition kernels in Kantorovich ($L^1$ Wasserstein) distances. For diffusion processes, relatively precise quantitative bounds on contraction rates have recently been derived by combining appropriate couplings with carefully designed Kantorovich distances. In this paper, we partially carry over this approach from diffusions to Markov chains. We derive quantitative lower bounds on contraction rates for Markov chains on general state spaces that are powerful if the dynamics is dominated by small local moves. For Markov chains on $\mathbb{R^d}$ with isotropic transition kernels, the general bounds can be used efficiently together with a coupling that combines maximal and reflection coupling. The results are applied to Euler discretizations of stochastic differential equations with non-globally contractive drifts, and to the Metropolis adjusted Langevin algorithm for sampling from a class of probability measures on high dimensional state spaces that are not globally log-concave.

[1]  Richard L. Tweedie,et al.  Markov Chains and Stochastic Stability , 1993, Communications and Control Engineering Series.

[2]  Feng-Yu Wang Application of coupling methods to the Neumann eigenvalue problem , 1994 .

[3]  Mu-Fa Chen,et al.  Estimation of the First Eigenvalue of Second Order Elliptic Operators , 1995 .

[4]  Feng-Yu Wang,et al.  Estimation of spectral gap for elliptic operators , 1997 .

[5]  R. McCann Exact solutions to the transportation problem on the line , 1999, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[6]  Eric Vigoda,et al.  Elementary bounds on Poincaré and log-Sobolev constants for decomposable Markov chains , 2004, math/0503537.

[7]  Karl-Theodor Sturm,et al.  Transport inequalities, gradient estimates, entropy and Ricci curvature , 2005 .

[8]  Persi Diaconis,et al.  Separation cut-offs for birth and death chains , 2006, math/0702411.

[9]  Y. Ollivier Ricci curvature of Markov chains on metric spaces , 2007, math/0701886.

[10]  Jonathan C. Mattingly,et al.  Spectral gaps in Wasserstein distances and the 2D stochastic Navier–Stokes equations , 2006, math/0602479.

[11]  Jian Ding,et al.  Total variation cutoff in birth-and-death chains , 2008, 0801.2625.

[12]  Jonathan C. Mattingly,et al.  Asymptotic coupling and a general form of Harris’ theorem with applications to stochastic delay equations , 2009, 0902.4495.

[13]  Y. Ollivier,et al.  CURVATURE, CONCENTRATION AND ERROR ESTIMATES FOR MARKOV CHAIN MONTE CARLO , 2009, 0904.1312.

[14]  A. Eberle Reflection coupling and Wasserstein contractivity without convexity , 2011 .

[15]  Jonathan C. Mattingly,et al.  Yet Another Look at Harris’ Ergodic Theorem for Markov Chains , 2008, 0810.2777.

[16]  T. Komorowski,et al.  Central limit theorem for Markov processes with spectral gap in the Wasserstein metric , 2011, 1102.1842.

[17]  M. Ledoux,et al.  Analysis and Geometry of Markov Diffusion Operators , 2013 .

[18]  Martin Hairer,et al.  Exponential ergodicity for Markov processes with random switching , 2013, 1303.6999.

[19]  N. Pillai,et al.  Ergodicity of Approximate MCMC Chains with Applications to Large Data Sets , 2014, 1405.0182.

[20]  D. Paulin Mixing and Concentration by Ricci Curvature , 2014, 1404.2802.

[21]  A. Stuart,et al.  Spectral gaps for a Metropolis–Hastings algorithm in infinite dimensions , 2011, 1112.1392.

[22]  O. Butkovsky Subgeometric rates of convergence of Markov processes in the Wasserstein metric , 2012, 1211.4273.

[23]  A. Dalalyan Theoretical guarantees for approximate sampling from smooth and log‐concave densities , 2014, 1412.7392.

[24]  É. Moulines,et al.  Subgeometric rates of convergence in Wasserstein distance for Markov chains , 2014, 1402.4577.

[25]  A. Eberle Error bounds for Metropolis–Hastings algorithms applied to perturbations of Gaussian measures in high dimensions , 2012, 1210.1180.

[26]  É. Moulines,et al.  Non-asymptotic convergence analysis for the Unadjusted Langevin Algorithm , 2015, 1507.05021.

[27]  Eric Moulines,et al.  Quantitative bounds of convergence for geometrically ergodic Markov chain in the Wasserstein distance with application to the Metropolis Adjusted Langevin Algorithm , 2014, Statistics and Computing.

[28]  Raphael Zimmer,et al.  Explicit contraction rates for a class of degenerate and infinite-dimensional diffusions , 2016, 1605.07863.

[29]  Jian Wang L-Wasserstein distance for stochastic differential equations driven by Lévy processes , 2016 .

[30]  A. Eberle Couplings, distances and contractivity for diffusion processes revisited , 2013 .

[31]  James Zou,et al.  Quantifying the accuracy of approximate diffusions and Markov chains , 2016, AISTATS.

[32]  Jonathan C. Mattingly,et al.  Error bounds for Approximations of Markov chains , 2017 .

[33]  Jonathan C. Mattingly,et al.  Error bounds for Approximations of Markov chains used in Bayesian Sampling , 2017, 1711.05382.

[34]  Mateusz B. Majka Coupling and exponential ergodicity for stochastic differential equations driven by Lévy processes , 2015, 1509.08816.

[35]  A. Eberle,et al.  Quantitative Harris-type theorems for diffusions and McKean–Vlasov processes , 2016, Transactions of the American Mathematical Society.

[36]  D. Rudolf,et al.  Perturbation theory for Markov chains via Wasserstein distance , 2015, Bernoulli.

[37]  Mateusz B. Majka,et al.  Nonasymptotic bounds for sampling algorithms without log-concavity , 2018, The Annals of Applied Probability.

[38]  Mateusz B. Majka Transportation inequalities for non-globally dissipative SDEs with jumps via Malliavin calculus and coupling , 2016, Annales de l'Institut Henri Poincaré, Probabilités et Statistiques.

[39]  Arnak S. Dalalyan,et al.  User-friendly guarantees for the Langevin Monte Carlo with inaccurate gradient , 2017, Stochastic Processes and their Applications.

[40]  Jian Wang,et al.  Refined basic couplings and Wasserstein-type distances for SDEs with Lévy noises , 2016, Stochastic Processes and their Applications.

[41]  Alain Durmus,et al.  High-dimensional Bayesian inference via the unadjusted Langevin algorithm , 2016, Bernoulli.

[42]  A. Eberle,et al.  Sticky couplings of multidimensional diffusions with different drifts , 2016, Annales de l'Institut Henri Poincaré, Probabilités et Statistiques.

[43]  A. Eberle,et al.  Coupling and convergence for Hamiltonian Monte Carlo , 2018, The Annals of Applied Probability.