Function-Specific Mixing Times and Concentration Away from Equilibrium

Slow mixing is the central hurdle when working with Markov chains, especially those used for Monte Carlo approximations (MCMC). In many applications, it is only of interest to estimate the stationary expectations of a small set of functions, and so the usual definition of mixing based on total variation convergence may be too conservative. Accordingly, we introduce function-specific analogs of mixing times and spectral gaps, and use them to prove Hoeffding-like function-specific concentration inequalities. These results show that it is possible for empirical expectations of functions to concentrate long before the underlying chain has mixed in the classical sense, and we show that the concentration rates we achieve are optimal up to constants. We use our techniques to derive confidence intervals that are sharper than those implied by both classical Markov chain Hoeffding bounds and Berry-Esseen-corrected CLT bounds. For applications that require testing, rather than point estimation, we show similar improvements over recent sequential testing results for MCMC. We conclude by applying our framework to real data examples of MCMC, providing evidence that our theory is both accurate and relevant to practice.

[1]  P. Diaconis,et al.  SHUFFLING CARDS AND STOPPING-TIMES , 1986 .

[2]  P. Diaconis,et al.  Strong Stationary Times Via a New Form of Duality , 1990 .

[3]  Alistair Sinclair,et al.  Improved Bounds for Mixing Rates of Markov Chains and Multicommodity Flow , 1992, Combinatorics, Probability and Computing.

[4]  Richard L. Tweedie,et al.  Markov Chains and Stochastic Stability , 1993, Communications and Control Engineering Series.

[5]  D. Rubin,et al.  The analysis of repeated-measures data on schizophrenic reaction times using mixture models. , 1995, Statistics in medicine.

[6]  D. Gillman A Chernoff bound for random walks on expander graphs , 1998, Proceedings of 1993 IEEE 34th Annual Foundations of Computer Science.

[7]  Paul-Marie Samson,et al.  Concentration of measure inequalities for Markov chains and $\Phi$-mixing processes , 2000 .

[8]  Radford M. Neal Markov Chain Sampling Methods for Dirichlet Process Mixture Models , 2000 .

[9]  P. Lezaud Chernoff and Berry-Esséen inequalities for Markov processes , 2001 .

[10]  Galin L. Jones,et al.  Honest Exploration of Intractable Probability Distributions via Markov Chain Monte Carlo , 2001 .

[11]  Optimal Hoeffding bounds for discrete reversible Markov chains , 2004, math/0405296.

[12]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Thomas L. Griffiths,et al.  Infinite latent feature models and the Indian buffet process , 2005, NIPS.

[14]  Christian P. Robert,et al.  Monte Carlo Statistical Methods (Springer Texts in Statistics) , 2005 .

[15]  D. Viswanath,et al.  Riffle shuffles of decks with repeated cards , 2005, math/0503233.

[16]  Y. Ollivier Ricci curvature of Markov chains on metric spaces , 2007, math/0701886.

[17]  Radford M. Neal,et al.  Splitting and merging components of a nonconjugate Dirichlet process mixture model , 2007 .

[18]  Murali Haran,et al.  Markov chain Monte Carlo: Can we trust the third significant figure? , 2007, math/0703746.

[19]  P. Glynn,et al.  Asymptotic Validity of Batch Means Steady-State Confidence Intervals , 2009 .

[20]  Bradley P. Carlin,et al.  Bayesian Adaptive Methods for Clinical Trials , 2010 .

[21]  Y. Ollivier,et al.  CURVATURE, CONCENTRATION AND ERROR ESTIMATES FOR MARKOV CHAIN MONTE CARLO , 2009, 0904.1312.

[22]  David M. Blei,et al.  Sparse stochastic inference for latent Dirichlet allocation , 2012, ICML.

[23]  D. Paulin Concentration inequalities for Markov chains by Marton couplings and spectral methods , 2012, 1212.2015.

[24]  Kai-Min Chung,et al.  Chernoff-Hoeffding Bounds for Markov Chains: Generalized and Simplified , 2012, STACS.

[25]  Benjamin M. Gyori,et al.  Non-asymptotic confidence intervals for MCMC in practice , 2012, 1212.2016.

[26]  V. Climenhaga Markov chains and mixing times , 2013 .

[27]  David B. Dunson,et al.  Bayesian data analysis, third edition , 2013 .

[28]  Aryeh Kontorovich,et al.  Uniform Chernoff and Dvoretzky-Kiefer-Wolfowitz-Type Inequalities for Markov Chains and Related Processes , 2012, J. Appl. Probab..

[29]  Benjamin M. Gyori,et al.  Hypothesis testing for Markov chain Monte Carlo , 2014, Stat. Comput..

[30]  Persi Diaconis,et al.  Random walk on unipotent matrix groups , 2015, Annales scientifiques de l'École Normale Supérieure.