论文信息 - Divide and Conquer: A Mixture-Based Approach to Regional Adaptation for MCMC

Divide and Conquer: A Mixture-Based Approach to Regional Adaptation for MCMC

The efficiency of Markov chain Monte Carlo (MCMC) algorithms can vary dramatically with the choice of simulation parameters. Adaptive MCMC (AMCMC) algorithms allow the automatic tuning of the parameters while the simulation is in progress. A multimodal target distribution may call for regional adaptation of Metropolis–Hastings samplers so that the proposal distribution varies across regions in the sample space. Establishing such a partition is not straightforward and, in many instances, the learning required for its specification takes place gradually, as the simulation proceeds. In the case in which the target distribution is approximated by a mixture of Gaussians, we propose an adaptation process for the partition. It involves fitting the mixture using the available samples via an online EM algorithm and, based on the current mixture parameters, constructing the regional adaptive algorithm with online recursion (RAPTOR). The method is compared with other regional AMCMC samplers and is tested on simulated as well as real data examples. Relevant theoretical proofs, code and datasets are posted as an online supplement.

Yan Bai | Radu V. Craiu | Antonio F. Di Narzo

[1] A. F. Smith,et al. Statistical analysis of finite mixture distributions , 1986 .

[2] H. Haario,et al. An adaptive Metropolis algorithm , 2001 .

[3] J. Rosenthal,et al. Coupling and Ergodicity of Adaptive Markov Chain Monte Carlo Algorithms , 2007, Journal of Applied Probability.

[4] Cristian Sminchisescu,et al. Covariance scaled sampling for monocular 3D body tracking , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[5] P. Giordani,et al. Adaptive Independent Metropolis–Hastings by Fast Estimation of Mixtures of Normals , 2008, 0801.1864.

[6] Radford M. Neal. Annealed importance sampling , 1998, Stat. Comput..

[7] P. Green,et al. Corrigendum: On Bayesian analysis of mixtures with an unknown number of components , 1997 .

[8] C. Robert,et al. Controlled MCMC for Optimal Sampling , 2001 .

[9] M J Small,et al. Parametric distributions of regional lake chemistry: fitted and derived. , 1988, Environmental science & technology.

[10] M. West. Approximating posterior distributions by mixtures , 1993 .

[11] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[12] C. Geyer,et al. Annealing Markov chain Monte Carlo with applications to ancestral inference , 1995 .

[13] Radford M. Neal. Sampling from multimodal distributions using tempered transitions , 1996, Stat. Comput..

[14] Carissa A. Sanchez,et al. Determination of the frequency of loss of heterozygosity in esophageal adenocarcinoma by cell sorting, whole genome amplification and microsatellite polymorphisms. , 1996, Oncogene.

[15] G. Warnes. The Normal Kernel Coupler: An Adaptive Markov Chain Monte Carlo Method for Efficiently Sampling From Multi-Modal Distributions , 2001 .

[16] P. Sen,et al. Large sample methods in statistics , 1993 .