Optimizing and Adapting the Metropolis Algorithm

Many modern scientific questions involve high-dimensional data and complicated statistical models. For example, data on weather consist of huge numbers of measurements across spatial grids, over a period of time. Even in simpler settings, data can be complex: for example, Bartolucci et al. (2007) consider recurrence rates for melanoma (skin cancer) patients after surgery. The probability of recurrence for an individual may depend on physical or biological characteristics of their cancerous lesion, as well as other factors. A statistical model in this context may involve a large number of variables and a correspondingly large number of parameters, which are often represented by a vector θ of some dimension d.

To assess the relevance of specific variables for disease recurrence, and to build models that give a risk of recurrence for any given individual, researchers often use Bayesian analysis (see e.g. Box and Tiao, 1973; Gelman et al., 2003; Carlin and Louis, 2008). In this framework, the parameter vector is assumed to follow some probability distribution (of dimension d), and the challenge is to combine a “prior” distribution for θ (typically based on background information about the scientific area) with data that are collected, so as to produce a “posterior” distribution for θ. This probability distribution (call it π(θ)) can then be used to answer important scientific questions (e.g., is the size of a cancerous lesion related to the risk of recurrence after surgery?) and to calculate specific probabilities (e.g., this person has a 20% probability of a recurrence within the next five years).

One challenge for Bayesian analysis in situations where the data and parameter vectors are high-dimensional is that it is difficult or impossible to compute probabilities based on the posterior distribution. If there is some outcome A of interest (e.g., the outcome that a specific individual’s cancer
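In practice, such posterior probabilities are usually estimated by averaging over samples drawn from π(θ) with Markov chain Monte Carlo. As a minimal sketch of the random-walk Metropolis algorithm that this article is concerned with, the following runs the sampler on a toy one-dimensional target; the function name, tuning values, and toy target are illustrative choices, not taken from the paper:

```python
import math
import random

def rw_metropolis(log_target, theta0, sigma, n_iter, seed=0):
    """Random-walk Metropolis sampler for a one-dimensional target.

    Proposes theta' = theta + sigma * Z with Z ~ N(0, 1) and accepts with
    probability min(1, pi(theta') / pi(theta)); on rejection the chain
    stays where it is.
    """
    rng = random.Random(seed)
    theta = theta0
    samples = []
    for _ in range(n_iter):
        proposal = theta + sigma * rng.gauss(0.0, 1.0)
        # Work on the log scale to avoid under/overflow in the density ratio.
        log_alpha = min(0.0, log_target(proposal) - log_target(theta))
        if rng.random() < math.exp(log_alpha):
            theta = proposal
        samples.append(theta)
    return samples

# Toy posterior: pi(theta) proportional to exp(-theta^2 / 2), i.e. N(0, 1).
log_pi = lambda theta: -0.5 * theta * theta

draws = rw_metropolis(log_pi, theta0=0.0, sigma=2.4, n_iter=50_000)
burned = draws[5_000:]  # discard burn-in before estimating probabilities
# Posterior probability of an event such as {theta > 1} is estimated by the
# fraction of (post-burn-in) samples falling in it; the true value here
# is about 0.159.
est = sum(t > 1.0 for t in burned) / len(burned)
```

The proposal scale sigma is the tuning knob that the optimal-scaling literature cited below is about: too small and the chain takes tiny steps, too large and most proposals are rejected, with well-known results identifying intermediate acceptance rates (around 0.234 in high dimensions, for product-form targets) as optimal.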

[1] Gareth O. Roberts, et al. Minimising MCMC variance via diffusion limits, with an application to simulated tempering, 2014.

[2] J. Rosenthal, et al. Adaptive Gibbs samplers and related MCMC methods, 2011, arXiv:1101.5838.

[3] S. Richardson, et al. Bayesian Models for Sparse Regression Analysis of High Dimensional Data, 2012.

[4] G. Fort, et al. Convergence of adaptive and interacting Markov chain Monte Carlo algorithms, 2011, arXiv:1203.3036.

[5] Gareth O. Roberts, et al. Towards optimal scaling of Metropolis-coupled Markov chain Monte Carlo, 2011, Stat. Comput.

[6] Nando de Freitas, et al. Intracluster Moves for Constrained Discrete-Space MCMC, 2010, UAI.

[7] Z. Q. John Lu. Bayesian methods for data analysis, third edition, 2010.

[8] Robert E. Weiss. Bayesian methods for data analysis, 2010, American Journal of Ophthalmology.

[9] E. Saksman, et al. On the ergodicity of the adaptive Metropolis algorithm on unbounded domains, 2008, arXiv:0806.2933.

[10] Chao Yang, et al. Learn From Thy Neighbor: Parallel-Chain and Regional Adaptive MCMC, 2009.

[11] L. McCandless. Review of: Bayesian methods for data analysis (3rd edn), by Bradley P. Carlin and Thomas A. Louis, Chapman & Hall/CRC, Boca Raton, 2008, ISBN 9781584886976, 2009.

[12] G. Roberts, et al. Optimal scaling of the random walk Metropolis on elliptically symmetric unimodal targets, 2009, arXiv:0909.0856.

[13] G. Roberts, et al. Optimal scalings of Metropolis-Hastings algorithms for non-product targets in high dimensions, 2009.

[14] Gareth Roberts, et al. Optimal scalings for local Metropolis-Hastings chains on nonproduct targets in high dimensions, 2009, arXiv:0908.0865.

[15] Gareth O. Roberts, et al. Examples of Adaptive MCMC, 2009.

[16] G. Fort, et al. Limit theorems for some adaptive MCMC algorithms with subgeometric kernels, 2008, arXiv:0807.2952.

[17] G. Roberts, et al. Optimal scaling of the random walk Metropolis on unimodal elliptically symmetric targets, 2009.

[18] M. Bédard. Optimal acceptance rates for Metropolis algorithms: Moving beyond 0.234, 2008.

[19] Jeffrey S. Rosenthal, et al. Optimal scaling of Metropolis algorithms: Heading toward general target distributions, 2008.

[20] P. Giordani, et al. Adaptive Independent Metropolis-Hastings by Fast Estimation of Mixtures of Normals, 2008, arXiv:0801.1864.

[21] Chao Yang, et al. Learn From Thy Neighbor: Parallel-Chain Adaptive MCMC, 2008.

[22] M. Bédard. Weak convergence of Metropolis algorithms for non-i.i.d. target distributions, 2007, arXiv:0710.3684.

[23] J. Rosenthal, et al. Coupling and Ergodicity of Adaptive Markov Chain Monte Carlo Algorithms, 2007, Journal of Applied Probability.

[24] Bartolucci, et al. Analyzing Clinical Trial Data via the Bayesian Multiple Logistic Random Effects Model, 2007.

[25] Jeffrey S. Rosenthal, et al. Coupling and Ergodicity of Adaptive MCMC, 2007.

[26] C. Andrieu, et al. On the ergodicity properties of some adaptive MCMC algorithms, 2006, arXiv:math/0610317.

[27] Jerry Nedelman. Book review: Bayesian Data Analysis, Second Edition, by A. Gelman, J. B. Carlin, H. S. Stern, and D. B. Rubin, Chapman & Hall/CRC, 2004, 2005, Comput. Stat.

[28] J. Rosenthal, et al. On adaptive Markov chain Monte Carlo algorithms, 2005.

[29] Radford M. Neal, et al. A Split-Merge Markov chain Monte Carlo Procedure for the Dirichlet Process Mixture Model, 2004.

[30] D. J. C. MacKay. Slice sampling: discussion, 2003.

[31] S. Walker. Invited comment on the paper "Slice Sampling" by Radford Neal, 2003.

[32] J. Rosenthal, et al. Optimal scaling for various Metropolis-Hastings algorithms, 2001.

[33] H. Haario, et al. An adaptive Metropolis algorithm, 2001.

[34] J. Rosenthal, et al. Extension of Fill's perfect rejection sampling algorithm to general chains, 2000, Random Struct. Algorithms.

[35] Andrew Thomas, et al. WinBUGS - A Bayesian modelling framework: Concepts, structure, and extensibility, 2000, Stat. Comput.

[36] Tim B. Swartz, et al. Approximating Integrals Via Monte Carlo and Deterministic Methods, 2000.

[37] P. Green, et al. Exact Sampling from a Continuous State Space, 1998.

[38] J. Rosenthal, et al. Optimal scaling of discrete approximations to Langevin diffusions, 1998.

[39] A. Gelman, et al. Weak convergence and optimal scaling of random walk Metropolis algorithms, 1997.

[40] David Bruce Wilson, et al. Exact sampling with coupled Markov chains and applications to statistical mechanics, 1996, Random Struct. Algorithms.

[41] J. Rosenthal. Minorization Conditions and Convergence Rates for Markov Chain Monte Carlo, 1995.

[42] Robert L. Smith, et al. Hit-and-Run Algorithms for Generating Multivariate Distributions, 1993, Math. Oper. Res.

[43] Jim Albert. A Bayesian Analysis of a Poisson Random Effects Model for Home Run Hitters, 1992.

[44] Adrian F. M. Smith, et al. Sampling-Based Approaches to Calculating Marginal Densities, 1990.

[45] L. Brooke. The Story of the Three Bears, 1974, The Wordsworth Circle.

[46] G. C. Tiao, et al. Bayesian inference in statistical analysis, 1973.

[47] W. K. Hastings. Monte Carlo Sampling Methods Using Markov Chains and Their Applications, 1970.

[48] N. Metropolis, et al. Equation of State Calculations by Fast Computing Machines, 1953, Resonance.