Path Sampling to Compute Integrated Likelihoods: An Adaptive Approach

Computing integrated likelihoods (ILs) to perform Bayesian model selection is a challenging task, particularly when the models considered involve a large number of parameters. In this article, we propose the use of an adaptive quadrature algorithm to automate the selection of the grid in path sampling (PS), an integration technique recognized as one of the most powerful Monte Carlo integration statistical methods for IL estimation. We begin by examining the impact of two tuning parameters of PS, the choice of the auxiliary density and the specification of the grid, both of which are shown to be potentially very influential. We then present the Grid Selection by Adaptive Quadrature (GSAQ) approach and provide a probabilistic bound for the bias of the PS grid estimator in this context. We perform a comparison between the GSAQ and standard grid implementations of PS using two well-studied datasets; GSAQ is found to yield superior results. We then examine other alternatives for PS implementation. In particular, a comparison of the fixed-GSAQ and random approaches to PS is presented. Finally, GSAQ is successfully applied to a longitudinal hierarchical regression model selection problem in multiple sclerosis research. Supplemental materials for this article are available online (see section Supplemental Materials).

[1]  J. Geweke,et al.  Bayesian Inference in Econometric Models Using Monte Carlo Integration , 1989 .

[2]  H. Philippe,et al.  Computing Bayes factors using thermodynamic integration. , 2006, Systematic biology.

[3]  Xin-Yuan Song,et al.  Model comparison of generalized linear mixed models , 2006, Statistics in medicine.

[4]  T. Ferguson A Course in Large Sample Theory , 1996 .

[5]  J. Kingman A FIRST COURSE IN STOCHASTIC PROCESSES , 1967 .

[6]  J. Kurtzke Rating neurologic impairment in multiple sclerosis , 1983, Neurology.

[7]  C. Robert,et al.  Contemplating Evidence: properties, extensions of, and alternatives to Nested Sampling , 2008 .

[8]  Luisa Scaccia,et al.  Bayesian Growth Curves Using Normal Mixtures With Nonparametric Weights , 2003 .

[9]  Ming-Hui Chen,et al.  ESTIMATING RATIOS OF NORMALIZING CONSTANTS FOR DENSITIES WITH DIFFERENT DIMENSIONS , 1997 .

[10]  Xin-Yuan Song,et al.  A Bayesian model selection method with applications , 2002 .

[11]  P. Moral,et al.  Sequential Monte Carlo samplers , 2002, cond-mat/0212648.

[12]  Radford M. Neal Annealed importance sampling , 1998, Stat. Comput..

[13]  W. Gander,et al.  Adaptive Quadrature—Revisited , 2000 .

[14]  Walter R. Gilks,et al.  Introduction to general state-space Markov chain theory , 1995 .

[15]  W. M. McKeeman,et al.  Algorithm 145: Adaptive numerical integration by Simpson's rule , 1962, Communications of the ACM.

[16]  Xiao-Li Meng,et al.  Simulating Normalizing Constants: From Importance Sampling to Bridge Sampling to Path Sampling , 1998 .

[17]  A. Gelfand,et al.  Bayesian Model Choice: Asymptotics and Exact Calculations , 1994 .

[18]  J. N. Lyness When not to use an automatic quadrature routine , 1983 .

[19]  Sik-Yum Lee,et al.  Maximum Likelihood Analysis of a General Latent Variable Model with Hierarchically Mixed Data , 2004, Biometrics.

[20]  Xiao-Li Meng,et al.  SIMULATING RATIOS OF NORMALIZING CONSTANTS VIA A SIMPLE IDENTITY: A THEORETICAL EXPLORATION , 1996 .

[21]  Adrian E. Raftery,et al.  Bayesian model averaging: a tutorial (with comments by M. Clyde, David Draper and E. I. George, and a rejoinder by the authors , 1999 .

[22]  Adrian E. Raftery,et al.  Bayesian model averaging: development of an improved multi-class, gene selection and classification tool for microarray data , 2005, Bioinform..

[23]  C. Robert,et al.  Properties of nested sampling , 2008, 0801.3887.

[24]  B. Carlin,et al.  Bayesian Model Choice Via Markov Chain Monte Carlo Methods , 1995 .

[25]  S. E. Hills,et al.  Illustration of Bayesian Inference in Normal Data Models Using Gibbs Sampling , 1990 .