Bootstrapped synthetic likelihood

The development of approximate Bayesian computation (ABC) and synthetic likelihood (SL) techniques has enabled the use of Bayesian inference for models that may be simulated, but for which the likelihood is not available to evaluate pointwise at values of an unknown parameter $\theta$. The main idea in ABC and SL is to, for different values of $\theta$ (usually chosen using a Monte Carlo algorithm), build estimates of the likelihood based on simulations from the model conditional on $\theta$. The quality of these estimates determines the efficiency of an ABC/SL algorithm. In standard ABC/SL, the only means to improve an estimated likelihood at $\theta$ is to simulate more times from the model conditional on $\theta$, which is infeasible in cases where the simulator is computationally expensive. In this paper we describe how to use bootstrapping as a means for improving synthetic likelihood estimates whilst using fewer simulations from the model, and also investigate its use in ABC. Further, we investigate the use of the bag of little bootstraps as a means for applying this approach to large datasets, yielding to Monte Carlo algorithms that accurately approximate posterior distributions whilst only simulating subsamples of the full data. Examples of the approach applied to i.i.d., temporal and spatial data are given.

[1]  Elise Jennings,et al.  astroABC : An Approximate Bayesian Computation Sequential Monte Carlo sampler for cosmological parameter estimation , 2016, Astron. Comput..

[2]  Richard G. Everitt,et al.  A rare event approach to high-dimensional approximate Bayesian computation , 2016, Statistics and Computing.

[3]  David T. Frazier,et al.  Auxiliary Likelihood-Based Approximate Bayesian Computation in State Space Models , 2016, Journal of Computational and Graphical Statistics.

[4]  I. Bojak,et al.  Fast approximate Bayesian inference for stable differential equation models , 2017, 1706.00689.

[5]  Ingram Olkin,et al.  Unbiased Estimation of Some Multivariate Probability Densities and Related Functions , 1969 .

[6]  D. Gillespie Exact Stochastic Simulation of Coupled Chemical Reactions , 1977 .

[7]  H. Künsch The Jackknife and the Bootstrap for General Stationary Observations , 1989 .

[8]  Max Welling,et al.  GPS-ABC: Gaussian Process Surrogate Approximate Bayesian Computation , 2014, UAI.

[9]  S. Wood Statistical inference for noisy nonlinear ecological dynamic systems , 2010, Nature.

[10]  C. Andrieu,et al.  The pseudo-marginal approach for efficient Monte Carlo computations , 2009, 0903.5480.

[11]  Pierre Alquier,et al.  Noisy Monte Carlo: convergence of Markov chains with approximate transition kernels , 2014, Statistics and Computing.

[12]  Christopher C. Drovandi,et al.  Accelerating Bayesian Synthetic Likelihood With the Graphical Lasso , 2019, Journal of Computational and Graphical Statistics.

[13]  Christopher C. Drovandi,et al.  Pre-processing for approximate Bayesian computation in image analysis , 2015, Stat. Comput..

[14]  J. Cochran What is the bootstrap? , 2019, Significance.

[15]  Richard G. Everitt,et al.  Bayesian Parameter Estimation for Latent Markov Random Fields and Social Networks , 2012, ArXiv.

[16]  M. Feldman,et al.  Population growth of human Y chromosomes: a study of Y chromosome microsatellites. , 1999, Molecular biology and evolution.

[17]  Mario V. Wuthrich,et al.  Chain Ladder Method: Bayesian Bootstrap Versus Classical Bootstrap , 2009 .

[18]  Noah A. Rosenberg,et al.  AABC: approximate approximate Bayesian computation when simulating a large number of data sets is computationally infeasible , 2013, 1301.6282.

[19]  Richard G. Everitt,et al.  Delayed Acceptance ABC-SMC , 2017, J. Comput. Graph. Stat..

[20]  Anthony N. Pettitt,et al.  Bayesian Parametric Bootstrap for Models with Intractable Likelihoods , 2019, Bayesian Analysis.

[21]  D. Balding,et al.  Approximate Bayesian computation in population genetics. , 2002, Genetics.

[22]  ABootstrapLikelihoodapproachtoBayesianComputation,et al.  A Bootstrap Likelihood Approach to Bayesian Computation , 2015, 1510.07287.

[23]  David T. Frazier,et al.  Bayesian Synthetic Likelihood , 2017, 2305.05120.

[24]  P. Moral,et al.  Sequential Monte Carlo samplers , 2002, cond-mat/0212648.

[25]  Purnamrita Sarkar,et al.  A scalable bootstrap for massive data , 2011, 1112.5016.

[26]  Zoubin Ghahramani,et al.  MCMC for Doubly-intractable Distributions , 2006, UAI.

[27]  Michael U. Gutmann,et al.  Bayesian Optimization for Likelihood-Free Inference of Simulator-Based Statistical Models , 2015, J. Mach. Learn. Res..

[28]  A. Amara,et al.  Approximate Bayesian computation for forward modeling in cosmology , 2015, 1504.07245.

[29]  Allan Sly,et al.  Critical Ising on the Square Lattice Mixes in Polynomial Time , 2010, 1001.1613.

[30]  Arnaud Doucet,et al.  On Markov chain Monte Carlo methods for tall data , 2015, J. Mach. Learn. Res..

[31]  Richard Wilkinson,et al.  Accelerating ABC methods using Gaussian processes , 2014, AISTATS.

[32]  C. Robert,et al.  ABC likelihood-free methods for model choice in Gibbs random fields , 2008, 0807.2767.

[33]  David Hinkley,et al.  Bootstrap Methods: Another Look at the Jackknife , 2008 .

[34]  Iain Murray,et al.  Fast $\epsilon$-free Inference of Simulation Models with Bayesian Conditional Density Estimation , 2016 .

[35]  Andrew Golightly,et al.  Adaptive, Delayed-Acceptance MCMC for Targets With Expensive Likelihoods , 2015, 1509.00172.

[36]  Christophe Andrieu,et al.  Establishing some order amongst exact approximations of MCMCs , 2014, 1404.6909.

[37]  Alice S.A. Johnston,et al.  Calibration and evaluation of individual-based models using Approximate Bayesian Computation , 2015 .

[38]  Nikolay Laptev BOOT-TS : A Scalable Bootstrap for Massive Time-Series Data , 2012 .

[39]  Theodore Kypraios,et al.  A tutorial introduction to Bayesian inference for stochastic epidemic models using Approximate Bayesian Computation. , 2017, Mathematical biosciences.

[40]  W. Härdle,et al.  Bootstrap Methods for Time Series , 2003 .

[41]  Jon Louis Bentley,et al.  Multidimensional binary search trees used for associative searching , 1975, CACM.

[42]  Richard G. Everitt,et al.  Bayesian model comparison with un-normalised likelihoods , 2015, Stat. Comput..

[43]  Christopher C. Drovandi,et al.  Accelerating pseudo-marginal MCMC using Gaussian processes , 2018, Comput. Stat. Data Anal..

[44]  Alberto Caimo,et al.  Bayesian inference for exponential random graph models , 2010, Soc. Networks.

[45]  J. Møller,et al.  An efficient Markov chain Monte Carlo method for distributions with intractable normalising constants , 2006 .