Piecewise Approximate Bayesian Computation: fast inference for discretely observed Markov models using a factorised posterior distribution

Many modern statistical applications involve inference for complicated stochastic models for which the likelihood function is difficult or even impossible to calculate, and hence conventional likelihood-based inferential techniques cannot be used. In such settings, Bayesian inference can be performed using Approximate Bayesian Computation (ABC). However, in spite of many recent developments to ABC methodology, in many applications the computational cost of ABC necessitates the choice of summary statistics and tolerances that can potentially severely bias the estimate of the posterior.We propose a new “piecewise” ABC approach suitable for discretely observed Markov models that involves writing the posterior density of the parameters as a product of factors, each a function of only a subset of the data, and then using ABC within each factor. The approach has the advantage of side-stepping the need to choose a summary statistic and it enables a stringent tolerance to be set, making the posterior “less approximate”. We investigate two methods for estimating the posterior density based on ABC samples for each of the factors: the first is to use a Gaussian approximation for each factor, and the second is to use a kernel density estimate. Both methods have their merits. The Gaussian approximation is simple, fast, and probably adequate for many applications. On the other hand, using instead a kernel density estimate has the benefit of consistently estimating the true piecewise ABC posterior as the number of ABC samples tends to infinity. We illustrate the piecewise ABC approach with four examples; in each case, the approach offers fast and accurate inference.

[1]  Calyampudi Radhakrishna Rao,et al.  Stochastic processes: modelling and simulation , 2001 .

[2]  Eddie McKenzie,et al.  Discrete variate time series , 2003 .

[3]  Darren J. Wilkinson Stochastic Modelling for Systems Biology , 2006 .

[4]  A. Doucet,et al.  Particle Markov chain Monte Carlo methods , 2010 .

[5]  Paul Fearnhead,et al.  Constructing summary statistics for approximate Bayesian computation: semi‐automatic approximate Bayesian computation , 2012 .

[6]  A. Gallant,et al.  Numerical Techniques for Maximum Likelihood Estimation of Continuous-Time Diffusion Processes , 2002 .

[7]  William T. Freeman,et al.  Nonparametric belief propagation , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[8]  W. Ebeling Stochastic Processes in Physics and Chemistry , 1995 .

[9]  T. Alderweireld,et al.  A Theory for the Term Structure of Interest Rates , 2004, cond-mat/0405293.

[10]  David Moriña,et al.  A statistical model for hospital admissions caused by seasonal diseases , 2011, Statistics in medicine.

[11]  Tom Minka,et al.  Expectation Propagation for approximate Bayesian inference , 2001, UAI.

[12]  A. Cook,et al.  Inference in Epidemic Models without Likelihoods , 2009 .

[13]  D. Balding,et al.  Approximate Bayesian computation in population genetics. , 2002, Genetics.

[14]  Peter Neal,et al.  MCMC for Integer‐Valued ARMA processes , 2007 .

[15]  David Welch,et al.  Approximate Bayesian computation scheme for parameter inference and model selection in dynamical systems , 2009, Journal of The Royal Society Interface.

[16]  Jean-Michel Marin,et al.  Approximate Bayesian computational methods , 2011, Statistics and Computing.

[17]  Arnaud Doucet,et al.  An adaptive sequential Monte Carlo method for approximate Bayesian computation , 2011, Statistics and Computing.

[18]  Nicolas Chopin,et al.  Expectation-Propagation for Summary-Less, Likelihood-Free Inference , 2011 .

[19]  Maria L. Rizzo,et al.  A new test for multivariate normality , 2005 .

[20]  A. P. Dawid,et al.  Parameter inference for stochastic kinetic models of bacterial gene regulation : a Bayesian approach to systems biology , 2010 .

[21]  D. J. Nott,et al.  Approximate Bayesian computation via regression density estimation , 2012, 1212.1479.

[22]  Keinosuke Fukunaga,et al.  Introduction to statistical pattern recognition (2nd ed.) , 1990 .

[23]  Paul Marjoram,et al.  Markov chain Monte Carlo without likelihoods , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Sumeetpal S. Singh,et al.  Parameter Estimation for Hidden Markov Models with Intractable Likelihoods , 2011 .

[25]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[26]  Darren J. Wilkinson,et al.  Bayesian inference for a discretely observed stochastic kinetic model , 2008, Stat. Comput..

[27]  P. J. Green,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[28]  Susan A. Murphy,et al.  Monographs on statistics and applied probability , 1990 .

[29]  Olivier François,et al.  Non-linear regression models for Approximate Bayesian Computation , 2008, Stat. Comput..

[30]  M. Feldman,et al.  Population growth of human Y chromosomes: a study of Y chromosome microsatellites. , 1999, Molecular biology and evolution.

[31]  Yacine Aït-Sahalia Maximum Likelihood Estimation of Discretely Sampled Diffusions: A Closed‐form Approximation Approach , 2002 .

[32]  Mohamed Alosh,et al.  FIRST‐ORDER INTEGER‐VALUED AUTOREGRESSIVE (INAR(1)) PROCESS , 1987 .

[33]  Darren J Wilkinson,et al.  Bayesian parameter inference for stochastic biochemical network models using particle Markov chain Monte Carlo , 2011, Interface Focus.

[34]  G. Pflug Kernel Smoothing. Monographs on Statistics and Applied Probability - M. P. Wand; M. C. Jones. , 1996 .

[35]  Thomas A. Dean,et al.  Asymptotic behaviour of approximate Bayesian estimators , 2011, 1105.3655.

[36]  P J Diggle,et al.  Spatio-temporal epidemiology of Campylobacter jejuni enteritis, in an area of Northwest England, 2000–2002 , 2010, Epidemiology and Infection.

[37]  R. Wilkinson Approximate Bayesian computation (ABC) gives exact results under the assumption of model error , 2008, Statistical applications in genetics and molecular biology.