Computationally efficient Bayesian estimation of high-dimensional Archimedean copulas with discrete and mixed margins

Estimating copulas with discrete marginal distributions is challenging, especially in high dimensions, because computing the likelihood contribution of each observation requires evaluating $$2^{J}$$2J terms, with J the number of discrete variables. Our article focuses on the estimation of Archimedean copulas, for example, Clayton and Gumbel copulas. Currently, data augmentation methods are used to carry out inference for discrete copulas and, in practice, the computation becomes infeasible when J is large. Our article proposes two new fast Bayesian approaches for estimating high-dimensional Archimedean copulas with discrete margins, or a combination of discrete and continuous margins. Both methods are based on recent advances in Bayesian methodology that work with an unbiased estimate of the likelihood rather than the likelihood itself, and our key observation is that we can estimate the likelihood of a discrete Archimedean copula unbiasedly with much less computation than evaluating the likelihood exactly or with current simulation methods that are based on augmenting the model with latent variables. The first approach builds on the pseudo-marginal method that allows Markov chain Monte Carlo simulation from the posterior distribution using only an unbiased estimate of the likelihood. The second approach is based on a variational Bayes approximation to the posterior and also uses an unbiased estimate of the likelihood. We show that the two new approaches enable us to carry out Bayesian inference for high values of J for the Archimedean copulas where the computation was previously too expensive. The methodology is illustrated through several real and simulated data examples.

[1]  M. Wand,et al.  Explaining Variational Approximations , 2010 .

[2]  Jared S. Murray,et al.  Bayesian Gaussian Copula Factor Models for Mixed Data , 2011, Journal of the American Statistical Association.

[3]  David J. Nott,et al.  Variational Bayes With Intractable Likelihood , 2015, 1503.08621.

[4]  C. Andrieu,et al.  The pseudo-marginal approach for efficient Monte Carlo computations , 2009, 0903.5480.

[5]  M. Beaumont Estimation of population growth or decline in genetically monitored populations. , 2003, Genetics.

[6]  Ralph S. Silva,et al.  On Some Properties of Markov Chain Monte Carlo Simulation Methods Based on the Particle Filter , 2012 .

[7]  M. Pitt,et al.  Efficient Bayesian inference for Gaussian copula regression models , 2006 .

[8]  Marius Hofert,et al.  Sampling Archimedean copulas , 2008, Comput. Stat. Data Anal..

[9]  Alexander J. McNeil,et al.  Likelihood inference for Archimedean copulas in high dimensions under known margins , 2012, J. Multivar. Anal..

[10]  J. Rosenthal,et al.  On the efficiency of pseudo-marginal random walk Metropolis algorithms , 2013, The Annals of Statistics.

[11]  A. Doucet,et al.  The correlated pseudomarginal method , 2015, Journal of the Royal Statistical Society: Series B (Statistical Methodology).

[12]  P. H. Garthwaite,et al.  Adaptive optimal scaling of Metropolis–Hastings algorithms using the Robbins–Monro process , 2010, 1006.3690.

[13]  A. Gelman,et al.  Weak convergence and optimal scaling of random walk Metropolis algorithms , 1997 .

[14]  Claudia Czado,et al.  Pair Copula Constructions for Multivariate Discrete Data , 2012 .

[15]  A. Doucet,et al.  Efficient implementation of Markov chain Monte Carlo when using an unbiased likelihood estimator , 2012, 1210.1871.

[16]  Pravin K. Trivedi,et al.  Copula Modeling: An Introduction for Practitioners , 2007 .

[17]  N. Shephard,et al.  BAYESIAN INFERENCE BASED ONLY ON SIMULATED LIKELIHOOD: PARTICLE FILTER ANALYSIS OF DYNAMIC ECONOMIC MODELS , 2011, Econometric Theory.

[18]  Ari Pakman,et al.  Exact Hamiltonian Monte Carlo for Truncated Multivariate Gaussians , 2012, 1208.4118.

[19]  J. Ware SF-36 health survey: Manual and interpretation guide , 2003 .

[20]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[21]  Claudia Czado,et al.  Model selection for discrete regular vine copulas , 2017, Comput. Stat. Data Anal..

[22]  H. Robbins A Stochastic Approximation Method , 1951 .

[23]  M. Smith,et al.  Estimation of Copula Models With Discrete Margins via Bayesian Data Augmentation , 2011 .