High-Dimensional Copula Variational Approximation Through Transformation

ABSTRACT Variational methods are attractive for computing Bayesian inference when exact inference is impractical. They approximate a target distribution—either the posterior or an augmented posterior—using a simpler distribution that is selected to balance accuracy with computational feasibility. Here, we approximate an element-wise parametric transformation of the target distribution as multivariate Gaussian or skew-normal. Approximations of this kind are implicit copula models for the original parameters, with a Gaussian or skew-normal copula function and flexible parametric margins. A key observation is that their adoption can improve the accuracy of variational inference in high dimensions at limited or no additional computational cost. We consider the Yeo–Johnson and inverse G&H transformations, along with sparse factor structures for the scale matrix of the Gaussian or skew-normal. We also show how to implement efficient reparameterization gradient methods for these copula-based approximations. The efficacy of the approach is illustrated by computing posterior inference for three different models using six real datasets. In each case, we show that our proposed copula model distributions are more accurate variational approximations than Gaussian or skew-normal distributions, but at only a minor or no increase in computational cost. Supplementary materials comprising an online appendix, MATLAB code to implement the method, and the datasets employed, are available online.

[1]  Léon Bottou,et al.  Large-Scale Machine Learning with Stochastic Gradient Descent , 2010, COMPSTAT.

[2]  H. Robbins A Stochastic Approximation Method , 1951 .

[3]  Toshinao Yoshiba Maximum likelihood estimation of skew-t copulas with its applications to stock returns , 2018 .

[4]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[5]  M. Smith,et al.  Inversion copulas from nonlinear state space models with an application to inflation forecasting , 2018, International Journal of Forecasting.

[6]  Jiqiang Guo,et al.  Stan: A Probabilistic Programming Language. , 2017, Journal of statistical software.

[7]  M. Smith,et al.  Estimation of Copula Models With Discrete Margins via Bayesian Data Augmentation , 2011 .

[8]  M. Wand,et al.  Explaining Variational Approximations , 2010 .

[9]  David J. Nott,et al.  Gaussian variational approximation with sparse precision matrices , 2016, Statistics and Computing.

[10]  A. Frigessi,et al.  Pair-copula constructions of multiple dependence , 2009 .

[11]  M. Smith,et al.  Copula Modelling of Dependence in Multivariate Time Series , 2013 .

[12]  S. Kotz,et al.  The Meta-elliptical Distributions with Given Marginals , 2002 .

[13]  David M. Blei,et al.  Variational Inference: A Review for Statisticians , 2016, ArXiv.

[14]  David Duvenaud,et al.  Sticking the Landing: Simple, Lower-Variance Gradient Estimators for Variational Inference , 2017, NIPS.

[15]  Dustin Tran,et al.  Automatic Differentiation Variational Inference , 2016, J. Mach. Learn. Res..

[16]  Manfred Opper,et al.  The Variational Gaussian Approximation Revisited , 2009, Neural Computation.

[17]  A. Azzalini,et al.  The multivariate skew-normal distribution , 1996 .

[18]  David Barber,et al.  Gaussian Kullback-Leibler approximate inference , 2013, J. Mach. Learn. Res..

[19]  Todd C. Headrick,et al.  Parametric Probability Densities and Distribution Functions for Tukey g -and- h Transformations and their Use for Fitting Data , 2008 .

[20]  Richard Gerlach,et al.  Estimating Quantile Families of Loss Distributions for Non-Life Insurance Modelling via L-Moments , 2016 .

[21]  D. Nott,et al.  Gaussian Variational Approximation With a Factor Covariance Structure , 2017, Journal of Computational and Graphical Statistics.

[22]  Jared S. Murray,et al.  Bayesian Gaussian Copula Factor Models for Mixed Data , 2011, Journal of the American Statistical Association.

[23]  A. Azzalini,et al.  Distributions generated by perturbation of symmetry with emphasis on a multivariate skew t‐distribution , 2003, 0911.2342.

[24]  Marc G. Genton,et al.  Skew-elliptical distributions and their applications : a journey beyond normality , 2004 .

[25]  M. Smith,et al.  Variational Bayes Estimation of Discrete-Margined Copula Models With Application to Time Series , 2017, Journal of Computational and Graphical Statistics.

[26]  P. X. Song,et al.  Multivariate Dispersion Models Generated From Gaussian Copula , 2000 .

[27]  Rodney X. Sturdivant,et al.  Applied Logistic Regression: Hosmer/Applied Logistic Regression , 2005 .

[28]  Dong Hwan Oh,et al.  Modeling Dependence in High Dimensions With Factor Copulas , 2015 .

[29]  Daan Wierstra,et al.  Stochastic Backpropagation and Approximate Inference in Deep Generative Models , 2014, ICML.

[30]  A. McNeil,et al.  The t Copula and Related Copulas , 2005 .

[31]  Richard A. Johnson,et al.  A new family of power transformations to improve normality or symmetry , 2000 .

[32]  Tim Salimans,et al.  Fixed-Form Variational Posterior Approximation through Stochastic Linear Regression , 2012, ArXiv.