Structured Stein Variational Inference for Continuous Graphical Models

We propose a novel distributed inference algorithm for continuous graphical models by extending Stein variational gradient descent (SVGD) to leverage the Markov dependency structure of the distribution of interest. The idea is to use a set of local kernel functions over the Markov blanket of each node, which alleviates the problem of the curse of high dimensionality and simultaneously yields a distributed algorithm for decentralized inference tasks. We justify our method with theoretical analysis and show that the use of local kernels can be viewed as a new type of localized approximation that matches the target distribution on the conditional distributions of each node over its Markov blanket. Our empirical results demonstrate that our method outperforms a variety of baselines including standard MCMC and particle message passing methods.

[1]  William T. Freeman,et al.  Nonparametric belief propagation , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[2]  Bernhard Schölkopf,et al.  A Kernel Two-Sample Test , 2012, J. Mach. Learn. Res..

[3]  Andrew Gelman,et al.  The No-U-turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo , 2011, J. Mach. Learn. Res..

[4]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[5]  William T. Freeman,et al.  Understanding belief propagation and its generalizations , 2003 .

[6]  John W. Fisher,et al.  Nonparametric belief propagation for self-localization of sensor networks , 2005, IEEE Journal on Selected Areas in Communications.

[7]  Michael J. Black,et al.  Preserving Modes and Messages via Diverse Particle Selection , 2014, ICML.

[8]  Qiang Liu,et al.  A Kernelized Stein Discrepancy for Goodness-of-fit Tests , 2016, ICML.

[9]  Arthur Gretton,et al.  A Kernel Test of Goodness of Fit , 2016, ICML.

[10]  Dilin Wang,et al.  Stein Variational Gradient Descent: A General Purpose Bayesian Inference Algorithm , 2016, NIPS.

[11]  Qiang Liu,et al.  Scoring Workers in Crowdsourcing: How Many Control Questions are Enough? , 2013, NIPS.

[12]  David A. McAllester,et al.  Particle Belief Propagation , 2009, AISTATS.

[13]  Andrew W. Fitzgibbon,et al.  PMBP: PatchMatch Belief Propagation for Correspondence Field Estimation , 2014, International Journal of Computer Vision.

[14]  Sebastian Fischer,et al.  Exploring Artificial Intelligence In The New Millennium , 2016 .

[15]  Michael I. Jordan Graphical Models , 2003 .

[16]  Erik B. Sudderth,et al.  Proteins, Particles, and Pseudo-Max-Marginals: A Submodular Approach , 2015, ICML.

[17]  Michael I. Jordan,et al.  Graphical Models, Exponential Families, and Variational Inference , 2008, Found. Trends Mach. Learn..

[18]  Le Song,et al.  Kernel Belief Propagation , 2011, AISTATS.

[19]  Eva Chen,et al.  Identifying Expertise to Extract the Wisdom of Crowds , 2015, Manag. Sci..

[20]  Pravin K. Trivedi,et al.  Copula Modeling: An Introduction for Practitioners , 2007 .

[21]  Anthony Widjaja,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2003, IEEE Transactions on Neural Networks.

[22]  John W. Fisher,et al.  Efficient Observation Selection in Probabilistic Graphical Models Using Bayesian Lower Bounds , 2016, UAI.

[23]  Bernhard Schölkopf,et al.  Kernel Mean Embedding of Distributions: A Review and Beyonds , 2016, Found. Trends Mach. Learn..

[24]  A. Berlinet,et al.  Reproducing kernel Hilbert spaces in probability and statistics , 2004 .

[25]  Lester W. Mackey,et al.  Measuring Sample Quality with Kernels , 2017, ICML.