Paper information - Probabilistic Circuits for Variational Inference in Discrete Graphical Models

Probabilistic Circuits for Variational Inference in Discrete Graphical Models

Inference in discrete graphical models with variational methods is difficult because of the inability to re-parameterize gradients of the Evidence Lower Bound (ELBO). Many sampling-based methods have been proposed for estimating these gradients, but they suffer from high bias or variance. In this paper, we propose a new approach that leverages the tractability of probabilistic circuit models, such as Sum-Product Networks (SPNs), to compute ELBO gradients exactly (without sampling) for a certain class of densities. In particular, we show that selective-SPNs are suitable as an expressive variational distribution, and prove that when the log-density of the target model is a polynomial, the corresponding ELBO can be computed analytically. To scale to graphical models with thousands of variables, we develop an efficient and effective construction of selective-SPNs with size $O(kn)$, where $n$ is the number of variables and $k$ is an adjustable hyperparameter. We demonstrate our approach on three types of graphical models: Ising models, Latent Dirichlet Allocation, and factor graphs from the UAI Inference Competition. Selective-SPNs give a better lower bound than mean-field and structured mean-field, and are competitive with approximations that do not provide a lower bound, such as Loopy Belief Propagation and Tree-Reweighted Belief Propagation. Our results show that probabilistic circuits are promising tools for variational inference in discrete graphical models, as they combine tractability and expressivity.
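The claim that a polynomial log-density makes the ELBO analytically computable can be illustrated in the simplest possible setting. The sketch below is not the paper's selective-SPN construction: it assumes a fully factorized variational distribution (the mean-field baseline named in the abstract) and a small Ising model with spins in $\{-1, +1\}$, whose unnormalized log-density is a degree-2 polynomial. The function names and the dictionary format for the couplings are hypothetical, chosen only for this illustration. Because the expectation of each monomial factorizes under a product distribution, the ELBO has a closed form, which the brute-force enumeration confirms.

```python
import itertools
import math

def elbo_closed_form(J, h, p):
    """ELBO of a fully factorized q against an Ising model, without sampling.

    J: dict mapping (i, j) with i < j to a coupling weight (assumed format)
    h: list of unary fields
    p: list with p[i] = q(x_i = +1), each strictly between 0 and 1
    """
    m = [2.0 * pi - 1.0 for pi in p]  # E_q[x_i] for x_i in {-1, +1}
    # Each monomial's expectation factorizes because q is a product distribution.
    energy = sum(w * m[i] * m[j] for (i, j), w in J.items())
    energy += sum(hi * mi for hi, mi in zip(h, m))
    entropy = -sum(pi * math.log(pi) + (1 - pi) * math.log(1 - pi) for pi in p)
    return energy + entropy

def log_p_tilde(J, h, x):
    """Unnormalized log-density: a degree-2 polynomial in the spins."""
    pairwise = sum(w * x[i] * x[j] for (i, j), w in J.items())
    unary = sum(hi * xi for hi, xi in zip(h, x))
    return pairwise + unary

def elbo_brute_force(J, h, p):
    """Same ELBO by enumerating all 2^n states; also returns log Z."""
    elbo, z = 0.0, 0.0
    for x in itertools.product([-1, 1], repeat=len(h)):
        q = math.prod(pi if xi == 1 else 1 - pi for pi, xi in zip(p, x))
        lp = log_p_tilde(J, h, x)
        elbo += q * (lp - math.log(q))
        z += math.exp(lp)
    return elbo, math.log(z)

# Illustrative parameters (made up for this example).
J = {(0, 1): 0.5, (1, 2): -0.3, (0, 2): 0.2}
h = [0.1, -0.2, 0.0]
p = [0.6, 0.4, 0.5]

exact = elbo_closed_form(J, h, p)
enumerated, log_z = elbo_brute_force(J, h, p)
print(exact, enumerated, log_z)
```

The closed form costs time linear in the number of polynomial terms, while the enumeration is exponential in the number of variables. Since the closed form is differentiable in `p`, its gradient is also exact; the paper's contribution is extending this property to the more expressive selective-SPN family.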

Stefano Ermon | Andy Shih
