Variational (Gradient) Estimate of the Score Function in Energy-based Latent Variable Models

Learning and evaluating energy-based latent variable models (EBLVMs) without structural assumptions is highly challenging, because the true posteriors and the partition functions of such models are generally intractable. This paper presents variational estimates of the score function and of its gradient with respect to the model parameters in a general EBLVM, referred to as VaES and VaGES respectively. The variational posterior is trained to minimize a certain divergence to the true model posterior, and the bias of both estimates can be bounded theoretically by this divergence. Under a minimal model assumption, VaES and VaGES can be applied to kernelized Stein discrepancy (KSD)- and score matching (SM)-based methods for learning EBLVMs. In addition, VaES can be used to estimate the exact Fisher divergence between the data and general EBLVMs.
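To make the setting concrete, here is a minimal sketch of the identity underlying the score estimate, written in our own notation (the paper's exact formulation may differ). Write the EBLVM over visible variables v and latent variables h as p_\theta(v, h) = \exp(-E_\theta(v, h)) / Z(\theta). Since the partition function Z(\theta) does not depend on v, the score of the marginal satisfies

    \nabla_v \log p_\theta(v)
      = \mathbb{E}_{p_\theta(h \mid v)}\left[ \nabla_v \log p_\theta(v, h) \right]
      = -\mathbb{E}_{p_\theta(h \mid v)}\left[ \nabla_v E_\theta(v, h) \right].

Replacing the intractable posterior p_\theta(h \mid v) with a learned variational posterior q_\phi(h \mid v), and averaging -\nabla_v E_\theta(v, h) over samples h \sim q_\phi(h \mid v), yields a tractable Monte Carlo estimate of the score. The bias introduced by this substitution is controlled by the divergence between q_\phi(h \mid v) and p_\theta(h \mid v), which is exactly the quantity the variational posterior is trained to minimize.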
