Quasi-symplectic Langevin Variational Autoencoder

The variational autoencoder (VAE) is one of the most thoroughly investigated generative models in current neural learning research. Applying VAEs to practical tasks with high-dimensional data and large datasets often runs into the difficulty of constructing a low-variance evidence lower bound (ELBO). Markov chain Monte Carlo (MCMC) is an effective approach to tightening the ELBO when approximating the posterior distribution. The Hamiltonian Variational Autoencoder (HVAE) is one such MCMC-inspired approach: it constructs an unbiased, low-variance ELBO that is also amenable to the reparameterization trick, and it significantly improves the quality of posterior estimation. However, a main drawback of HVAE is that its leapfrog integrator must evaluate the posterior gradient twice per step, which degrades inference efficiency and incurs a large GPU memory footprint. This flaw limits the applicability of Hamiltonian-based inference frameworks to large-scale networks. To tackle this problem, we propose a Quasi-symplectic Langevin Variational Autoencoder (Langevin-VAE), which substantially improves resource efficiency. We demonstrate, both qualitatively and quantitatively, the effectiveness of the Langevin-VAE compared to state-of-the-art gradient-informed inference frameworks.
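To illustrate the efficiency argument above, the following is a minimal NumPy sketch, not the paper's implementation: it contrasts one leapfrog step (as used in HVAE, requiring two gradient evaluations) with a damped quasi-symplectic Langevin step that needs only one. The toy Gaussian potential `grad_U`, the step size `eps`, and the `damping` parameter are illustrative assumptions.

```python
import numpy as np

def grad_U(z):
    # Toy potential: U(z) = 0.5 * ||z||^2 (standard Gaussian negative log-density),
    # so the gradient is simply z. A real VAE would use the negative log posterior.
    return z

def leapfrog_step(z, p, eps):
    """One leapfrog (velocity Verlet) step as in HVAE-style flows.

    Note the TWO gradient evaluations per step, which is the cost
    the Langevin scheme avoids.
    """
    p = p - 0.5 * eps * grad_U(z)   # first gradient evaluation (half kick)
    z = z + eps * p                 # drift
    p = p - 0.5 * eps * grad_U(z)   # second gradient evaluation (half kick)
    return z, p

def quasi_symplectic_langevin_step(z, p, eps, damping):
    """One damped (quasi-symplectic) Langevin-type step.

    A SINGLE gradient evaluation per transition: the momentum is first
    contracted by the friction factor exp(-damping * eps), then kicked once.
    """
    p = np.exp(-damping * eps) * p - eps * grad_U(z)  # one gradient evaluation
    z = z + eps * p                                   # drift
    return z, p
```

For the toy Gaussian potential, the leapfrog step conserves the Hamiltonian `0.5 * (||p||^2 + ||z||^2)` up to O(eps^2) error per step, while the Langevin step trades exact symplecticity for the halved gradient cost.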
