Reducing the Amortization Gap in Variational Autoencoders: A Bayesian Random Function Approach

The variational autoencoder (VAE) is a highly successful generative model whose key element is the so-called amortized inference network, which performs test-time inference in a single feed-forward pass. Unfortunately, this speed comes at the cost of degraded accuracy in posterior approximation, often underperforming instance-wise variational optimization. Although recent semi-amortized approaches mitigate the issue by performing a few variational optimization updates starting from the VAE's amortized inference output, they inherently incur computational overhead for inference at test time. In this paper, we address the problem in a completely different way by considering a random inference model, where we model the mean and variance functions of the variational posterior as random Gaussian processes (GPs). The motivation is that the deviation of the VAE's amortized posterior distribution from the true posterior can be regarded as random noise, which allows us to account for the uncertainty in posterior approximation in a principled manner. In particular, our model can quantify the difficulty of approximating the posterior with a Gaussian variational density. Inference in our GP model requires only a single feed-forward pass through the network, which is significantly faster than semi-amortized methods. We show that our approach attains higher test-data likelihood than state-of-the-art methods on several benchmark datasets.
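
For concreteness, the quantities discussed above can be written with the standard VAE objective. The last line below is only a schematic paraphrase of the abstract's random-function view; the mean functions m and kernels k are notation introduced here for illustration, not taken from the paper.

```latex
% Amortized Gaussian posterior and the standard evidence lower bound (ELBO):
q_\phi(z \mid x) = \mathcal{N}\!\big(z;\, \mu_\phi(x),\, \mathrm{diag}(\sigma_\phi^2(x))\big),
\qquad
\mathcal{L}(x;\phi,\theta)
  = \mathbb{E}_{q_\phi(z \mid x)}\!\big[\log p_\theta(x \mid z)\big]
  - \mathrm{KL}\!\big(q_\phi(z \mid x)\,\|\,p(z)\big).

% Amortization gap: the loss from sharing one encoder \phi across all inputs
% instead of fitting a separate Gaussian q_{\lambda_x}(z) to each instance x:
\mathrm{Gap}(x) = \max_{\lambda_x} \mathcal{L}(x;\lambda_x,\theta)
                  - \mathcal{L}(x;\phi,\theta) \;\ge\; 0.

% Random-function view (schematic): the encoder's mean and variance functions
% receive Gaussian-process priors, so per-instance deviations from the
% amortized solution are treated as random noise:
\mu(\cdot) \sim \mathcal{GP}\big(m_\mu(\cdot),\, k_\mu(\cdot,\cdot)\big),
\qquad
\sigma(\cdot) \sim \mathcal{GP}\big(m_\sigma(\cdot),\, k_\sigma(\cdot,\cdot)\big).
```

Under this view, the posterior uncertainty over the functions mu(.) and sigma(.) is what provides the quantified difficulty of Gaussian posterior approximation mentioned in the abstract.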

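For contrast, the semi-amortized refinement that the proposed method avoids at test time can be sketched as below. This is a minimal illustrative sketch assuming a standard VAE with a Bernoulli decoder; encoder, decoder, the step count, and the learning rate are placeholders, not taken from the paper.

```python
# Illustrative sketch (assumed PyTorch VAE modules, not the paper's code):
# start from the amortized encoder output and take a few gradient steps on
# the per-instance negative ELBO, i.e. instance-wise variational refinement.
import torch
import torch.nn.functional as F


def refine_posterior(x, encoder, decoder, n_steps=5, lr=1e-2):
    # Amortized initialization: a single feed-forward pass through the encoder.
    mu, logvar = encoder(x)
    mu = mu.detach().requires_grad_(True)
    logvar = logvar.detach().requires_grad_(True)
    # Only the per-instance variational parameters are updated; the decoder
    # weights stay fixed during refinement.
    opt = torch.optim.Adam([mu, logvar], lr=lr)

    for _ in range(n_steps):
        eps = torch.randn_like(mu)
        z = mu + (0.5 * logvar).exp() * eps          # reparameterized sample
        logits = decoder(z)
        recon = F.binary_cross_entropy_with_logits(logits, x, reduction="sum")
        # KL divergence between N(mu, diag(exp(logvar))) and the standard normal prior.
        kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
        loss = recon + kl                             # negative ELBO
        opt.zero_grad()
        loss.backward()
        opt.step()

    return mu.detach(), logvar.detach()
```

Each test input thus requires several extra gradient evaluations of the decoder; this is the test-time overhead that the GP-based inference model described above removes by keeping inference to a single feed-forward pass.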