GILBO: One Metric to Measure Them All

We propose a simple, tractable lower bound on the mutual information contained in the joint generative density of any latent variable generative model: the GILBO (Generative Information Lower BOund). It offers a data-independent measure of the complexity of the learned latent variable description, giving the log of the effective description length. It is well-defined for both VAEs and GANs. We compute the GILBO for 800 GANs and VAEs, each trained on four datasets (MNIST, FashionMNIST, CIFAR-10, and CelebA), and discuss the results.
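As a sketch of the quantity the abstract describes, the bound takes the Barber–Agakov variational form the paper builds on; the notation e(z|x) for the tractable auxiliary encoder trained to maximize the bound is our shorthand here, not a quote from the abstract:

\[
  \mathrm{GILBO} \;\equiv\; \max_{e}\;
  \mathbb{E}_{p(z)\,p(x \mid z)}\!\left[ \log \frac{e(z \mid x)}{p(z)} \right]
  \;\le\; I(X; Z)
\]

Here p(z) is the model's prior and p(x|z) its generator (for a GAN) or decoder (for a VAE). Because the expectation runs over samples drawn from the model itself rather than from any dataset, the estimate is data-independent, and its tightness depends only on how well the auxiliary encoder e(z|x) approximates the model's true posterior p(z|x).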
