论文信息 - GibbsNet: Iterative Adversarial Inference for Deep Graphical Models

GibbsNet: Iterative Adversarial Inference for Deep Graphical Models

Directed latent variable models that formulate the joint distribution as $p(x,z) = p(z) p(x \mid z)$ have the advantage of fast and exact sampling. However, these models have the weakness of needing to specify $p(z)$, often with a simple fixed prior that limits the expressiveness of the model. Undirected latent variable models discard the requirement that $p(z)$ be specified with a prior, yet sampling from them generally requires an iterative procedure such as blocked Gibbs-sampling that may require many steps to draw samples from the joint distribution $p(x, z)$. We propose a novel approach to learning the joint distribution between the data and a latent code which uses an adversarially learned iterative procedure to gradually refine the joint distribution, $p(x, z)$, to better match with the data distribution on each step. GibbsNet is the best of both worlds both in theory and in practice. Achieving the speed and simplicity of a directed latent variable model, it is guaranteed (assuming the adversarial game reaches the virtual training criteria global minimum) to produce samples from $p(x, z)$ with only a few sampling iterations. Achieving the expressiveness and flexibility of an undirected latent variable model, GibbsNet does away with the need for an explicit $p(z)$ and has the ability to do attribute prediction, class-conditional generation, and joint image-attribute modeling in a single model which is not trained for any of these specific tasks. We show empirically that GibbsNet is able to learn a more complex $p(z)$ and show that this leads to improved inpainting and iterative refinement of $p(x, z)$ for dozens of steps and stable generation without collapse for thousands of steps, despite being trained on only a few steps.

[1] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[2] Yoshua Bengio,et al. Boundary-Seeking Generative Adversarial Networks , 2017, ICLR 2017.

[3] Aaron C. Courville,et al. Adversarially Learned Inference , 2016, ICLR.

[4] Geoffrey E. Hinton. A Practical Guide to Training Restricted Boltzmann Machines , 2012, Neural Networks: Tricks of the Trade.

[5] Yoshua Bengio,et al. Training opposing directed models using geometric mean matching , 2015, ArXiv.

[6] Wojciech Zaremba,et al. Improved Techniques for Training GANs , 2016, NIPS.

[7] Trevor Darrell,et al. Adversarial Feature Learning , 2016, ICLR.

[8] Ian J. Goodfellow,et al. NIPS 2016 Tutorial: Generative Adversarial Networks , 2016, ArXiv.

[9] Ole Winther,et al. Autoencoding beyond pixels using a learned similarity metric , 2015, ICML.

[10] Geoffrey E. Hinton,et al. Deep Boltzmann Machines , 2009, AISTATS.

[11] Shakir Mohamed,et al. Variational Inference with Normalizing Flows , 2015, ICML.

[12] Geoffrey E. Hinton,et al. The Helmholtz Machine , 1995, Neural Computation.

[13] Yoshua Bengio,et al. Professor Forcing: A New Algorithm for Training Recurrent Networks , 2016, NIPS.

[14] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[15] Ferenc Huszár,et al. Variational Inference using Implicit Distributions , 2017, ArXiv.

[16] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[17] Yoshua Bengio,et al. Extracting and composing robust features with denoising autoencoders , 2008, ICML '08.

[18] Yee Whye Teh,et al. A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[19] Yoshua Bengio,et al. Deep Generative Stochastic Networks Trainable by Backprop , 2013, ICML.

[20] Yoshua Bengio,et al. Better Mixing via Deep Representations , 2012, ICML.

[21] Matthias Bethge,et al. A note on the evaluation of generative models , 2015, ICLR.

[22] Andrew Y. Ng,et al. Reading Digits in Natural Images with Unsupervised Feature Learning , 2011 .

[23] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.

[24] Xiaogang Wang,et al. Deep Learning Face Attributes in the Wild , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[25] Stefano Ermon,et al. Generative Adversarial Learning of Markov Chains , 2017, International Conference on Learning Representations.

[26] Nebojsa Jojic,et al. Iterative Refinement of the Approximate Posterior for Directed Belief Networks , 2015, NIPS.

[27] Surya Ganguli,et al. Deep Unsupervised Learning using Nonequilibrium Thermodynamics , 2015, ICML.