论文信息 - IB-GAN: Disentangled Representation Learning with Information Bottleneck Generative Adversarial Networks - 字舞流文

IB-GAN: Disentangled Representation Learning with Information Bottleneck Generative Adversarial Networks

We propose a new GAN-based unsupervised model for disentangled representation learning. The new model is discovered in an attempt to utilize the Information Bottleneck (IB) framework to the optimization of GAN, thereby named IB-GAN. The architecture of IB-GAN is partially similar to that of InfoGAN but has a critical difference; an intermediate layer of the generator is leveraged to constrain the mutual information between the input and the generated output. The intermediate stochastic layer can serve as a learnable latent distribution that is trained with the generator jointly in an end-to-end fashion. As a result, the generator of IB-GAN can harness the latent space in a disentangled and interpretable manner. With the experiments on dSprites and Color-dSprites dataset, we demonstrate that IB-GAN achieves competitive disentanglement scores to those of state-of-the-art β-VAEs and outperforms InfoGAN. Moreover, the visual quality and the diversity of samples generated by IB-GAN are often better than those by β-VAEs and Info-GAN in terms of FID score on CelebA and 3D Chairs dataset.

Myeongjang Pyeon | Gunhee Kim | Wonkwang Lee | In S. Jeon | Insu Jeon | Gunhee Kim | Wonkwang Lee | Myeongjang Pyeon

[1] Yuting Zhang,et al. Learning to Disentangle Factors of Variation with Manifold Interaction , 2014, ICML.

[2] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.

[3] Rishi Sharma,et al. A Note on the Inception Score , 2018, ArXiv.

[4] Alexei A. Efros,et al. Seeing 3D Chairs: Exemplar Part-Based 2D-3D Alignment Using a Large Dataset of CAD Models , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[5] Christopher Burgess,et al. DARLA: Improving Zero-Shot Transfer in Reinforcement Learning , 2017, ICML.

[6] Brendan J. Frey,et al. PixelGAN Autoencoders , 2017, NIPS.

[7] Michael I. Jordan,et al. Graphical Models, Exponential Families, and Variational Inference , 2008, Found. Trends Mach. Learn..

[8] David Barber,et al. The IM algorithm: a variational approach to Information Maximization , 2003, NIPS 2003.

[9] Stefano Soatto,et al. Information Dropout: Learning Optimal Representations Through Noisy Computation , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10] Murray Shanahan,et al. SCAN: Learning Hierarchical Compositional Visual Concepts , 2017, ICLR.

[11] Stefano Soatto,et al. Emergence of Invariance and Disentanglement in Deep Representations , 2017, 2018 Information Theory and Applications Workshop (ITA).

[12] Roger B. Grosse,et al. Isolating Sources of Disentanglement in Variational Autoencoders , 2018, NeurIPS.

[13] Alexander A. Alemi,et al. Deep Variational Information Bottleneck , 2017, ICLR.

[14] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15] Michael I. Jordan,et al. An Introduction to Variational Methods for Graphical Models , 1999, Machine Learning.

[16] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[17] Alexander A. Alemi,et al. GILBO: One Metric to Measure Them All , 2018, NeurIPS.

[18] Pascal Vincent,et al. Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19] Michael Satosi Watanabe,et al. Information Theoretical Analysis of Multivariate Correlation , 1960, IBM J. Res. Dev..

[20] Geoffrey E. Hinton,et al. Transforming Auto-Encoders , 2011, ICANN.

[21] S. Ermon,et al. The Information-Autoencoding Family: A Lagrangian Perspective on Latent Variable Generative Modeling , 2018 .

[22] Andriy Mnih,et al. Disentangling by Factorising , 2018, ICML.

[23] Sepp Hochreiter,et al. GANs Trained by a Two Time-Scale Update Rule Converge to a Nash Equilibrium , 2017, ArXiv.

[24] Sang Joon Kim,et al. A Mathematical Theory of Communication , 2006 .

[25] David Barber,et al. Kernelized Infomax Clustering , 2005, NIPS.

[26] Maneesh Kumar Singh,et al. Disentangling Factors of Variation with Cycle-Consistent Variational Auto-Encoders , 2018, ECCV.

[27] Naftali Tishby,et al. Deep learning and the information bottleneck principle , 2015, 2015 IEEE Information Theory Workshop (ITW).

[28] Christopher Burgess,et al. beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework , 2016, ICLR 2016.

[29] Xiaogang Wang,et al. Deep Learning Face Attributes in the Wild , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[30] Aaron C. Courville,et al. Adversarially Learned Inference , 2016, ICLR.

[31] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[32] Yee Whye Teh,et al. Disentangling Disentanglement in Variational Autoencoders , 2018, ICML.

[33] Karl Ridgeway,et al. A Survey of Inductive Biases for Factorial Representation-Learning , 2016, ArXiv.

[34] Max Welling,et al. Semi-supervised Learning with Deep Generative Models , 2014, NIPS.

[35] Yann LeCun,et al. Disentangling factors of variation in deep representation using adversarial training , 2016, NIPS.

[36] Pieter Abbeel,et al. InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets , 2016, NIPS.

[37] Guillaume Desjardins,et al. Understanding disentangling in β-VAE , 2018, ArXiv.

[38] Alexander A. Alemi,et al. Fixing a Broken ELBO , 2017, ICML.

[39] Naftali Tishby,et al. The information bottleneck method , 2000, ArXiv.

[40] Vighnesh Birodkar,et al. Unsupervised Learning of Disentangled Representations from Video , 2017, NIPS.

[41] Frank D. Wood,et al. Learning Disentangled Representations with Semi-Supervised Deep Generative Models , 2017, NIPS.

[42] Daan Wierstra,et al. Stochastic Backpropagation and Approximate Inference in Deep Generative Models , 2014, ICML.

[43] Charles A. Sutton,et al. VEEGAN: Reducing Mode Collapse in GANs using Implicit Variational Learning , 2017, NIPS.

[44] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[45] Trevor Darrell,et al. Adversarial Feature Learning , 2016, ICLR.

[46] Sergey Levine,et al. Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow , 2018, ICLR.

[47] Bernhard Schölkopf,et al. Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations , 2018, ICML.