Dual Discriminator Generative Adversarial Nets

We propose in this paper a novel approach to tackle the problem of mode collapse encountered in generative adversarial networks (GANs). Our idea is intuitive yet proves highly effective in addressing key limitations of GANs. In essence, it combines the Kullback-Leibler (KL) and reverse KL divergences into a unified objective function, thereby exploiting the complementary statistical properties of these divergences to diversify the estimated density and capture multiple modes. We term our method dual discriminator generative adversarial nets (D2GAN). Unlike a standard GAN, D2GAN has two discriminators, and together with a generator it again forms a minimax game: one discriminator rewards high scores for samples from the data distribution, the other, conversely, favors data from the generator, and the generator produces data to fool both discriminators. We develop a theoretical analysis showing that, given the optimal discriminators, optimizing the generator of D2GAN reduces to minimizing both the KL and reverse KL divergences between the data distribution and the distribution induced by the generated data, hence effectively avoiding mode collapse. We conduct extensive experiments on synthetic and real-world large-scale datasets (MNIST, CIFAR-10, STL-10, ImageNet), comparing D2GAN against state-of-the-art GAN variants in comprehensive qualitative and quantitative evaluations. The experimental results demonstrate the competitive and often superior performance of our approach in generating high-quality, diverse samples over the baselines, as well as its ability to scale up to ImageNet.
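To make the two-discriminator minimax game concrete, below is a minimal training sketch assuming an objective of the form min_G max_{D1,D2} alpha*E[log D1(x)] - E[D1(G(z))] - E[D2(x)] + beta*E[log D2(G(z))], with positive-valued discriminators. The softplus output parameterisation, all architectures, the helper sample_real_batch, and the hyperparameter values are illustrative assumptions, not the paper's exact setup.

```python
# Hedged D2GAN training sketch (PyTorch). Everything below is an assumption-laden
# illustration of the two-discriminator game, not the authors' reference code.
import torch
import torch.nn as nn

latent_dim, data_dim, batch_size = 64, 2, 128
alpha, beta = 0.2, 0.1  # hypothetical weights on the KL / reverse-KL terms
eps = 1e-8              # numerical floor inside the logs

def mlp(in_dim, out_dim):
    return nn.Sequential(nn.Linear(in_dim, 128), nn.ReLU(),
                         nn.Linear(128, out_dim))

G = mlp(latent_dim, data_dim)                        # generator
D1 = nn.Sequential(mlp(data_dim, 1), nn.Softplus())  # should score real data highly
D2 = nn.Sequential(mlp(data_dim, 1), nn.Softplus())  # should score generated data highly

opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_d = torch.optim.Adam(list(D1.parameters()) + list(D2.parameters()), lr=2e-4)

def sample_real_batch(n):
    # Hypothetical stand-in for a real data loader: a 2-D Gaussian toy dataset.
    return torch.randn(n, data_dim)

for step in range(10_000):
    x = sample_real_batch(batch_size)
    x_fake = G(torch.randn(batch_size, latent_dim))

    # Discriminator step: ascend
    #   alpha*E[log D1(x)] - E[D1(G(z))] - E[D2(x)] + beta*E[log D2(G(z))]
    d_obj = (alpha * torch.log(D1(x) + eps).mean()
             - D1(x_fake.detach()).mean()
             - D2(x).mean()
             + beta * torch.log(D2(x_fake.detach()) + eps).mean())
    opt_d.zero_grad()
    (-d_obj).backward()  # minimise the negative to maximise the objective
    opt_d.step()

    # Generator step: descend -E[D1(G(z))] + beta*E[log D2(G(z))],
    # i.e. push D1's score on fakes up and D2's score on fakes down.
    x_fake = G(torch.randn(batch_size, latent_dim))
    g_loss = -D1(x_fake).mean() + beta * torch.log(D2(x_fake) + eps).mean()
    opt_g.zero_grad()
    g_loss.backward()
    opt_g.step()
```

Under this reading, the alpha and beta weights trade off the KL and reverse-KL contributions that the analysis attributes to D1 and D2 respectively; the softplus heads are one simple way to keep both discriminators positive so the log terms are well defined.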
