Improving the Improved Training of Wasserstein GANs: A Consistency Term and Its Dual Effect

Despite their impact on a wide variety of problems and applications, generative adversarial nets (GANs) are notoriously difficult to train. This issue is formally analyzed by \cite{arjovsky2017towards}, who also propose an alternative direction that avoids the caveats of the minimax two-player training of GANs. The resulting algorithm, the Wasserstein GAN (WGAN), hinges on the 1-Lipschitz continuity of the discriminator. In this paper, we propose a novel approach to enforcing the Lipschitz continuity during WGAN training. Our approach seamlessly connects WGAN with one of the recent semi-supervised learning methods. As a result, it yields not only more photo-realistic samples than the previous methods but also state-of-the-art semi-supervised learning results. In particular, our approach reaches an inception score of more than 5.0 with only 1,000 CIFAR-10 images and, to the best of our knowledge, is the first to exceed 90% accuracy on CIFAR-10 using only 4,000 labeled images.
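
To make the idea concrete, below is a minimal PyTorch-style sketch of a WGAN critic loss that combines the usual Wasserstein objective and gradient penalty with an additional consistency-style regularizer evaluated near the real data. The function name critic_loss, the weights lambda_gp and lambda_ct, and the margin M are illustrative, and the particular form of the consistency term shown here is a simplification under our own assumptions rather than the paper's precise formulation; it only assumes the critic is stochastic (e.g., it uses dropout), so that two forward passes on the same batch differ slightly.

```python
import torch

def critic_loss(critic, real, fake, lambda_gp=10.0, lambda_ct=2.0, M=0.0):
    fake = fake.detach()  # critic update: no gradients flow to the generator

    # Wasserstein estimate: the critic is trained to score real samples
    # higher than generated ones, so we minimize E[D(fake)] - E[D(real)].
    loss_w = critic(fake).mean() - critic(real).mean()

    # Gradient penalty on random interpolates between real and fake batches,
    # pushing the critic's gradient norm toward 1 (the WGAN-GP recipe).
    eps = torch.rand(real.size(0), 1, 1, 1, device=real.device)
    interp = (eps * real + (1.0 - eps) * fake).requires_grad_(True)
    grad, = torch.autograd.grad(critic(interp).sum(), interp, create_graph=True)
    gp = ((grad.flatten(1).norm(2, dim=1) - 1.0) ** 2).mean()

    # Consistency-style term near the real data manifold: two stochastic
    # evaluations of the critic on the same real batch should not differ
    # by more than a margin M; anything beyond that is penalized.
    d1, d2 = critic(real), critic(real)
    ct = torch.clamp(((d1 - d2) ** 2).mean() - M, min=0.0)

    return loss_w + lambda_gp * gp + lambda_ct * ct
```

Under this sketch, the generator is updated as in a standard WGAN, by minimizing the negative mean critic score on generated samples; only the critic objective carries the extra consistency penalty.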

[1] Léon Bottou, et al. Towards Principled Methods for Training Generative Adversarial Networks, 2017, ICLR.

[2] Timo Aila, et al. Temporal Ensembling for Semi-Supervised Learning, 2016, ICLR.

[3] Wojciech Zaremba, et al. Improved Techniques for Training GANs, 2016, NIPS.

[4] Li Fei-Fei, et al. ImageNet: A large-scale hierarchical image database, 2009, CVPR.

[5] John E. Hopcroft, et al. Stacked Generative Adversarial Networks, 2017, CVPR.

[6] Richard Nock, et al. f-GANs in an Information Geometric Nutshell, 2017, NIPS.

[7] David Berthelot, et al. BEGAN: Boundary Equilibrium Generative Adversarial Networks, 2017, arXiv.

[8] C. Villani. Optimal Transport: Old and New, 2008.

[9] Daan Wierstra, et al. Stochastic Backpropagation and Approximate Inference in Deep Generative Models, 2014, ICML.

[10] Yoshua Bengio, et al. Generative Adversarial Nets, 2014, NIPS.

[11] Alexei A. Efros, et al. Image-to-Image Translation with Conditional Adversarial Networks, 2017, CVPR.

[12] Jun Zhu, et al. Triple Generative Adversarial Nets, 2017, NIPS.

[13] Harri Valpola, et al. Weight-averaged consistency targets improve semi-supervised deep learning results, 2017, arXiv.

[14] Dilin Wang, et al. Learning to Draw Samples: With Application to Amortized MLE for Generative Adversarial Learning, 2016, arXiv.

[15] Jacob Abernethy, et al. On Convergence and Stability of GANs, 2018.

[16] Aaron C. Courville, et al. Improved Training of Wasserstein GANs, 2017, NIPS.

[17] Yoshua Bengio, et al. Gradient-based learning applied to document recognition, 1998, Proc. IEEE.

[18] Richard S. Zemel, et al. Generative Moment Matching Networks, 2015, ICML.

[19] Yinda Zhang, et al. LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop, 2015, arXiv.

[20] Yiming Yang, et al. MMD GAN: Towards Deeper Understanding of Moment Matching Network, 2017, NIPS.

[21] Tapani Raiko, et al. Semi-supervised Learning with Ladder Networks, 2015, NIPS.

[22] Philip Bachman, et al. Calibrating Energy-based Generative Adversarial Networks, 2017, ICLR.

[23] Abhishek Kumar, et al. Improved Semi-supervised Learning with GANs using Manifold Invariances, 2017, NIPS.

[24] Jost Tobias Springenberg, et al. Unsupervised and Semi-supervised Learning with Categorical Generative Adversarial Networks, 2015, ICLR.

[25] Hui Jiang, et al. Generating images with recurrent adversarial networks, 2016, arXiv.

[26] Max Welling, et al. Auto-Encoding Variational Bayes, 2013, ICLR.

[27] Shin Ishii, et al. Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning, 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28] Max Welling, et al. Semi-supervised Learning with Deep Generative Models, 2014, NIPS.

[29] Ruslan Salakhutdinov, et al. On the Quantitative Analysis of Decoder-Based Generative Models, 2016, ICLR.

[30] Alexei A. Efros, et al. Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks, 2017, ICCV.

[31] Alex Krizhevsky, et al. Learning Multiple Layers of Features from Tiny Images, 2009.

[32] O. Chapelle, et al. (Eds.). Semi-Supervised Learning, 2006, MIT Press.

[33] Augustus Odena, et al. Semi-Supervised Learning with Generative Adversarial Networks, 2016, arXiv.

[34] Soumith Chintala, et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks, 2015, ICLR.

[35] Yoshua Bengio, et al. Improving Generative Adversarial Networks with Denoising Feature Matching, 2016, ICLR.

[36] Zoubin Ghahramani, et al. Training generative neural networks via Maximum Mean Discrepancy optimization, 2015, UAI.

[37] Oriol Vinyals, et al. Towards Principled Unsupervised Learning, 2015, arXiv.

[39] Xiaojin Zhu, et al. Semi-Supervised Learning, 2010, Encyclopedia of Machine Learning.

[40] Guo-Jun Qi, et al. Loss-Sensitive Generative Adversarial Networks on Lipschitz Densities, 2017, International Journal of Computer Vision.

[41] Jonathon Shlens, et al. Conditional Image Synthesis with Auxiliary Classifier GANs, 2016, ICML.

[42] Sebastian Nowozin, et al. f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization, 2016, NIPS.

[43] Rob Fergus, et al. Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks, 2015, NIPS.

[44] Aaron C. Courville, et al. Adversarially Learned Inference, 2016, ICLR.