论文信息 - On Self Modulation for Generative Adversarial Networks

On Self Modulation for Generative Adversarial Networks

Training Generative Adversarial Networks (GANs) is notoriously challenging. We propose and study an architectural modification, self-modulation, which improves GAN performance across different data sets, architectures, losses, regularizers, and hyperparameter settings. Intuitively, self-modulation allows the intermediate feature maps of a generator to change as a function of the input noise vector. While reminiscent of other conditioning techniques, it requires no labeled data. In a large-scale empirical study we observe a relative decrease of $5\%-35\%$ in FID. Furthermore, all else being equal, adding this modification to the generator leads to improved performance in $124/144$ ($86\%$) of the studied settings. Self-modulation is a simple architectural change that requires no additional parameter tuning, which suggests that it can be applied readily to any GAN.

[1] Yann Dauphin,et al. Convolutional Sequence to Sequence Learning , 2017, ICML.

[2] Alexei A. Efros,et al. Context Encoders: Feature Learning by Inpainting , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3] Sepp Hochreiter,et al. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium , 2017, NIPS.

[4] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[5] Yinda Zhang,et al. LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop , 2015, ArXiv.

[6] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[8] Gang Sun,et al. Squeeze-and-Excitation Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[9] Takeru Miyato,et al. cGANs with Projection Discriminator , 2018, ICLR.

[10] Lantao Yu,et al. Understanding the Effectiveness of Lipschitz Constraint in Training of GANs via Gradient Analysis , 2018, ArXiv.

[11] Olivier Bachem,et al. Assessing Generative Models via Precision and Recall , 2018, NeurIPS.

[12] Han Zhang,et al. Self-Attention Generative Adversarial Networks , 2018, ICML.

[13] Hugo Larochelle,et al. Modulating early visual processing by language , 2017, NIPS.

[14] Yoshua Bengio,et al. Dynamic Layer Normalization for Adaptive Neural Acoustic Modeling in Speech Recognition , 2017, INTERSPEECH.

[15] Eirikur Agustsson,et al. Deep Generative Models for Distribution-Preserving Lossy Compression , 2018, NeurIPS.

[16] Rishi Sharma,et al. A Note on the Inception Score , 2018, ArXiv.

[17] Xiaohua Zhai,et al. The GAN Landscape: Losses, Architectures, Regularization, and Normalization , 2018, ArXiv.

[18] Christian Ledig,et al. Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19] Aaron C. Courville,et al. Improved Training of Wasserstein GANs , 2017, NIPS.

[20] Yuichi Yoshida,et al. Spectral Normalization for Generative Adversarial Networks , 2018, ICLR.

[21] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[22] Léon Bottou,et al. Wasserstein Generative Adversarial Networks , 2017, ICML.

[23] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[24] Alex Graves,et al. Conditional Image Generation with PixelCNN Decoders , 2016, NIPS.

[25] Mario Lucic,et al. Are GANs Created Equal? A Large-Scale Study , 2017, NeurIPS.

[26] Wojciech Zaremba,et al. Improved Techniques for Training GANs , 2016, NIPS.

[27] Simon Osindero,et al. Conditional Generative Adversarial Nets , 2014, ArXiv.

[28] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[29] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30] Aaron C. Courville,et al. FiLM: Visual Reasoning with a General Conditioning Layer , 2017, AAAI.

[31] Raymond Y. K. Lau,et al. Least Squares Generative Adversarial Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[32] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[33] Jonathon Shlens,et al. Conditional Image Synthesis with Auxiliary Classifier GANs , 2016, ICML.

[34] 拓海杉山,et al. “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告 , 2017 .

[35] Andrea Vedaldi,et al. Instance Normalization: The Missing Ingredient for Fast Stylization , 2016, ArXiv.

[36] Colin Raffel,et al. Is Generator Conditioning Causally Related to GAN Performance? , 2018, ICML.

[37] Jonathon Shlens,et al. A Learned Representation For Artistic Style , 2016, ICLR.

[38] Jaakko Lehtinen,et al. Progressive Growing of GANs for Improved Quality, Stability, and Variation , 2017, ICLR.

[39] Dimitris N. Metaxas,et al. StackGAN: Text to Photo-Realistic Image Synthesis with Stacked Generative Adversarial Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).