Paper Information - On Data Augmentation for GAN Training

On Data Augmentation for GAN Training

Recent successes in Generative Adversarial Networks (GANs) have affirmed the importance of using more data in GAN training. Yet it is expensive to collect data in many domains, such as medical applications. Data Augmentation (DA) has been applied in these applications. In this work, we first argue that the classical DA approach could mislead the generator into learning the distribution of the augmented data, which could be different from that of the original data. We then propose a principled framework, termed Data Augmentation Optimized for GAN (DAG), to enable the use of augmented data in GAN training to improve the learning of the original distribution. We provide theoretical analysis to show that using our proposed DAG aligns with the original GAN in minimizing the Jensen–Shannon (JS) divergence between the original distribution and the model distribution. Importantly, the proposed DAG effectively leverages the augmented data to improve the learning of the discriminator and generator. We conduct experiments applying DAG to different GAN models (unconditional GAN, conditional GAN, self-supervised GAN, and CycleGAN) using datasets of natural images and medical images. The results show that DAG achieves consistent and considerable improvements across these models. Furthermore, when DAG is used in some GAN models, the system establishes state-of-the-art Fréchet Inception Distance (FID) scores. Our code is available at https://github.com/tntrung/dag-gans.
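The two claims in the abstract (classical DA can shift the target distribution, while augmented data can still be used consistently with the JS objective) can be illustrated on a toy problem. The sketch below is not the authors' implementation. It assumes small discrete distributions with hypothetical values and uses a cyclic shift of states as a stand-in for an invertible augmentation such as a rotation. It relies on a general background fact, namely that the JS divergence is unchanged when the same invertible transformation is applied to both distributions.

```python
import numpy as np


def js_divergence(p, q):
    """Jensen-Shannon divergence (natural log) between two discrete distributions."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    m = 0.5 * (p + q)

    def kl(a, b):
        mask = a > 0  # entries with a == 0 contribute nothing to the sum
        return float(np.sum(a[mask] * np.log(a[mask] / b[mask])))

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


# Hypothetical "data" and "model" distributions over four discrete states.
p_data = np.array([0.7, 0.2, 0.1, 0.0])
p_model = np.array([0.4, 0.3, 0.2, 0.1])


def augment(p):
    """Toy invertible augmentation: cyclically shift the states by one."""
    return np.roll(p, 1)


# Classical DA: train on a 50/50 mix of original and augmented samples.
# The resulting target distribution is no longer the original one.
p_classical_da = 0.5 * p_data + 0.5 * augment(p_data)
js_shift = js_divergence(p_data, p_classical_da)

# Transforming both data and model with the same invertible map
# leaves the JS divergence unchanged.
js_original = js_divergence(p_data, p_model)
js_transformed = js_divergence(augment(p_data), augment(p_model))

print(f"JS(data, classical-DA target)    = {js_shift:.4f}")
print(f"JS(data, model)                  = {js_original:.4f}")
print(f"JS(T(data), T(model))            = {js_transformed:.4f}")
```

In this toy setting, a generator that perfectly matched the classical-DA target would still sit at a nonzero JS divergence from the original data, whereas comparing transformed data against equally transformed model samples measures the same divergence as the untransformed comparison.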
