DynaGAN: Dynamic Few-shot Adaptation of GANs to Multiple Domains

Few-shot domain adaptation to multiple domains aims to learn a complex image distribution across multiple domains from only a few training images. A naïve solution is to train a separate model for each domain using existing few-shot domain-adaptation methods. Unfortunately, this approach requires computational resources that scale linearly with the number of target domains, in both memory and training time, and, more importantly, such separate models cannot exploit knowledge shared across the target domains. In this paper, we propose DynaGAN, a novel few-shot domain-adaptation method for multiple target domains. DynaGAN has an adaptation module, a hyper-network that dynamically adapts a pretrained GAN model to the multiple target domains. Hence, we can fully exploit the knowledge shared across target domains and avoid linearly scaling computational requirements. As adapting a large-scale GAN model remains computationally challenging, we design our adaptation module to be lightweight using rank-1 tensor decomposition. Finally, we propose a contrastive-adaptation loss suited to multi-domain few-shot adaptation. We validate the effectiveness of our method through extensive qualitative and quantitative evaluations.
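To make the rank-1 idea concrete, below is a minimal PyTorch sketch of how a hyper-network could modulate a frozen pretrained convolution with a domain-conditioned rank-1 update. This is an illustrative reading of the abstract, not the paper's actual implementation: the class and parameter names (`Rank1AdaptConv`, `to_u`, `to_v`, the embedding size) and the exact modulation form are assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Rank1AdaptConv(nn.Module):
    """Illustrative sketch: wraps a frozen pretrained conv layer and
    modulates its weight with a domain-conditioned rank-1 update."""

    def __init__(self, conv: nn.Conv2d, num_domains: int, embed_dim: int = 64):
        super().__init__()
        self.conv = conv
        for p in self.conv.parameters():
            p.requires_grad_(False)  # keep the source-domain weights fixed
        c_out, c_in = conv.out_channels, conv.in_channels
        # Tiny hyper-network: a domain embedding mapped to rank-1 factors (u, v).
        self.domain_embed = nn.Embedding(num_domains, embed_dim)
        self.to_u = nn.Linear(embed_dim, c_out)
        self.to_v = nn.Linear(embed_dim, c_in)

    def forward(self, x: torch.Tensor, domain_idx: torch.Tensor) -> torch.Tensor:
        e = self.domain_embed(domain_idx)        # (embed_dim,)
        u = self.to_u(e)                         # (c_out,)
        v = self.to_v(e)                         # (c_in,)
        # Rank-1 modulation: scale each (out, in) weight slice by 1 + u_i * v_j,
        # so the adapted weight stays close to the pretrained one by default.
        delta = 1.0 + torch.outer(u, v)          # (c_out, c_in)
        w = self.conv.weight * delta[:, :, None, None]
        return F.conv2d(x, w, self.conv.bias,
                        stride=self.conv.stride, padding=self.conv.padding)

# Usage (illustrative):
# layer = Rank1AdaptConv(nn.Conv2d(64, 128, 3, padding=1), num_domains=10)
# y = layer(torch.randn(1, 64, 32, 32), torch.tensor(3))
```

Under a scheme like this, only the domain embedding and the two small linear layers are trainable, so each extra target domain costs on the order of c_out + c_in extra parameters per layer rather than a full copy of the generator, which is how a hyper-network-based module can stay lightweight while serving multiple domains.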
