EqGAN: Feature Equalization Fusion for Few-shot Image Generation

Due to the absence of fine structure and texture information, existing fusion-based few-shot image generation methods suffer from unsatisfactory generation quality and diversity. To address this problem, we propose a novel feature Equalization fusion Generative Adversarial Network (EqGAN) for few-shot image generation. Unlike existing fusion strategies that rely on either deep features or local representations, we design two separate branches to fuse structures and textures by disentangling encoded features into shallow and deep contents. To refine image contents at all feature levels, we equalize the fused structure and texture semantics at different scales and supplement the decoder with richer information by skip connections. Since the fused structures and textures may be inconsistent with each other, we devise a consistent equalization loss between the equalized features and the intermediate output of the decoder to further align the semantics. Comprehensive experiments on three public datasets demonstrate that, EqGAN not only significantly improves generation performance with FID score (by up to 32.7%) and LPIPS score (by up to 4.19%), but also outperforms the state-of-the-arts in terms of accuracy (by up to 1.97%) for downstream classification tasks.

[1]  Ziqiu Chi,et al.  WaveGAN: Frequency-aware GAN for High-Fidelity Few-shot Image Generation , 2022, ECCV.

[2]  Jae-Pil Heo,et al.  Local Attention Pyramid for Scene Image Generation , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Yisheng Song,et al.  A Comprehensive Survey of Few-shot Learning: Evolution, Applications, Challenges, and Opportunities , 2022, ACM Comput. Surv..

[4]  Henghui Ding,et al.  A Closer Look at Few-shot Image Generation , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Bo Dai,et al.  Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited Data , 2021, NeurIPS.

[6]  Yang Gao,et al.  LoFGAN: Fusing Local Representations for Few-shot Image Generation , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[7]  A. Rajagopalan,et al.  Distillation-guided Image Inpainting , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[8]  Zhaorui Gu,et al.  Image Harmonization with Transformer , 2021, IEEE International Conference on Computer Vision.

[9]  Hong-Han Shuai,et al.  Gradient Normalization for Generative Adversarial Networks , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[10]  Hongyu Yang,et al.  Image Inpainting via Conditional Texture and Structure Dual Generation , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[11]  Zhe L. Lin,et al.  SSH: A Self-Supervised Framework for Image Harmonization , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[12]  Hao Tang,et al.  Recurrent Mask Refinement for Few-Shot Medical Image Segmentation , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[13]  Liqing Zhang,et al.  Making Images Real Again: A Comprehensive Survey on Deep Image Composition , 2021, ArXiv.

[14]  Bogdan Raducanu,et al.  TransferI2I: Transfer Learning for Image-to-Image Translation from Small Datasets , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[15]  Hung-Yu Tseng,et al.  Regularizing Generative Adversarial Networks under Limited Data , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Yizhe Zhu,et al.  Towards Faster and Stabilized GAN Training for High-fidelity Few-shot Image Synthesis , 2021, ICLR.

[17]  Liqing Zhang,et al.  DeltaGAN: Towards Diverse Few-shot Image Generation with Sample-Specific Delta , 2020, ECCV.

[18]  Xinhang Song,et al.  Dataset Bias in Few-Shot Image Recognition , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Yan Hong,et al.  F2GAN: Fusing-and-Filling GAN for Few-shot Image Generation , 2020, ACM Multimedia.

[20]  Daniel Cohen-Or,et al.  Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Di Huang,et al.  Multi-Scale Positive Sample Refinement for Few-Shot Object Detection , 2020, ECCV.

[22]  Song Han,et al.  Differentiable Augmentation for Data-Efficient GAN Training , 2020, NeurIPS.

[23]  Youngjung Uh,et al.  Rethinking the Truly Unsupervised Image-to-Image Translation , 2020, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[24]  Timothy M. Hospedales,et al.  Meta-Learning in Neural Networks: A Survey , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Yan Hong,et al.  Matchinggan: Matching-Based Few-Shot Image Generation , 2020, 2020 IEEE International Conference on Multimedia and Expo (ICME).

[26]  Zixuan Liu,et al.  DAWSON: A Domain Adaptive Few Shot Generation Framework , 2020, ArXiv.

[27]  Liqing Zhang,et al.  DoveNet: Deep Image Harmonization via Domain Verification , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Jinhui Tang,et al.  Few-Shot Image Recognition With Knowledge Transfer , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[29]  Jiashi Feng,et al.  PANet: Few-Shot Image Semantic Segmentation With Prototype Alignment , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[30]  Jaakko Lehtinen,et al.  Few-Shot Unsupervised Image-to-Image Translation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[31]  James T. Kwok,et al.  Generalizing from a Few Examples , 2019, ACM Comput. Surv..

[32]  Louis Clouâtre,et al.  FIGR: Few-shot Image Generation with Reptile , 2019, ArXiv.

[33]  Timo Aila,et al.  A Style-Based Generator Architecture for Generative Adversarial Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Xin Wang,et al.  Few-Shot Object Detection via Feature Reweighting , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[35]  Jeff Donahue,et al.  Large Scale GAN Training for High Fidelity Natural Image Synthesis , 2018, ICLR.

[36]  Dmitry P. Vetrov,et al.  Few-shot Generative Modelling with Generative Matching Networks , 2018, AISTATS.

[37]  Joshua Achiam,et al.  On First-Order Meta-Learning Algorithms , 2018, ArXiv.

[38]  Alexei A. Efros,et al.  The Unreasonable Effectiveness of Deep Features as a Perceptual Metric , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[39]  Amos J. Storkey,et al.  Data Augmentation Generative Adversarial Networks , 2017, ICLR 2018.

[40]  Omkar M. Parkhi,et al.  VGGFace2: A Dataset for Recognising Faces across Pose and Age , 2017, 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018).

[41]  Sepp Hochreiter,et al.  GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium , 2017, NIPS.

[42]  Richard S. Zemel,et al.  Prototypical Networks for Few-shot Learning , 2017, NIPS.

[43]  Sergey Levine,et al.  Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.

[44]  Jonathon Shlens,et al.  Conditional Image Synthesis with Auxiliary Classifier GANs , 2016, ICML.

[45]  Carl Doersch,et al.  Tutorial on Variational Autoencoders , 2016, ArXiv.

[46]  Oriol Vinyals,et al.  Matching Networks for One Shot Learning , 2016, NIPS.

[47]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[48]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[49]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[50]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[51]  Andrew Zisserman,et al.  Automated Flower Classification over a Large Number of Classes , 2008, 2008 Sixth Indian Conference on Computer Vision, Graphics & Image Processing.