论文信息 - Learning Unsupervised Cross-domain Image-to-Image Translation Using a Shared Discriminator

Learning Unsupervised Cross-domain Image-to-Image Translation Using a Shared Discriminator

Unsupervised image-to-image translation is used to transform images from a source domain to generate images in a target domain without using source-target image pairs. Promising results have been obtained for this problem in an adversarial setting using two independent GANs and attention mechanisms. We propose a new method that uses a single shared discriminator between the two GANs, which improves the overall efficacy. We assess the qualitative and quantitative results on image transfiguration, a cross-domain translation task, in a setting where the target domain shares similar semantics to the source domain. Our results indicate that even without adding attention mechanisms, our method performs at par with attention-based methods and generates images of comparable quality.

Rajiv Kumar | Rishabh Dabral | G. Sivakumar

[1] Nicu Sebe,et al. AttentionGAN: Unpaired Image-to-Image Translation Using Attention-Guided Generative Adversarial Networks , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[2] Eric P. Xing,et al. Generative Semantic Manipulation with Contrasting GAN , 2017, ArXiv.

[3] Steven C. H. Hoi,et al. Deep Learning for Image Super-Resolution: A Survey , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4] Ming-Yu Liu,et al. Coupled Generative Adversarial Networks , 2016, NIPS.

[5] Kaiqi Huang,et al. GP-GAN: Towards Realistic High-Resolution Image Blending , 2017, ACM Multimedia.

[6] Chao Yang,et al. Show, Attend, and Translate: Unsupervised Image Translation With Self-Regularization and Attention , 2018, IEEE Transactions on Image Processing.

[7] Tomas E. Ward,et al. Generative Adversarial Networks in Computer Vision , 2019, ACM Comput. Surv..

[8] Fariborz Taherkhani,et al. Attribute-Guided Coupled GAN for Cross-Resolution Face Recognition , 2019, 2019 IEEE 10th International Conference on Biometrics Theory, Applications and Systems (BTAS).

[9] Xiaohua Zhai,et al. The GAN Landscape: Losses, Architectures, Regularization, and Normalization , 2018, ArXiv.

[10] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.

[11] Hyunsoo Kim,et al. Learning to Discover Cross-Domain Relations with Generative Adversarial Networks , 2017, ICML.

[12] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13] Luc Van Gool,et al. Pose Guided Person Image Generation , 2017, NIPS.

[14] Jan Kautz,et al. Multimodal Unsupervised Image-to-Image Translation , 2018, ECCV.

[15] Antonio Torralba,et al. Generating Videos with Scene Dynamics , 2016, NIPS.

[16] Jan Kautz,et al. Unsupervised Image-to-Image Translation Networks , 2017, NIPS.

[17] Lior Wolf,et al. Unsupervised Cross-Domain Image Generation , 2016, ICLR.

[18] Dumitru Erhan,et al. Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19] Nicu Sebe,et al. Dual Generator Generative Adversarial Networks for Multi-Domain Image-to-Image Translation , 2018, ACCV.

[20] Sepp Hochreiter,et al. GANs Trained by a Two Time-Scale Update Rule Converge to a Nash Equilibrium , 2017, ArXiv.

[21] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[22] Alexei A. Efros,et al. Toward Multimodal Image-to-Image Translation , 2017, NIPS.

[23] Abdul Jabbar,et al. A Survey on Generative Adversarial Networks: Variants, Applications, and Training , 2020, ACM Comput. Surv..

[24] Tomas Pfister,et al. Learning from Simulated and Unsupervised Images through Adversarial Training , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[26] Wenhan Yang,et al. Attentive Generative Adversarial Network for Raindrop Removal from A Single Image , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[27] Jan Kautz,et al. Few-shot Video-to-Video Synthesis , 2019, NeurIPS.

[28] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[29] Jaakko Lehtinen,et al. Few-Shot Unsupervised Image-to-Image Translation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[30] Jung-Woo Ha,et al. StarGAN: Unified Generative Adversarial Networks for Multi-domain Image-to-Image Translation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[31] Xiaogang Wang,et al. Residual Attention Network for Image Classification , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32] 拓海杉山,et al. “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告 , 2017 .

[33] Jan Kautz,et al. Video-to-Video Synthesis , 2018, NeurIPS.

[34] Kwang In Kim,et al. Unsupervised Attention-guided Image to Image Translation , 2018, NeurIPS.

[35] Dimitris Kastaniotis,et al. Attention-Aware Generative Adversarial Networks (ATA-GANs) , 2018, 2018 IEEE 13th Image, Video, and Multidimensional Signal Processing Workshop (IVMSP).

[36] Christian Ledig,et al. Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37] Han Zhang,et al. Self-Attention Generative Adversarial Networks , 2018, ICML.

[38] Alexei A. Efros,et al. Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[39] Jan Kautz,et al. High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[40] Katerina Fragkiadaki,et al. Adversarial Inverse Graphics Networks: Learning 2D-to-3D Lifting and Image-to-Image Translation from Unpaired Supervision , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[41] Taesung Park,et al. Semantic Image Synthesis With Spatially-Adaptive Normalization , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[42] Arthur Gretton,et al. Demystifying MMD GANs , 2018, ICLR.

[43] Tom White,et al. Generative Adversarial Networks: An Overview , 2017, IEEE Signal Processing Magazine.

[44] Simon Osindero,et al. Conditional Generative Adversarial Nets , 2014, ArXiv.

[45] Li Fei-Fei,et al. Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[46] Ping Tan,et al. DualGAN: Unsupervised Dual Learning for Image-to-Image Translation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).