论文信息 - GANHopper: Multi-Hop GAN for Unsupervised Image-to-Image Translation

GANHopper: Multi-Hop GAN for Unsupervised Image-to-Image Translation

We introduce GANHopper, an unsupervised image-to-image translation network that transforms images gradually between two domains, through multiple hops. Instead of executing translation directly, we steer the translation by requiring the network to produce in-between images that resemble weighted hybrids between images from the input domains. Our network is trained on unpaired images from the two domains only, without any in-between images. All hops are produced using a single generator along each direction. In addition to the standard cycle-consistency and adversarial losses, we introduce a new hybrid discriminator, which is trained to classify the intermediate images produced by the generator as weighted hybrids, with weights based on a predetermined hop count. We also add a smoothness term to constrain the magnitude of each hop, further regularizing the translation. Compared to previous methods, GANHopper excels at image translations involving domain-specific image features and geometric variations while also preserving non-domain-specific features such as general color schemes.

[1] Dani Lischinski,et al. Cross-Domain Cascaded Deep Feature Translation , 2019, ArXiv.

[2] Young J. Kim,et al. Interactive generalized penetration depth computation for rigid and articulated models using object norm , 2014, ACM Trans. Graph..

[3] Dani Lischinski,et al. Neural best-buddies , 2018, ACM Trans. Graph..

[4] Guillaume Lample,et al. Fader Networks: Manipulating Images by Sliding Attributes , 2017, NIPS.

[5] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[6] Jing Liao,et al. CariGANs , 2018, ACM Trans. Graph..

[7] Jing Liao,et al. Automating Image Morphing Using Structural Similarity on a Halfway Domain , 2014, ACM Trans. Graph..

[8] Marc Christie,et al. Directing Cinematographic Drones , 2017, ACM Trans. Graph..

[9] Alexei A. Efros,et al. Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10] Daniel Cohen-Or,et al. LOGAN , 2019, ACM Trans. Graph..

[11] Alexei A. Efros,et al. The Unreasonable Effectiveness of Deep Features as a Perceptual Metric , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[12] Roger A. Pearce,et al. Large-Scale Deep Learning on the YFCC100M Dataset , 2015, ArXiv.

[13] Jan Kautz,et al. Multimodal Unsupervised Image-to-Image Translation , 2018, ECCV.

[14] Pieter Abbeel,et al. InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets , 2016, NIPS.

[15] Kwang In Kim,et al. Improving Shape Deformation in Unsupervised Image-to-Image Translation , 2018, ECCV.

[16] Jan Kautz,et al. High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[17] Sepp Hochreiter,et al. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium , 2017, NIPS.

[18] Xiaogang Wang,et al. Deep Learning Face Attributes in the Wild , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[19] Jinwoo Shin,et al. InstaGAN: Instance-aware Image-to-Image Translation , 2018, ICLR.

[20] Alexei A. Efros,et al. Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[21] George Papandreou,et al. Rethinking Atrous Convolution for Semantic Image Segmentation , 2017, ArXiv.

[22] Hyunsoo Kim,et al. Learning to Discover Cross-Domain Relations with Generative Adversarial Networks , 2017, ICML.

[23] Ping Tan,et al. DualGAN: Unsupervised Dual Learning for Image-to-Image Translation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[24] David W. Jacobs,et al. Dog Breed Classification Using Part Localization , 2012, ECCV.

[25] Trevor Darrell,et al. Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26] Alexei A. Efros,et al. Toward Multimodal Image-to-Image Translation , 2017, NIPS.

[27] Chen Qian,et al. TransGaGa: Geometry-Aware Unsupervised Image-To-Image Translation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[28] Lior Wolf,et al. Unsupervised Cross-Domain Image Generation , 2016, ICLR.

[29] Raymond Y. K. Lau,et al. Least Squares Generative Adversarial Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[30] Li Fei-Fei,et al. Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[31] Jan Kautz,et al. Unsupervised Image-to-Image Translation Networks , 2017, NIPS.