论文信息 - DualGAN: Unsupervised Dual Learning for Image-to-Image Translation

DualGAN: Unsupervised Dual Learning for Image-to-Image Translation

Conditional Generative Adversarial Networks (GANs) for cross-domain image-to-image translation have made much progress recently [7, 8, 21, 12, 4, 18]. Depending on the task complexity, thousands to millions of labeled image pairs are needed to train a conditional GAN. However, human labeling is expensive, even impractical, and large quantities of data may not always be available. Inspired by dual learning from natural language translation [23], we develop a novel dual-GAN mechanism, which enables image translators to be trained from two sets of unlabeled images from two domains. In our architecture, the primal GAN learns to translate images from domain U to those in domain V, while the dual GAN learns to invert the task. The closed loop made by the primal and dual tasks allows images from either domain to be translated and then reconstructed. Hence a loss function that accounts for the reconstruction error of images can be used to train the translators. Experiments on multiple image translation tasks with unlabeled data show considerable performance gain of DualGAN over a single GAN. For some tasks, DualGAN can even achieve comparable or slightly better results than conditional GAN trained on fully labeled data.

[1] G. G. Stokes. "J." , 1890, The New Yale Book of Quotations.

[2] Edward H. Adelson,et al. Material perception: What can you see in a brief glance? , 2010 .

[3] Xiaogang Wang,et al. Coupled information-theoretic encoding for face photo-sketch recognition , 2011, CVPR 2011.

[4] P. Cochat,et al. Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[5] Amit R.Sharma,et al. Face Photo-Sketch Synthesis and Recognition , 2012 .

[6] Radim Sára,et al. Spatial Pattern Templates for Recognition of Objects with Regular Structure , 2013, GCPR.

[7] Simon Osindero,et al. Conditional Generative Adversarial Nets , 2014, ArXiv.

[8] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[9] Xiaofeng Tao,et al. Transient attributes for high-level understanding and editing of outdoor scenes , 2014, ACM Trans. Graph..

[10] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[11] Chuan Li,et al. Precomputed Real-Time Texture Synthesis with Markovian Generative Adversarial Networks , 2016, ECCV.

[12] Yann LeCun,et al. Deep multi-scale video prediction beyond mean square error , 2015, ICLR.

[13] Tie-Yan Liu,et al. Dual Learning for Machine Translation , 2016, NIPS.

[14] Bogdan Raducanu,et al. Invertible Conditional GANs for image editing , 2016, ArXiv.

[15] Honglak Lee,et al. Attribute2Image: Conditional Image Generation from Visual Attributes , 2015, ECCV.

[16] Ming-Yu Liu,et al. Coupled Generative Adversarial Networks , 2016, NIPS.

[17] Bernt Schiele,et al. Generative Adversarial Text to Image Synthesis , 2016, ICML.

[18] Abhinav Gupta,et al. Generative Image Modeling Using Style and Structure Adversarial Networks , 2016, ECCV.

[19] Ole Winther,et al. Autoencoding beyond pixels using a learned similarity metric , 2015, ICML.

[20] Lior Wolf,et al. Unsupervised Cross-Domain Image Generation , 2016, ICLR.

[21] Léon Bottou,et al. Wasserstein GAN , 2017, ArXiv.

[22] Trevor Darrell,et al. Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23] Alexei A. Efros,et al. Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24] Jan Kautz,et al. Unsupervised Image-to-Image Translation Networks , 2017, NIPS.

[25] Christian Ledig,et al. Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26] 拓海杉山,et al. “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告 , 2017 .

[27] Antonio Torralba,et al. Cross-Modal Scene Networks , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.