论文信息 - Unsupervised Cross-Domain Image Generation

Unsupervised Cross-Domain Image Generation

We study the problem of transferring a sample in one domain to an analog sample in another domain. Given two related domains, S and T, we would like to learn a generative function G that maps an input sample from S to the domain T, such that the output of a given function f, which accepts inputs in either domains, would remain unchanged. Other than the function f, the training data is unsupervised and consist of a set of samples from each domain. The Domain Transfer Network (DTN) we present employs a compound loss function that includes a multiclass GAN loss, an f-constancy component, and a regularizing component that encourages G to map samples from T to themselves. We apply our method to visual domains including digits and face images and demonstrate its ability to generate convincing novel images of previously unseen entities, while preserving their identity.

[1] L. Rudin,et al. Nonlinear total variation based noise removal algorithms , 1992 .

[2] Paul Sambre,et al. Gilles Fauconnier & Mark Turner, " The way we think: conceptual blending and the mind's hidden complexities" , 2002 .

[3] G. Fauconnier,et al. The Way We Think: Conceptual Blending and the Mind''s Hidden Complexities. Basic Books , 2002 .

[4] Koby Crammer,et al. Analysis of Representations for Domain Adaptation , 2006, NIPS.

[5] Andrew Y. Ng,et al. Reading Digits in Natural Images with Unsupervised Feature Learning , 2011 .

[6] J. Collomosse,et al. State of the ‘Art’: A Taxonomy of Artistic Stylization Techniques for Images and Video (cid:63) , 2012 .

[7] Kilian Q. Weinberger,et al. Marginalized Denoising Autoencoders for Domain Adaptation , 2012, ICML.

[8] Tinne Tuytelaars,et al. Unsupervised Visual Domain Adaptation Using Subspace Alignment , 2013, 2013 IEEE International Conference on Computer Vision.

[9] Tobias Isenberg,et al. State of the "Art”: A Taxonomy of Artistic Stylization Techniques for Images and Video , 2013, IEEE Transactions on Visualization and Computer Graphics.

[10] Stefan Winkler,et al. A data-driven approach to cleaning large face datasets , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[11] Simon Osindero,et al. Conditional Generative Adversarial Nets , 2014, ArXiv.

[12] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[13] Ming Yang,et al. DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[14] Andrew Zisserman,et al. Deep Face Recognition , 2015, BMVC.

[15] Andrea Vedaldi,et al. Understanding deep image representations by inverting them , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[17] Shuo Yang,et al. From Facial Parts Responses to Face Detection: A Deep Learning Approach , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[18] François Laviolette,et al. Domain-Adversarial Training of Neural Networks , 2015, J. Mach. Learn. Res..

[19] Wojciech Zaremba,et al. Improved Techniques for Training GANs , 2016, NIPS.

[20] Xiaoou Tang,et al. Image Super-Resolution Using Deep Convolutional Networks , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21] Bernt Schiele,et al. Generative Adversarial Text to Image Synthesis , 2016, ICML.

[22] Leon A. Gatys,et al. Image Style Transfer Using Convolutional Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[24] Thomas Brox,et al. Generating Images with Perceptual Similarity Metrics based on Deep Networks , 2016, NIPS.

[25] Xinyun Chen. Under Review as a Conference Paper at Iclr 2017 Delving into Transferable Adversarial Ex- Amples and Black-box Attacks , 2016 .

[26] Li Fei-Fei,et al. Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[27] Mark Sandler,et al. Inverting face embeddings with convolutional neural networks , 2016, ArXiv.

[28] Andrea Vedaldi,et al. Texture Networks: Feed-forward Synthesis of Textures and Stylized Images , 2016, ICML.

[29] Andrew Brock,et al. Neural Photo Editing with Introspective Adversarial Networks , 2016, ICLR.

[30] Frank Gabel. Generative Adversarial Text-to-Image Synthesis , 2018 .