论文信息 - GarmentGAN: Photo-realistic Adversarial Fashion Transfer

GarmentGAN: Photo-realistic Adversarial Fashion Transfer

The garment transfer problem comprises two tasks: learning to separate a person's body (pose, shape, color) from their clothing (garment type, shape, style) and then generating new images of the wearer dressed in arbitrary garments. We present GarmentGAN, a new algorithm that performs image-based garment transfer through generative adversarial methods. The GarmentGAN framework allows users to virtually try-on items before purchase and generalizes to various apparel types. GarmentGAN requires as input only two images, namely, a picture of the target fashion item and an image containing the customer. The output is a synthetic image wherein the customer is wearing the target apparel. In order to make the generated image look photo-realistic, we employ the use of novel generative adversarial techniques. GarmentGAN improves on existing methods in the realism of generated imagery and solves various problems related to self-occlusions. Our proposed model incorporates additional information during training, utilizing both segmentation maps and body key-point information. We show qualitative and quantitative comparisons to several other networks to demonstrate the effectiveness of this technique.

Amir Hossein Raffiee | Michael Sollami | A. Raffiee | Michael Sollami

[1] Luc Van Gool,et al. Disentangled Person Image Generation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[2] Francesc Moreno-Noguer,et al. Unsupervised Person Image Synthesis in Arbitrary Poses , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[3] Xu Chen,et al. Unpaired Pose Guided Human Image Generation , 2019, CVPR Workshops.

[4] Li Fei-Fei,et al. Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[5] Yu Liu,et al. SwapGAN: A Multistage Generative Approach for Person-to-Person Fashion Style Transfer , 2019, IEEE Transactions on Multimedia.

[6] Ping Tan,et al. DualGAN: Unsupervised Dual Learning for Image-to-Image Translation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[7] Sanja Fidler,et al. Be Your Own Prada: Fashion Synthesis with Structural Coherence , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[8] Ke Gong,et al. Look into Person: Self-Supervised Structure-Sensitive Learning and a New Benchmark for Human Parsing , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[10] Duygu Ceylan,et al. SwapNet: Garment Transfer in Single View Images , 2018, European Conference on Computer Vision.

[11] Wojciech Zaremba,et al. Improved Techniques for Training GANs , 2016, NIPS.

[12] Larry S. Davis,et al. Compatible and Diverse Fashion Image Inpainting , 2019, ArXiv.

[13] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14] Peter V. Gehler,et al. A Generative Model of People in Clothing , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[15] Bingbing Ni,et al. Skeleton-Aided Articulated Motion Generation , 2017, ACM Multimedia.

[16] Sergey Ioffe,et al. Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17] Alexei A. Efros,et al. Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18] Jérémie Mary,et al. End-to-End Learning of Geometric Deformations of Feature Maps for Virtual Try-On , 2019, ArXiv.

[19] Bernt Schiele,et al. Generative Adversarial Text to Image Synthesis , 2016, ICML.

[20] Ming Yang,et al. Instance-level Human Parsing via Part Grouping Network , 2018, ECCV.

[21] Jan Kautz,et al. High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[22] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[23] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[24] Andrea Vedaldi,et al. Instance Normalization: The Missing Ingredient for Fast Stylization , 2016, ArXiv.

[25] Gang Yu,et al. Cascaded Pyramid Network for Multi-person Pose Estimation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[26] 拓海杉山,et al. “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告 , 2017 .

[27] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[28] Hanjiang Lai,et al. Towards Multi-Pose Guided Virtual Try-On Network , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[29] Sepp Hochreiter,et al. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium , 2017, NIPS.

[30] Hyunsoo Kim,et al. Learning to Discover Cross-Domain Relations with Generative Adversarial Networks , 2017, ICML.

[31] Luc Van Gool,et al. Pose Guided Person Image Generation , 2017, NIPS.

[32] Larry S. Davis,et al. VITON: An Image-Based Virtual Try-on Network , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[33] Michael J. Black,et al. Detailed, Accurate, Human Shape Estimation from Clothed 3D Scan Sequences , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34] Aaron C. Courville,et al. Improved Training of Wasserstein GANs , 2017, NIPS.

[35] Yuichi Yoshida,et al. Spectral Normalization for Generative Adversarial Networks , 2018, ICLR.

[36] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.

[37] Yusuke Iwasawa,et al. Generative Adversarial Network-Based Virtual Try-On with Clothing Region , 2018 .

[38] Taesung Park,et al. Semantic Image Synthesis With Spatially-Adaptive Normalization , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[39] Nikolay Jetchev,et al. The Conditional Analogy GAN: Swapping Fashion Articles on People Images , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[40] Daniel Cremers,et al. DeepWrinkles: Accurate and Realistic Clothing Modeling , 2018, ECCV.

[41] Nicu Sebe,et al. Deformable GANs for Pose-Based Human Image Generation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[42] Liang Lin,et al. Toward Characteristic-Preserving Image-based Virtual Try-On Network , 2018, ECCV.