Robust 3D Garment Digitization from Monocular 2D Images for 3D Virtual Try-On Systems

In this paper, we develop a robust 3D garment digitization solution that generalizes well to real-world fashion catalog images with cloth texture occlusions and large body pose variations. We assume fixed-topology parametric template mesh models for known garment types (e.g., T-shirts, trousers) and map high-quality texture from an input catalog image to the UV map panels of the garment's parametric mesh model. We achieve this by first predicting a sparse set of 2D landmarks on the garment boundary. We then use these landmarks to perform Thin-Plate-Spline (TPS) based texture transfer onto the UV map panels. Next, we employ a deep texture inpainting network to fill the large holes in the TPS output (caused by view variations and self-occlusions) and generate consistent UV maps. To train the supervised deep networks for the landmark prediction and texture inpainting tasks, we generate a large synthetic dataset with varying texture and lighting, rendered from various views with the human in a wide variety of poses. Additionally, we manually annotate a small set of fashion catalog images crawled from online fashion e-commerce platforms and use them to fine-tune the networks. We conduct thorough empirical evaluations and show impressive qualitative results of our proposed 3D garment texturing solution on fashion catalog images. Such 3D garment digitization helps address the challenging task of enabling 3D virtual try-on.
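
The landmark-driven TPS texture transfer step described above can be illustrated with a minimal NumPy sketch. This is not the paper's implementation: the function names (`fit_tps`, `apply_tps`, `transfer_texture`), the nearest-neighbour sampling, and the out-of-bounds hole mask are all assumptions made for illustration. The sketch fits a standard 2D thin-plate spline that maps UV-panel landmark positions to the corresponding image landmark positions, then fills each UV pixel by looking up the image at its warped location.

```python
import numpy as np

def _tps_kernel(r2):
    # Radial basis U(r) = r^2 log r, computed from squared distances, with U(0) = 0.
    out = np.zeros_like(r2, dtype=float)
    nz = r2 > 0
    out[nz] = 0.5 * r2[nz] * np.log(r2[nz])
    return out

def fit_tps(src, dst):
    # Solve the standard TPS system [[K, P], [P^T, 0]] [w; a] = [dst; 0].
    # src, dst: (n, 2) arrays of corresponding (x, y) points, n >= 3, not collinear.
    n = src.shape[0]
    d2 = ((src[:, None, :] - src[None, :, :]) ** 2).sum(-1)
    P = np.hstack([np.ones((n, 1)), src])
    A = np.zeros((n + 3, n + 3))
    A[:n, :n] = _tps_kernel(d2)
    A[:n, n:] = P
    A[n:, :n] = P.T
    b = np.zeros((n + 3, 2))
    b[:n] = dst
    return np.linalg.solve(A, b)  # rows 0..n-1: kernel weights, last 3 rows: affine part

def apply_tps(params, src, pts):
    # Evaluate the fitted spline at arbitrary (m, 2) points.
    d2 = ((pts[:, None, :] - src[None, :, :]) ** 2).sum(-1)
    P = np.hstack([np.ones((pts.shape[0], 1)), pts])
    return _tps_kernel(d2) @ params[:-3] + P @ params[-3:]

def transfer_texture(image, img_landmarks, uv_landmarks, uv_size):
    # Backward warping: for every UV pixel, find where it lands in the image.
    h, w = uv_size
    ys, xs = np.mgrid[0:h, 0:w]
    grid = np.stack([xs.ravel(), ys.ravel()], axis=1).astype(float)
    params = fit_tps(uv_landmarks, img_landmarks)
    mapped = apply_tps(params, uv_landmarks, grid)
    cols = np.rint(mapped[:, 0]).astype(int)
    rows = np.rint(mapped[:, 1]).astype(int)
    # Assumption: UV pixels that map outside the image are treated as holes.
    valid = (cols >= 0) & (cols < image.shape[1]) & (rows >= 0) & (rows < image.shape[0])
    uv = np.zeros((h * w,) + image.shape[2:], dtype=image.dtype)
    uv[valid] = image[rows[valid], cols[valid]]
    return uv.reshape((h, w) + image.shape[2:]), valid.reshape(h, w)

# Toy usage: four landmarks related by a pure translation of (+1, +1).
image = np.arange(6)[:, None] * 10 + np.arange(6)[None, :]
uv_landmarks = np.array([[0.0, 0.0], [3.0, 0.0], [0.0, 3.0], [3.0, 3.0]])
img_landmarks = uv_landmarks + 1.0
uv_map, mask = transfer_texture(image, img_landmarks, uv_landmarks, (4, 4))
```

In this sketch the returned mask only flags samples that fall outside the image. In the pipeline described above, holes also arise from self-occlusion and viewpoint, and those regions are what the inpainting network is trained to fill.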
