论文信息 - A Tale of Color Variants: Representation and Self-Supervised Learning in Fashion E-Commerce

A Tale of Color Variants: Representation and Self-Supervised Learning in Fashion E-Commerce

In this paper, we address a crucial problem in fashion ecommerce (with respect to customer experience, as well as revenue): color variants identification, i.e., identifying fashion products that match exactly in their design (or style), but only to differ in their color. We propose a generic framework, that leverages deep visual Representation Learning at its heart, to address this problem for our fashion e-commerce platform. Our framework could be trained with supervisory signals in the form of triplets, that are obtained manually. However, it is infeasible to obtain manual annotations for the entire huge collection of data usually present in fashion e-commerce platforms, such as ours, while capturing all the difficult corner cases. But, to our rescue, interestingly we observed that this crucial problem in fashion e-commerce could also be solved by simple color jitter based image augmentation, that recently became widely popular in the contrastive Self-Supervised Learning (SSL) literature, that seeks to learn visual representations without using manual labels. This naturally led to a question in our mind: Could we leverage SSL in our use-case, and still obtain comparable performance to our supervised framework? The answer is, Yes! because, color variant fashion objects are nothing but manifestations of a style, in different colors, and a model trained to be invariant to the color (with, or without supervision), should be able to recognize this! This is what the paper further demonstrates, both qualitatively, and quantitatively, while evaluating a couple of state-of-the-art SSL techniques, and also proposing a

[1] Xinlei Chen,et al. Exploring Simple Siamese Representation Learning , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[2] Ali Farhadi,et al. YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3] Ser-Nam Lim,et al. Unsupervised Deep Metric Learning via Auxiliary Rotation Loss , 2019, ArXiv.

[4] Geoffrey E. Hinton,et al. A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.

[5] Yichen Wei,et al. Circle Loss: A Unified Perspective of Pair Similarity Optimization , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[6] James Philbin,et al. FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7] Yingli Tian,et al. Self-Supervised Visual Feature Learning With Deep Neural Networks: A Survey , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8] Zhihai He,et al. Unsupervised Deep Metric Learning with Transformed Attention Consistency and Contrastive Clustering Loss , 2020, ECCV.

[9] Michal Valko,et al. Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning , 2020, NeurIPS.

[10] Mehrtash Harandi,et al. Unsupervised Metric Learning with Synthetic Examples , 2020, AAAI.

[11] Geonmo Gu,et al. Symmetrical Synthesis for Deep Metric Learning , 2020, AAAI.

[12] Kaiming He,et al. Momentum Contrast for Unsupervised Visual Representation Learning , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[13] Trevor Darrell,et al. Reducing Class Collapse in Metric Learning with Easy Positive Sampling , 2020, ArXiv.

[14] Serge J. Belongie,et al. Conditional Similarity Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15] Robinson Piramuthu,et al. Large scale visual recommendations from street fashion images , 2014, KDD.

[16] Xiong Chen,et al. Learning Discriminative Features with Multiple Granularities for Person Re-Identification , 2018, ACM Multimedia.

[17] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18] Mehrtash Harandi,et al. Unsupervised Deep Metric Learning via Orthogonality Based Probabilistic Loss , 2020, IEEE Transactions on Artificial Intelligence.