E-commerce websites such as Amazon, Alibaba, Flipkart, and Walmart sell billions of products. Machine learning (ML) algorithms involving products are often used to improve the customer experience and increase revenue, e.g., product similarity, recommendation, and price estimation. The products are required to be represented as features before training an ML algorithm. In this paper, we propose an approach called MRNet-Product2Vec for creating generic embeddings of products within an e-commerce ecosystem. We learn a dense and low-dimensional embedding where a diverse set of signals related to a product are explicitly injected into its representation. We train a Discriminative Multi-task Bidirectional Recurrent Neural Network (RNN), where the input is a product title fed through a Bidirectional RNN and at the output, product labels corresponding to fifteen different tasks are predicted. The task set includes several intrinsic characteristics about a product such as price, weight, size, color, popularity, and material. We evaluate the proposed embedding quantitatively and qualitatively. We demonstrate that they are almost as good as sparse and extremely high-dimensional TF-IDF representation in spite of having less than 3% of the TF-IDF dimension. We also use a multimodal autoencoder for comparing products from different language-regions and show preliminary yet promising qualitative results.
[1]
M. Köppen,et al.
The Curse of Dimensionality
,
2010
.
[2]
Geoffrey I. Webb,et al.
Encyclopedia of Machine Learning
,
2011,
Encyclopedia of Machine Learning.
[3]
Juhan Nam,et al.
Multimodal Deep Learning
,
2011,
ICML.
[4]
Jürgen Schmidhuber,et al.
Long Short-Term Memory
,
1997,
Neural Computation.
[5]
Rich Caruana,et al.
Multitask Learning
,
1998,
Encyclopedia of Machine Learning and Data Mining.
[6]
Jeffrey Dean,et al.
Distributed Representations of Words and Phrases and their Compositionality
,
2013,
NIPS.
[7]
Nemanja Djuric,et al.
E-commerce in Your Inbox: Product Recommendations at Scale
,
2015,
KDD.
[8]
Paul J. Werbos,et al.
Backpropagation Through Time: What It Does and How to Do It
,
1990,
Proc. IEEE.
[9]
Elena Smirnova,et al.
Meta-Prod2Vec: Product Embeddings Using Side-Information for Recommendation
,
2016,
RecSys.
[10]
Sepp Hochreiter,et al.
The Vanishing Gradient Problem During Learning Recurrent Neural Nets and Problem Solutions
,
1998,
Int. J. Uncertain. Fuzziness Knowl. Based Syst..