Fine-Grained Product Categorization in E-commerce

E-commerce sites usually leverage taxonomies for better organizing products. The fine-grained categories, regarding the leaf categories in taxonomies, are defined by the most descriptive and specific words of products. Fine-grained product categorization remains challenging, due to blurred concepts of fine grained categories (i.e. multiple equivalent or synonymous categories), instable category vocabulary (i.e. the emerging new products and the evolving language habits), and lack of labelled data. To address these issues, we proposes a novel Neural Product Categorization model---NPC to identify fine-grained categories from the product content. NPC is equipped with a character-level convolutional embedding layer to learn the compositional word representations, and a spiral residual layer to extract the word context annotations capturing complex long range dependencies and structural information. To perform categorization beyond predefined categories, NPC categorizes a product by jointly recognizing categories from the product content and predicting categories from predefined category vocabularies. Furthermore, to avoid extensive human labors, NPC is able to adapt to weak labels, generated by mining the search logs, where the customers' behaviors naturally connect products with categories. Extensive experiments performed on a real e-commerce platform datasets illustrate the effectiveness of the proposed models.

[1]  Hema Raghavan,et al.  Improving ad relevance in sponsored search , 2010, WSDM '10.

[2]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Zhendong Niu,et al.  Fine-grained Product Features Extraction and Categorization in Reviews Opinion Mining , 2012, 2012 IEEE 12th International Conference on Data Mining Workshops.

[4]  Vanja Josifovski,et al.  Supercharging Recommender Systems using Taxonomies for Learning User Purchase Behavior , 2012, Proc. VLDB Endow..

[5]  Aaron Levine,et al.  Large-Scale Categorization of Japanese Product Titles Using Neural Attention Models , 2017, EACL.

[6]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[7]  Eduard H. Hovy,et al.  End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF , 2016, ACL.

[8]  Zornitsa Kozareva,et al.  Everyone Likes Shopping! Multi-class Product Categorization for e-Commerce , 2015, NAACL.

[9]  Harish Karnick,et al.  Product Classification in E-Commerce using Distributional Semantics , 2016, COLING.

[10]  Edleno Silva de Moura,et al.  Lightweight methods for large-scale product categorization , 2011, J. Assoc. Inf. Sci. Technol..

[11]  Michel C. A. Klein,et al.  GoldenBullet: Automated Classification of Product Data in E-commerce , 2002 .

[12]  Ali Cevahir,et al.  Large-scale Multi-class and Hierarchical Product Categorization for an E-commerce Giant , 2016, COLING.

[13]  Jürgen Schmidhuber,et al.  Highway Networks , 2015, ArXiv.

[14]  Neel Sundaresan,et al.  Item categorization in the e-commerce domain , 2011, CIKM '11.

[15]  Jeonghee Kim,et al.  Large-Scale Item Categorization in e-Commerce Using Multiple Recurrent Neural Networks , 2016, KDD.