Active Learning for Product Type Ontology Enhancement in E-commerce

Entity-based semantic search has been widely adopted in modern search engines to improve search accuracy by understanding users' intent. In e-commerce, an accurate and complete product type (PT) ontology is essential for recognizing product entities in queries and retrieving relevant products from the catalog. However, finding PTs to construct such an ontology is usually expensive because of the considerable human effort involved. In this work, we propose an active learning framework that makes efficient use of domain experts' knowledge for PT discovery. Our experimental results demonstrate the quality and coverage of the resulting PTs.
