论文信息 - Improving Semantic Matching via Multi-Task Learning in E-Commerce - 字舞流文

Improving Semantic Matching via Multi-Task Learning in E-Commerce

Semantic matching plays a critical role in an e-commerce search engine, while one of the biggest challenges is the language gap between queries and products. Traditionally, some auxiliary functions, such as the category navigation, are designed to help buyers to clarify their intent. Recently, the advances in deep learning provide new opportunities to bridge the gap, however, these techniques suffer from the data sparseness problem. To address this issue, in addition to the click-through data from buyers, we exploit other types of semantic knowledge from the product category taxonomy and sellers’ behavior. We investigate the correlation between query intent classification and semantic textual similarity, and propose a multi-task framework to boost their performance simultaneously. Moreover, we design a Progressively Hierarchical Classification (PHC) network architecture with the taxonomy to solve the category imbalance problem . We conduct extensive offline and online A/B experiments on a real-world e-commerce platform, and the results show that the proposed method in this paper significantly outperforms the baseline and achieves higher commercial value.

Tianyi Wang | Hongchun Zhang | Xiaonan Meng | Yi Hu | Xiaonan Meng | Tianyi Wang | Hongchun Zhang | Yi Hu

[1] Rabab K. Ward,et al. Deep Sentence Embedding Using the Long Short-Term Memory Networks , 2015 .

[2] Zhiyuan Liu,et al. Convolutional Neural Networks for Soft-Matching N-Grams in Ad-hoc Search , 2018, WSDM.

[3] Rong Jin,et al. Visual Search at Alibaba , 2018, KDD.

[4] Kaiming He,et al. Focal Loss for Dense Object Detection , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[5] Jing Wang,et al. Towards Mitigating the Class-Imbalance Problem for Partial Label Learning , 2018, KDD.

[6] Yoon Kim,et al. Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[7] Larry P. Heck,et al. Learning deep structured semantic models for web search using clickthrough data , 2013, CIKM.

[8] Yaohui Jin,et al. A Generalized Recurrent Neural Architecture for Text Classification with Multi-Task Learning , 2017, IJCAI.

[9] Zhiyuan Liu,et al. End-to-End Neural Ad-hoc Ranking with Kernel Pooling , 2017, SIGIR.

[10] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[11] W. Bruce Croft,et al. Neural Ranking Models with Weak Supervision , 2017, SIGIR.

[12] Xueqi Cheng,et al. Match-SRNN: Modeling the Recursive Matching Structure with Spatial RNN , 2016, IJCAI.

[13] Sattar Hashemi,et al. To Combat Multi-Class Imbalanced Problems by Means of Over-Sampling Techniques , 2016, IEEE Transactions on Knowledge and Data Engineering.

[14] Rong Xiao,et al. Weakly Supervised Co-Training of Query Rewriting andSemantic Matching for e-Commerce , 2019, WSDM.

[15] Xueqi Cheng,et al. Text Matching as Image Recognition , 2016, AAAI.

[16] Yee Whye Teh,et al. A fast and simple algorithm for training neural probabilistic language models , 2012, ICML.

[17] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[18] Aapo Hyvärinen,et al. Noise-Contrastive Estimation of Unnormalized Statistical Models, with Applications to Natural Image Statistics , 2012, J. Mach. Learn. Res..

[19] Hang Li,et al. Convolutional Neural Network Architectures for Matching Natural Language Sentences , 2014, NIPS.

[20] Sebastian Ruder,et al. Universal Language Model Fine-tuning for Text Classification , 2018, ACL.

[21] Alec Radford,et al. Improving Language Understanding by Generative Pre-Training , 2018 .

[22] Ye Zhang,et al. A Sensitivity Analysis of (and Practitioners’ Guide to) Convolutional Neural Networks for Sentence Classification , 2015, IJCNLP.

[23] Xuanjing Huang,et al. Convolutional Neural Tensor Network Architecture for Community-Based Question Answering , 2015, IJCAI.

[24] Yelong Shen,et al. A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval , 2014, CIKM.

[25] Quoc V. Le,et al. Distributed Representations of Sentences and Documents , 2014, ICML.

[26] Jun Zhao,et al. Recurrent Convolutional Neural Networks for Text Classification , 2015, AAAI.