End-to-End Neural Ranking for eCommerce Product Search: an Application of Task Models and Textual Embeddings

We consider the problem of retrieving and ranking items in an eCommerce catalog, often called SKUs, in order of relevance to a user-issued query. The input data for the ranking are the texts of the queries and textual fields of the SKUs indexed in the catalog. We review the ways in which this problem both resembles and differs from the problems of IR in the context of web search. The differences between the product-search problem and the IR problem of web search necessitate a different approach in terms of both models and datasets. We first review the recent state-of-the-art models for web search IR, distinguishing between two distinct types of model which we call the distributed type and the local-interaction type. The different types of relevance models developed for IR have complementary advantages and disadvantages when applied to eCommerce product search. Further, we explain why the conventional methods for dataset construction employed in the IR literature fail to produce data which suffices for training or evaluation of models for eCommerce product search. We explain how our own approach, applying task modeling techniques to the click-through logs of an eCommerce site, enables the construction of a large-scale dataset for training and robust benchmarking of relevance models. Our experiments consist of applying several of the models from the IR literature to our own dataset. Empirically, we have established that, when applied to our dataset, certain models of local-interaction type reduce ranking errors by one-third compared to the baseline tf-idf. Applied to our dataset, the distributed models fail to outperform the baseline. As a basis for a deployed system, the distributed models have several advantages, computationally, over the local-interaction models. This motivates an ongoing program of work, which we outline at the conclusion of the paper.

[1]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[2]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[3]  M. de Rijke,et al.  Click Models for Web Search , 2015, Click Models for Web Search.

[4]  Guang Yang,et al.  Towards Practical Visual Search Engine within Elasticsearch , 2018, eCOM@SIGIR.

[5]  Jimmy Lin,et al.  An Exploration of Approaches to Integrating Neural Reranking Models in Multi-Stage Ranking Architectures , 2017, ArXiv.

[6]  Nick Craswell,et al.  Learning to Match using Local and Distributed Representations of Text for Web Search , 2016, WWW.

[7]  Rishabh K. Iyer,et al.  Modeling and Simultaneously Removing Bias via Adversarial Neural Networks , 2018, ArXiv.

[8]  Larry P. Heck,et al.  Learning deep structured semantic models for web search using clickthrough data , 2013, CIKM.

[9]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[10]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[11]  Zhiyuan Liu,et al.  End-to-End Neural Ad-hoc Ranking with Kernel Pooling , 2017, SIGIR.

[12]  Yoav Goldberg,et al.  Neural Network Methods for Natural Language Processing , 2017, Synthesis Lectures on Human Language Technologies.

[13]  Bhaskar Mitra,et al.  Neural Models for Information Retrieval , 2017, ArXiv.

[14]  Reynold Xin,et al.  Apache Spark , 2016 .

[15]  Yuchen Zhang,et al.  User-click modeling for understanding and predicting search-behavior , 2011, KDD.

[16]  Zheng Yan,et al.  Towards a Simplified Ontology for Better e-Commerce Search , 2018, eCOM@SIGIR.

[17]  Hang Li Learning to Rank for Information Retrieval and Natural Language Processing , 2011, Synthesis Lectures on Human Language Technologies.

[18]  Yelong Shen,et al.  A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval , 2014, CIKM.

[19]  Petr Sojka,et al.  Software Framework for Topic Modelling with Large Corpora , 2010 .

[20]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[21]  Vít Novotný,et al.  Semantic Vector Encoding and Similarity Search Using Fulltext Search Engines , 2017, Rep4NLP@ACL.

[22]  Hugo Zaragoza,et al.  The Probabilistic Relevance Framework: BM25 and Beyond , 2009, Found. Trends Inf. Retr..

[23]  Clinton Gormley,et al.  Elasticsearch: The Definitive Guide , 2015 .

[24]  M. de Rijke,et al.  A Neural Click Model for Web Search , 2016, WWW.