Query Reformulation in E-Commerce Search

The importance of e-commerce platforms has driven forward a growing body of research work on e-commerce search. We present the first large-scale and in-depth study of query reformulations performed by users of e-commerce search; the study is based on the query logs of eBay's search engine. We analyze various factors including the distribution of different types of reformulations, changes of search result pages retrieved for the reformulations, and clicks and purchases performed upon the retrieved results. We then turn to address a novel challenge in the e-commerce search realm: predicting whether a user will reformulate her query before presenting her the search results. Using a suite of prediction features, most of which are novel to this study, we attain high prediction quality. Some of the features operate prior to retrieval time, whereas others rely on the retrieved results. While the latter are substantially more effective than the former, we show that the integration of these two types of features is of merit. We also show that high prediction quality can be obtained without considering information from the past about the user or the query she posted. Nevertheless, using these types of information can further improve prediction quality.

[1]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[2]  Zaiqiao Meng,et al.  Dynamic Bayesian Metric Learning for Personalized Product Search , 2019, CIKM.

[3]  M. de Rijke,et al.  Mix 'n Match: Integrating Text Matching and Product Substitutability within Product Search , 2018, CIKM.

[4]  Christos Faloutsos,et al.  Did We Get It Right? Predicting Query Performance in e-Commerce Search , 2018, eCOM@SIGIR.

[5]  Shubhra Kanti Karmaker Santu,et al.  On Application of Learning to Rank for E-Commerce Search , 2017, SIGIR.

[6]  Nick Craswell,et al.  Beyond clicks: query reformulation as a predictor of search satisfaction , 2013, CIKM.

[7]  Pu-Jen Cheng,et al.  Learning user reformulation behavior for query auto-completion , 2014, SIGIR.

[8]  Yo-Ping Huang,et al.  Predicting purchase intention according to fan page users' sentiment , 2017, 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[9]  W. Bruce Croft,et al.  Predicting query performance , 2002, SIGIR '02.

[10]  Lior Rokach,et al.  Product Bundle Identification using Semi-Supervised Learning , 2020, SIGIR.

[11]  Ankur Datta,et al.  Predicting Shopping Behavior with Mixture of RNNs , 2017, eCOM@SIGIR.

[12]  Panayiotis Tsaparas,et al.  Structured annotations of web queries , 2010, SIGMOD Conference.

[13]  Thomas Mandl,et al.  Effects of Auto-Suggest on the Usability of Search in eCommerce , 2015, ISI.

[14]  SpinkAmanda,et al.  Patterns of query reformulation during Web searching , 2009 .

[15]  Oren Kurland,et al.  Selective Cluster Presentation on the Search Results Page , 2018, ACM Trans. Inf. Syst..

[16]  Ryen W. White,et al.  Struggling and Success in Web Search , 2015, CIKM.

[17]  Oren Kurland,et al.  Query-performance prediction: setting the expectations straight , 2014, SIGIR.

[18]  Efthimis N. Efthimiadis,et al.  Analyzing and evaluating query reformulation strategies in web search logs , 2009, CIKM.

[19]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[20]  Sreenivas Gollapudi,et al.  Structured query reformulations in commerce search , 2012, CIKM '12.

[21]  Djoerd Hiemstra,et al.  The Combination and Evaluation of Query Performance Prediction Methods , 2009, ECIR.

[22]  Hua Yang,et al.  Query Rewrite for Null and Low Search Results in eCommerce , 2017, eCOM@SIGIR.

[23]  Rosie Jones,et al.  Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs , 2008, CIKM '08.

[24]  S. T. Buckland,et al.  Computer-Intensive Methods for Testing Hypotheses. , 1990 .

[25]  Martin F. Arlitt,et al.  Characterizing Web user sessions , 2000, PERV.

[26]  David Carmel,et al.  One Query, Many Clicks: Analysis of Queries with Multiple Clicks by the Same User , 2016, CIKM.

[27]  Jean-David Ruvini,et al.  Learning Better Internal Structure of Words for Sequence Labeling , 2018, EMNLP.

[28]  Jun Zhao,et al.  End-to-End Neural Ranking for eCommerce Product Search: an Application of Task Models and Textual Embeddings , 2018, eCOM@SIGIR.

[29]  Ido Guy,et al.  Structuring the Unstructured: From Startup to Making Sense of eBay's Huge eCommerce Inventory , 2017, SIGIR.

[30]  Umut Ozertem,et al.  Characterizing and Predicting Voice Query Reformulation , 2015, CIKM.

[31]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[32]  W. Bruce Croft,et al.  A Language Modeling Approach to Information Retrieval , 1998, SIGIR Forum.

[33]  Stephen E. Robertson,et al.  GatfordCentre for Interactive Systems ResearchDepartment of Information , 1996 .

[34]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[35]  Yiqun Liu,et al.  User Intent, Behaviour, and Perceived Satisfaction in Product Search , 2018, WSDM.

[36]  Rotem Dror,et al.  The Hitchhiker’s Guide to Testing Statistical Significance in Natural Language Processing , 2018, ACL.

[37]  J. Fleiss Measuring nominal scale agreement among many raters. , 1971 .

[38]  Oren Kurland,et al.  Query performance prediction for entity retrieval , 2014, SIGIR.

[39]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[40]  Ryen W. White,et al.  Understanding and Predicting Graded Search Satisfaction , 2015, WSDM.

[41]  Mohit Sharma,et al.  A Taxonomy of Queries for E-commerce Search , 2018, SIGIR.

[42]  Elad Yom-Tov,et al.  Estimating the query difficulty for information retrieval , 2010, Synthesis Lectures on Information Concepts, Retrieval, and Services.

[43]  Nish Parikh,et al.  User behavior in zero-recall ecommerce queries , 2011, SIGIR '11.

[44]  Ryen W. White,et al.  Struggling or exploring?: disambiguating long search sessions , 2014, WSDM.

[45]  Haggai Roitman,et al.  Enhanced Mean Retrieval Score Estimation for Query Performance Prediction , 2017, ICTIR.

[46]  Yujing Hu,et al.  Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization, Analysis, and Application , 2018, KDD.

[47]  Amanda Spink,et al.  Patterns of query reformulation during Web searching , 2009, J. Assoc. Inf. Sci. Technol..

[48]  Mohit Sharma,et al.  Intent term selection and refinement in e-commerce queries , 2019, ArXiv.

[49]  Oren Kurland,et al.  Predicting Query Performance by Query-Drift Estimation , 2009, TOIS.

[50]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[51]  Dong Wang,et al.  Neural IR Meets Graph Embedding: A Ranking Model for Product Search , 2019, WWW.

[52]  Aristides Gionis,et al.  The query-flow graph: model and applications , 2008, CIKM '08.

[53]  Shengli Wu,et al.  Query Performance Prediction By Considering Score Magnitude and Variance Together , 2014, CIKM.

[54]  W. Bruce Croft,et al.  Query performance prediction in web search environments , 2007, SIGIR.

[55]  Ronan G. Reilly,et al.  Predicting Purchasing Intent: Automatic Feature Learning using Recurrent Neural Networks , 2018, eCOM@SIGIR.

[56]  Oren Kurland,et al.  Query Performance Prediction Using Reference Lists , 2016, ACM Trans. Inf. Syst..

[57]  Joemon M. Jose,et al.  Improved query performance prediction using standard deviation , 2011, SIGIR.

[58]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[59]  Falk Scholer,et al.  Effective Pre-retrieval Query Performance Prediction Using Similarity and Variability Evidence , 2008, ECIR.