Click Through Rate Prediction for Local Search Results

With the ubiquity of internet access and location services provided by smartphone devices, the volume of queries issued by users to find products and services that are located near them is rapidly increasing. Local search engines help users in this task by matching queries with a predefined geographical connotation ("local queries") against a database of local business listings. Local search differs from traditional web-search because to correctly capture users' click behavior, the estimation of relevance between query and candidate results must be integrated with geographical signals, such as distance. The intuition is that users prefer businesses that are physically closer to them. However, this notion of closeness is likely to depend upon other factors, like the category of the business, the quality of the service provided, the density of businesses in the area of interest, etc. In this paper we perform an extensive analysis of online users' behavior and investigate the problem of estimating the click-through rate on local search (LCTR) by exploiting the combination of standard retrieval methods with a rich collection of geo and business-dependent features. We validate our approach on a large log collected from a real-world local search service. Our evaluation shows that the non-linear combination of business information, geo-local and textual relevance features leads to a significant improvements over state of the art alternative approaches based on a combination of relevance, distance and business reputation.

[1]  Q. Vuong Likelihood Ratio Tests for Model Selection and Non-Nested Hypotheses , 1989 .

[2]  Sergei Vassilvitskii,et al.  Driven by Food: Modeling Geographic Choice , 2015, WSDM.

[3]  J. Friedman Stochastic gradient boosting , 2002 .

[4]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[5]  Jaime Teevan,et al.  Understanding the importance of location, time, and people in mobile local search behavior , 2011, Mobile HCI.

[6]  James Caverlee,et al.  Exploiting Geo-Spatial Preference for Personalized Expert Recommendation , 2015, RecSys.

[7]  Feng Zhao,et al.  Hapori: context-based local search for mobile phones using community behavioral modeling and similarity , 2010, UbiComp.

[8]  B. Hillier,et al.  The Social Logic of Space , 1984 .

[9]  S. T. Dumais,et al.  Using latent semantic analysis to improve access to textual information , 1988, CHI '88.

[10]  Thorsten Joachims,et al.  Accurately interpreting clickthrough data as implicit feedback , 2005, SIGIR '05.

[11]  Yi Chang,et al.  Learning to rank with multi-aspect relevance for vertical search , 2012, WSDM '12.

[12]  Mao Ye,et al.  Exploiting geographical influence for collaborative point-of-interest recommendation , 2011, SIGIR.

[13]  Deepak Agarwal,et al.  Spatio-temporal models for estimating click-through rate , 2009, WWW '09.

[14]  Olivier Chapelle,et al.  A dynamic bayesian network click model for web search ranking , 2009, WWW '09.

[15]  James D. Hollan,et al.  A diary study of mobile information needs , 2008, CHI.

[16]  Daniele Quercia,et al.  Ads and the city: considering geographic distance goes a long way , 2012, RecSys '12.

[17]  Martin Wattenberg,et al.  Ad click prediction: a view from the trenches , 2013, KDD.

[18]  Andreas Henrich,et al.  Characteristics of geographic information needs , 2007, GIR '07.

[19]  Barry Smyth,et al.  Understanding the intent behind mobile information needs , 2009, IUI.

[20]  Jiang Bian,et al.  A taxonomy of local search: semi-supervised query classification driven by information needs , 2011, CIKM '11.

[21]  Matthew Richardson,et al.  Predicting clicks: estimating the click-through rate for new ads , 2007, WWW '07.

[22]  Mark E. J. Newman,et al.  Power-Law Distributions in Empirical Data , 2007, SIAM Rev..

[23]  The Business Next Door : Click-Through Rate Modeling for Local Search , 2010 .

[24]  Ryen W. White,et al.  Here and there: goals, activities, and predictions about location from geotagged queries , 2013, SIGIR.

[25]  Steffen Rendle,et al.  Factorization Machines with libFM , 2012, TIST.

[26]  Hugo Zaragoza,et al.  The Probabilistic Relevance Framework: BM25 and Beyond , 2009, Found. Trends Inf. Retr..

[27]  Klaus Berberich,et al.  Improving local search ranking through external logs , 2011, SIGIR.

[28]  Jie Liu,et al.  Location-aware click prediction in mobile local search , 2011, CIKM '11.