A noise-aware click model for web search

Recent advances in click model have established it as an attractive approach to infer document relevance. Most of these advances consider the user click/skip behavior as binary events but neglect the context in which a click happens. We show that real click behavior in industrial search engines is often noisy and not always a good indication of relevance. For a considerable percentage of clicks, users select what turn out to be irrelevant documents and these clicks should not be directly used as evidence for relevance inference. Thus in this paper, we put forward an observation that the relevance indication degree of a click is not a constant, but can be differentiated by user preferences and the context in which the user makes her click decision. In particular, to interpret the click behavior discriminatingly, we propose a Noise-aware Click Model (NCM) by characterizing the noise degree of a click, which indicates the quality of the click for inferring relevance. Specifically, the lower the click noise is, the more important the click is in its role for relevance inference. To verify the necessity of explicitly accounting for the uninformative noise in a user click, we conducted experiments on a billion-scale dataset. Extensive experimental results demonstrate that as compared with two state-of-the-art click models in Web Search, NCM can better interpret user click behavior and achieve significant improvements in terms of both perplexity and NDCG.

[1]  Zhihua Zhang,et al.  Learning click models via probit bayesian inference , 2010, CIKM.

[2]  Yuchen Zhang,et al.  User-click modeling for understanding and predicting search-behavior , 2011, KDD.

[3]  Ciya Liao,et al.  A model to estimate intrinsic document relevance from the clickthrough logs of a web search engine , 2010, WSDM '10.

[4]  Ramakrishnan Srikant,et al.  User browsing models: relevance versus examination , 2010, KDD.

[5]  Nick Craswell,et al.  An experimental comparison of click position-bias models , 2008, WSDM '08.

[6]  Erick Cantú-Paz,et al.  Temporal click model for sponsored search , 2010, SIGIR.

[7]  Pia Borlund,et al.  The concept of relevance in IR , 2003, J. Assoc. Inf. Sci. Technol..

[8]  S. Chib,et al.  Bayesian analysis of binary and polychotomous response data , 1993 .

[9]  David Maxwell Chickering,et al.  Modeling Contextual Factors of Click Rates , 2007, AAAI.

[10]  Yuchen Zhang,et al.  Incorporating post-click behaviors into a click model , 2010, SIGIR.

[11]  Chao Liu,et al.  BBM: bayesian browsing model from petabyte-scale data , 2009, KDD.

[12]  Thorsten Joachims,et al.  Eye-tracking analysis of user behavior in WWW search , 2004, SIGIR '04.

[13]  Brendan J. Frey,et al.  Factor graphs and the sum-product algorithm , 2001, IEEE Trans. Inf. Theory.

[14]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[15]  Zheng Chen,et al.  A novel click model and its applications to online advertising , 2010, WSDM '10.

[16]  Olivier Chapelle,et al.  A dynamic bayesian network click model for web search ranking , 2009, WWW '09.

[17]  Chao Liu,et al.  Efficient multiple-click models in web search , 2009, WSDM '09.

[18]  Chao Liu,et al.  Click chain model in web search , 2009, WWW '09.

[19]  Qiang Yang,et al.  A Whole Page Click Model to Better Interpret Search Engine Click Data , 2011, AAAI.

[20]  Ziv Bar-Yossef,et al.  Context-sensitive query auto-completion , 2011, WWW.

[21]  Susan T. Dumais,et al.  Improving Web Search Ranking by Incorporating User Behavior Information , 2019, SIGIR Forum.

[22]  Matthew Richardson,et al.  Predicting clicks: estimating the click-through rate for new ads , 2007, WWW '07.

[23]  Yuchen Zhang,et al.  Characterizing search intent diversity into click models , 2011, WWW.

[24]  Ryen W. White,et al.  Predicting short-term interests using activity-based search context , 2010, CIKM.

[25]  Shuang-Hong Yang,et al.  Collaborative competitive filtering: learning recommender using context of user choice , 2011, SIGIR.

[26]  J. Shane Culpepper,et al.  Including summaries in system evaluation , 2009, SIGIR.

[27]  Andrei Z. Broder,et al.  Competing for users' attention: on the interplay between organic and sponsored search results , 2010, WWW '10.

[28]  Enhong Chen,et al.  Context-aware ranking in web search , 2010, SIGIR '10.