Beyond DCG: user behavior as a predictor of a successful search

Web search engines are traditionally evaluated in terms of the relevance of web pages to individual queries. However, page relevance does not give the complete picture: an individual query may represent only part of the user's information need, and different users may have different information needs underlying the same query. In this work, we address the problem of predicting whether a user's search goal succeeded by modeling user behavior. We show empirically that user behavior alone can give an accurate picture of the success of the user's web search goals, without considering the relevance of the documents displayed. In fact, our experiments show that models using user behavior are more predictive of goal success than those using document relevance. We build novel sequence models that incorporate time distributions for this task, and our experiments show that these models are more accurate than both static models based on user behavior and predictions based on document relevance.
