Modeling and analysis of cross-session search tasks

The information needs of search engine users vary in complexity, depending on the task they are trying to accomplish. Some simple needs can be satisfied with a single query, whereas others require a series of queries issued over a longer period of time. While search engines effectively satisfy many simple needs, searchers receive little support when their information needs span session boundaries. In this work, we propose methods for modeling and analyzing user search behavior that extends over multiple search sessions. We focus on two problems: (i) given a user query, identify all of the related queries from previous sessions that the same user has issued, and (ii) given a multi-query task for a user, predict whether the user will return to this task in the future. We model both problems within a classification framework that uses features of individual queries and long-term user search behavior at different granularity. Experimental evaluation of the proposed models for both tasks indicates that it is possible to effectively model and analyze cross-session search behavior. Our findings have implications for improving search for complex information needs and designing search engine features to support cross-session search tasks.

[1]  Raymond J. Mooney,et al.  Learning to Disambiguate Search Queries from Short Sessions , 2009, ECML/PKDD.

[2]  Steve Fox,et al.  Evaluating implicit measures to improve web search , 2005, TOIS.

[3]  Jaime Teevan,et al.  Information re-retrieval: repeat queries in Yahoo's logs , 2007, SIGIR.

[4]  Bonnie MacKay,et al.  Exploring multi-session web tasks , 2008, CHI.

[5]  Filip Radlinski,et al.  Inferring query intent from reformulations and clicks , 2010, WWW '10.

[6]  Benjamin Piwowarski,et al.  Predictive user click models based on click-through history , 2007, CIKM '07.

[7]  Daqing He,et al.  Combining evidence for automatic Web session identification , 2002, Inf. Process. Manag..

[8]  Susan T. Dumais,et al.  Evaluating implicit measures to improve the search experiences , 2003 .

[9]  Ji-Rong Wen,et al.  Clustering user queries of a search engine , 2001, WWW '01.

[10]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[11]  Doug Beeferman,et al.  Agglomerative clustering of a search engine query log , 2000, KDD '00.

[12]  Nicholas J. Belkin,et al.  Personalizing information retrieval for multi-session tasks: the roles of task stage and task type , 2010, SIGIR '10.

[13]  Francesco Bonchi,et al.  Do you want to take notes?: identifying research missions in Yahoo! search pad , 2010, WWW '10.

[14]  Enhong Chen,et al.  Context-aware query classification , 2009, SIGIR.

[15]  Benjamin Piwowarski,et al.  Mining user web search activity with layered bayesian networks or how to capture a click in its context , 2009, WSDM '09.

[16]  Ravi Kumar,et al.  An analysis framework for search sequences , 2009, CIKM.

[17]  ChengXiang Zhai,et al.  Mining long-term search history to improve search accuracy , 2006, KDD '06.

[18]  Ryen W. White,et al.  WWW 2007 / Track: Browsers and User Interfaces Session: Personalization Investigating Behavioral Variability in Web Search , 2022 .

[19]  Carolyn Watters,et al.  A field study characterizing Web-based information-seeking tasks , 2007 .

[20]  Xuehua Shen,et al.  Context-sensitive information retrieval using implicit feedback , 2005, SIGIR '05.

[21]  Anne Aula,et al.  How does search behavior change as search becomes more difficult? , 2010, CHI.

[22]  Wen-Hsiang Lu,et al.  Identifying User Goals from Web Search Results , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[23]  Doug Downey,et al.  Understanding the relationship between searchers' queries and information goals , 2008, CIKM '08.

[24]  Susan T. Dumais,et al.  Individual differences in gaze patterns for web search , 2010, IIiX.

[25]  Filip Radlinski,et al.  Query chains: learning to rank from implicit feedback , 2005, KDD '05.

[26]  Ryen W. White,et al.  Predicting user interests from contextual information , 2009, SIGIR.

[27]  Aristides Gionis,et al.  The query-flow graph: model and applications , 2008, CIKM '08.

[28]  Mika Käki,et al.  Information search and re-access strategies of experienced web users , 2005, WWW '05.

[29]  Yiming Yang,et al.  A re-examination of text categorization methods , 1999, SIGIR '99.

[30]  Zhenyu Liu,et al.  Automatic identification of user goals in Web search , 2005, WWW '05.

[31]  Dan Morris,et al.  SearchBar: a search-centric web history for task resumption and information re-finding , 2008, CHI.

[32]  Rosie Jones,et al.  Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs , 2008, CIKM '08.

[33]  Ahmed Hassan Awadallah,et al.  Beyond DCG: user behavior as a predictor of a successful search , 2010, WSDM '10.

[34]  Enhong Chen,et al.  Context-aware query suggestion by mining click-through and session data , 2008, KDD.

[35]  Daniel E. Rose,et al.  Understanding user goals in web search , 2004, WWW '04.