Modeling and predicting the task-by-task behavior of search engine users

Web search engines answer user needs in a query-by-query fashion: for each issued query they retrieve the set of most relevant results, independently of any other query. However, users often submit queries to perform multiple, related tasks. In this paper, we first discuss a methodology for discovering, from query logs, the latent tasks performed by users. We then introduce the Task Relation Graph (TRG) as a representation of users' search behaviors from a task-by-task perspective. The task-by-task behavior is captured by weighting the edges of the TRG with a relatedness score computed between pairs of tasks, as mined from the query log. We validate our approach on a concrete application, namely a task recommender system, which suggests related tasks to users on the basis of the task predictions derived from the TRG. Finally, we show that the task recommendations generated by our solution are beyond the reach of existing query suggestion schemes, and that our method recommends tasks that users are likely to perform in the near future.
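
To make the idea of a Task Relation Graph concrete, the following is a minimal sketch, not the method evaluated in the paper. It assumes the latent tasks have already been discovered and that each user's log is reduced to an ordered list of task identifiers. The relatedness score used here (the fraction of users who performed task a and later performed task b) is an illustrative assumption, as are the names `build_trg`, `recommend`, and the toy task labels.

```python
from collections import defaultdict
from itertools import permutations


def build_trg(user_tasks):
    """Build a directed, weighted task graph from per-user task sequences.

    user_tasks maps a user id to the ordered list of task ids that user
    performed. The edge weight a -> b is an ASSUMED relatedness score:
    (users who did a and later did b) / (users who did a).
    """
    support = defaultdict(int)     # users who performed each task
    pair_count = defaultdict(int)  # users who performed a, then later b

    for tasks in user_tasks.values():
        first_pos = {}
        last_pos = {}
        for i, task in enumerate(tasks):
            first_pos.setdefault(task, i)  # earliest occurrence
            last_pos[task] = i             # latest occurrence
        for task in first_pos:
            support[task] += 1
        for a, b in permutations(first_pos, 2):
            # b counts as following a if some b occurs after the first a
            if first_pos[a] < last_pos[b]:
                pair_count[(a, b)] += 1

    trg = defaultdict(dict)
    for (a, b), n in pair_count.items():
        trg[a][b] = n / support[a]
    return trg


def recommend(trg, task, k=3):
    """Return up to k tasks most related to `task` (ties broken by name)."""
    neighbours = trg.get(task, {})
    return sorted(neighbours, key=lambda t: (-neighbours[t], t))[:k]


# Hypothetical toy log: task labels are made up for illustration.
logs = {
    "u1": ["book_flight", "book_hotel", "rent_car"],
    "u2": ["book_flight", "book_hotel"],
    "u3": ["book_flight", "rent_car"],
    "u4": ["book_hotel", "find_restaurant"],
}
trg = build_trg(logs)
suggestions = recommend(trg, "book_flight", k=2)
```

In this sketch a recommendation is simply the set of highest-weighted outgoing edges of the task the user has just performed; any other relatedness measure mined from the log could be substituted for the edge weight without changing the structure.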
