What Can Task Teach Us About Query Reformulations?

A significant amount of prior research has been devoted to understanding query reformulations. The majority of these works rely on time-based sessions which are sequences of contiguous queries segmented using time threshold on users’ activities. However, queries are generally issued by users having in mind a particular task, and time-based sessions unfortunately fail in revealing such tasks. In this paper, we are interested in revealing in which extent time-based sessions vs. task-based sessions represent significantly different background contexts to be used in the perspective of better understanding users’ query reformulations. Using insights from large-scale search logs, our findings clearly show that task is an additional relevant search unit that helps better understanding user’s query reformulation patterns and predicting the next user’s query. The findings from our analyses provide potential implications for model design of task-based search engines.

[1]  Dian Tjondronegoro,et al.  Human-computer interaction: the impact of users' cognitive styles on query reformulation behaviour during web searching , 2012, OZCHI.

[2]  Enrique Alfonseca,et al.  Learning to Attend, Copy, and Generate for Session-Based Query Suggestion , 2017, CIKM.

[3]  Daqing He,et al.  Detecting session boundaries from Web user logs , 2000 .

[4]  Eric Horvitz,et al.  Patterns of search: analyzing and modeling Web query refinement , 1999 .

[5]  Soohyung Joo,et al.  Factors that influence query reformulations and search performance in health information retrieval: A multilevel modeling approach , 2017, J. Assoc. Inf. Sci. Technol..

[6]  Craig MacDonald,et al.  Learning to rank query suggestions for adhoc and diversity search , 2012, Information Retrieval.

[7]  Jacek Gwizdka,et al.  Analysis and evaluation of query reformulations in different task types , 2010, ASIST.

[8]  Emine Yilmaz,et al.  Characterizing Users' Multi-Tasking Behavior in Web Search , 2016, CHIIR.

[9]  Ryen W. White,et al.  Supporting Complex Search Tasks , 2014, CIKM.

[10]  Ryen W. White,et al.  Lessons from the journey: a query log analysis of within-session learning , 2014, WSDM.

[11]  Barbara M. Wildemuth,et al.  The effects of domain knowledge on search tactic formulation , 2004, J. Assoc. Inf. Sci. Technol..

[12]  Lynda Tamine,et al.  On the impact of domain expertise on query formulation, relevance assessment and retrieval performance in clinical settings , 2017, Inf. Process. Manag..

[13]  Efthimis N. Efthimiadis,et al.  Analyzing and evaluating query reformulation strategies in web search logs , 2009, CIKM.

[14]  Hua Ouyang,et al.  Learning to Rewrite Queries , 2016, CIKM.

[15]  Huseyin Cenk Özmutlu,et al.  Application of automatic topic identification on Excite Web search engine data logs , 2005, Inf. Process. Manag..

[16]  Amanda Spink,et al.  Patterns of query reformulation during Web searching , 2009, J. Assoc. Inf. Sci. Technol..

[17]  Rosie Jones,et al.  Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs , 2008, CIKM '08.

[18]  Jakob Grue Simonsen,et al.  A Hierarchical Recurrent Encoder-Decoder for Generative Context-Aware Query Suggestion , 2015, CIKM.

[19]  Monika Henzinger,et al.  Analysis of a very large web search engine query log , 1999, SIGF.

[20]  Ryen W. White,et al.  Assessing the scenic route: measuring the value of search trails in web logs , 2010, SIGIR.

[21]  Pernilla Qvarfordt,et al.  Leading people to longer queries , 2013, CHI.

[22]  Soo Young Rieh,et al.  Analysis of multiple query reformulations on the web: The interactive information retrieval context , 2006, Information Processing & Management.

[23]  Pertti Vakkari,et al.  A theory of the task-based information retrieval process: a summary and generalisation of a longitudinal study , 2001, J. Documentation.

[24]  Barbara M. Wildemuth,et al.  Examining the impact of domain and cognitive complexity on query formulation and reformulation , 2018, Inf. Process. Manag..

[25]  Hang Li,et al.  A unified and discriminative model for query refinement , 2008, SIGIR '08.

[26]  Fabrizio Silvestri,et al.  Identifying task-based sessions in search engine query logs , 2011, WSDM '11.

[27]  Ryen W. White,et al.  Struggling and Success in Web Search , 2015, CIKM.

[28]  Pu-Jen Cheng,et al.  Learning user reformulation behavior for query auto-completion , 2014, SIGIR.

[29]  Emine Yilmaz,et al.  User Behaviour and Task Characteristics: A Field Study of Daily Information Behaviour , 2017, CHIIR.

[30]  Jun Wang,et al.  A term-based methodology for query reformulation understanding , 2015, Information Retrieval Journal.

[31]  Emine Yilmaz,et al.  Uncovering Task Based Behavioral Heterogeneities in Online Search Behavior , 2016, SIGIR.

[32]  Matthias Hagen,et al.  From search session detection to search mission detection , 2013, OAIR.