A Usefulness-based Approach for Measuring the Local and Global Effect of IIR Services

In Interactive Information Retrieval (IIR) different services such as search term suggestion can support users in their search process. The applicability and performance of such services is either measured with different user-centered studies (like usability tests or laboratory experiments) or, in the context of IR, with their contribution to measures like precision and recall. However, each evaluation methodology has its certain disadvantages. For example, user-centered experiments are often costly and small-scaled; IR experiments rely on relevance assessments and measure only relevance of documents. In this work we operationalize the usefulness model of Cole et al. (2009) on the level of system support to measure not only the local effect of an IR service, but the impact it has on the whole search process. We therefore use a log-based evaluation approach which models user interactions within sessions with positive signals and apply it for the case of a search term suggestion service. We found that the usage of the service significantly often implicates the occurrence of positive signals during the following session steps.

[1]  Leif Azzopardi Usage based effectiveness measures: monitoring application performance in information retrieval , 2009, CIKM.

[2]  Hsinchun Chen,et al.  Interactive term suggestion for users of digital libraries: using subject thesauri and co-occurrence lists for information retrieval , 1996, DL '96.

[3]  Jaime Teevan,et al.  Implicit feedback for inferring user preference: a bibliography , 2003, SIGF.

[4]  Andrew Trotman,et al.  Comparative analysis of clicks and judgments for IR evaluation , 2009, WSCD '09.

[5]  Nicholas J. Belkin,et al.  A Model for Evaluation of Interactive Information Retrieval , 2009 .

[6]  Nicholas J. Belkin,et al.  Whole-Session Evaluation of Interactive Information Retrieval Systems (NII Shonan Meeting 2012-7) , 2012, NII Shonan Meet. Rep..

[7]  W. Bruce Croft,et al.  Query expansion using local and global document analysis , 1996, SIGIR '96.

[8]  Johanna Enberg,et al.  Query Expansion , 2018, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[9]  Ellen M. Voorhees,et al.  TREC: Experiment and Evaluation in Information Retrieval (Digital Libraries and Electronic Publishing) , 2005 .

[10]  Philipp Mayr,et al.  Digital Library Research in Action: Supporting Information Retrieval in Sowiport , 2015, D Lib Mag..

[11]  Peter Ingwersen,et al.  ON THE HOLISTIC COGNITIVE THEORY FOR INFORMATION RETRIEVAL Drifting Outside the Border of the Laboratory Framework , 2007 .

[12]  Amanda Spink,et al.  Patterns and transitions of query reformulation during web searching , 2007, Int. J. Web Inf. Syst..

[13]  Susan Dumais,et al.  Whole-session evaluation of interactive information retrieval systems Compilation of Homework , 2012 .

[14]  Yen-Jen Oyang,et al.  Relevant term suggestion in interactive web search based on contextual information in query session logs , 2003, J. Assoc. Inf. Sci. Technol..

[15]  Paul Thomas,et al.  Using Interaction Data to Explain Difficulty Navigating Online , 2014, TWEB.

[16]  Chris Buckley,et al.  Improving automatic query expansion , 1998, SIGIR '98.

[17]  Ben Carterette,et al.  Evaluating multi-query sessions , 2011, SIGIR.

[18]  Jaap Kamps,et al.  A Search Log-Based Approach to Evaluation , 2010, ECDL.

[19]  Philipp Mayr,et al.  A Novel Combined Term Suggestion Service for Domain-Specific Digital Libraries , 2011, TPDL.

[20]  Thorsten Joachims,et al.  Accurately interpreting clickthrough data as implicit feedback , 2005, SIGIR '05.

[21]  Fabrizio Silvestri,et al.  Identifying task-based sessions in search engine query logs , 2011, WSDM '11.

[22]  Thorsten Joachims,et al.  Accurately Interpreting Clickthrough Data as Implicit Feedback , 2017 .

[23]  Stephen E. Robertson,et al.  Query Expansion with Long-Span Collocates , 2003, Information Retrieval.

[24]  W. Bruce Croft,et al.  Quary Expansion Using Local and Global Document Analysis , 1996, SIGIR Forum.

[25]  Philipp Mayr,et al.  A framework for specific term recommendation systems , 2013, SIGIR.

[26]  York Sure-Vetter,et al.  Science models as value-added services for scholarly information systems , 2011, Scientometrics.

[27]  Susan T. Dumais,et al.  The vocabulary problem in human-system communication , 1987, CACM.

[28]  Daniel Hienert,et al.  WHOSE - A Tool for Whole-Session Analysis in IIR , 2015, ECIR.

[29]  Nicholas J. Belkin,et al.  Personalization of search results using interaction behaviors in search sessions , 2012, SIGIR '12.

[30]  Filip Radlinski,et al.  Evaluating the accuracy of implicit feedback from clicks and query reformulations in Web search , 2007, TOIS.

[31]  Philipp Mayr,et al.  Improving Retrieval Results with Discipline-Specific Query Expansion , 2012, TPDL.

[32]  J. Liu,et al.  Usefulness as the Criterion for Evaluation of Interactive Information Retrieval , 2009 .

[33]  Barbara M. Wildemuth,et al.  The effects of domain knowledge on search tactic formulation , 2004, J. Assoc. Inf. Sci. Technol..

[34]  Giorgio Maria Di Nunzio,et al.  Web log analysis: a review of a decade of studies about information acquisition, inspection and interpretation of user interaction , 2011, Data Mining and Knowledge Discovery.

[35]  Steve Fox,et al.  Evaluating implicit measures to improve web search , 2005, TOIS.

[36]  Wei Chu,et al.  Learning to extract cross-session search tasks , 2013, WWW.

[37]  Nicholas J. Belkin,et al.  Cases, scripts, and information-seeking strategies: On the design of interactive information retrieval systems , 1995 .

[38]  Diane Kelly,et al.  Methods for Evaluating Interactive Information Retrieval Systems with Users , 2009, Found. Trends Inf. Retr..

[39]  Bernard J. Jansen,et al.  Search log analysis: What it is, what's been done, how to do it , 2006 .

[40]  Iris Xie,et al.  Dimensions of tasks: influences on information-seeking and retrieving process , 2009, J. Documentation.

[41]  José Luis Vicedo González,et al.  TREC: Experiment and evaluation in information retrieval , 2007, J. Assoc. Inf. Sci. Technol..