Evaluating implicit feedback models using searcher simulations

In this article we describe an evaluation of relevance feedback (RF) algorithms using searcher simulations. Since these algorithms select additional terms for query modification based on inferences made from searcher interaction, not on relevance information searchers explicitly provide (as in traditional RF), we refer to them as implicit feedback models. We introduce six different models that base their decisions on the interactions of searchers and use different approaches to rank query modification terms. The aim of this article is to determine which of these models should be used to assist searchers in the systems we develop. To evaluate these models we used searcher simulations that afforded us more control over the experimental conditions than experiments with human subjects and allowed complex interaction to be modeled without the need for costly human experimentation. The simulation-based evaluation methodology measures how well the models learn the distribution of terms across relevant documents (i.e., learn what information is relevant) and how well they improve search effectiveness (i.e., create effective search queries). Our findings show that an implicit feedback model based on Jeffrey's rule of conditioning outperformed other models under investigation.

[1]  Donna K. Harman,et al.  Overview of the first TREC conference , 1993, SIGIR.

[2]  Diane Kelly,et al.  Implicit feedback for inferring user preference , 2003 .

[3]  Stuart K. Card,et al.  Information foraging in information access environments , 1995, CHI '95.

[4]  S. E. Robertson,et al.  On Relevance weight estimation and Query Expansion , 1986, J. Documentation.

[5]  Stephen E. Robertson,et al.  On Term Selection for Query Expansion , 1991, J. Documentation.

[6]  Julie Chen,et al.  The bloodhound project: automating discovery of web usability issues using the InfoScentπ simulator , 2003, CHI '03.

[7]  Mark Sanderson,et al.  Advantages of query biased summaries in information retrieval , 1998, SIGIR '98.

[8]  Carol L. Barry Document Representations and Clues to Document Relevance , 1998, J. Am. Soc. Inf. Sci..

[9]  Ryen W. White,et al.  A study of topic similarity measures , 2004, SIGIR '04.

[10]  Ed H. Chi,et al.  Using information scent to model user information needs and actions and the Web , 2001, CHI.

[11]  Ryen W. White Implicit feedback for interactive information retrieval , 2005, SIGF.

[12]  Susan T. Dumais,et al.  WaveLens: a new view onto Internet search results , 2004, CHI.

[13]  Yoichi Shinoda,et al.  Information filtering based on user behavior analysis and best match text retrieval , 1994, SIGIR '94.

[14]  Ryen W. White,et al.  An approach for implicitly detecting information needs , 2003, CIKM '03.

[15]  Amanda Spink,et al.  From Highly Relevant to Not Relevant: Examining Different Regions of Relevance , 1998, Inf. Process. Manag..

[16]  Ryen W. White,et al.  Using top-ranking sentences to facilitate effective information access , 2005, J. Assoc. Inf. Sci. Technol..

[17]  Ryen W. White,et al.  An implicit feedback approach for interactive information retrieval , 2006, Inf. Process. Manag..

[18]  James Allan,et al.  The effect of adding relevance information in a relevance feedback environment , 1994, SIGIR '94.

[19]  C. J. van Rijsbergen,et al.  Incorporating user search behavior into relevance feedback , 2003, J. Assoc. Inf. Sci. Technol..

[20]  Richard W. Hamming,et al.  Error detecting and error correcting codes , 1950 .

[21]  Peter Ingwersen,et al.  Dimensions of relevance , 2000, Inf. Process. Manag..

[22]  Mark Magennis,et al.  The potential and actual effectiveness of interactive query expansion , 1997, SIGIR '97.

[23]  Pia Borlund,et al.  The IIR evaluation model: a framework for evaluation of interactive information retrieval systems , 2003, Inf. Res..

[24]  Amanda Spink,et al.  Real life, real users, and real needs: a study and analysis of user queries on the web , 2000, Inf. Process. Manag..

[25]  Ryen W. White,et al.  The Use of Implicit Evidence for Relevance Feedback in Web Retrieval , 2002, ECIR.

[26]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[27]  Jaime Teevan,et al.  Implicit feedback for inferring user preference: a bibliography , 2003, SIGF.

[28]  Javed Mostafa,et al.  Detection of shifts in user interests for personalized information filtering , 1996, SIGIR '96.

[29]  R. Jeffrey The Logic of Decision , 1984 .

[30]  Gerard Salton,et al.  Improving retrieval performance by relevance feedback , 1997, J. Am. Soc. Inf. Sci..

[31]  Prof A. Bryson Appendixes , 2003 .

[32]  Diane Kelly Understanding implicit feedback and document preference: a naturalistic user study , 2004, SIGF.

[33]  Ryen W. White,et al.  Using top-ranking sentences to facilitate effective information access: Book Reviews , 2005 .

[34]  Carol L. Barry Document representations and clues to document relevance , 1998 .

[35]  Ryen W. White,et al.  A task-oriented study on the influencing effects of query-biased summarisation in web searching , 2003, Inf. Process. Manag..

[36]  Iain Campbell,et al.  The ostensive model of developing information needs , 2000 .

[37]  Matthew Chalmers,et al.  The Order of Things: Activity-Centred Information Access, , 1998, Comput. Networks.

[38]  SpinkAmanda,et al.  From highly relevant to not relevant , 1998 .

[39]  Javed Mostafa,et al.  Simulation Studies of Different Dimensions of Users' Interests and their Impact on User Modeling and Information Filtering , 2003, Information Retrieval.

[40]  Gerard Salton,et al.  The SMART Retrieval System—Experiments in Automatic Document Processing , 1971 .

[41]  Ian Ruthven,et al.  Re-examining the potential effectiveness of interactive query expansion , 2003, SIGIR.

[42]  S. Siegel,et al.  Nonparametric Statistics for the Behavioral Sciences , 2022, The SAGE Encyclopedia of Research Design.

[43]  Tefko Saracevic,et al.  RELEVANCE: A review of and a framework for the thinking on the notion in information science , 1997, J. Am. Soc. Inf. Sci..

[44]  Jonathan Furner,et al.  On recommending , 2002, J. Assoc. Inf. Sci. Technol..

[45]  Ryen W. White,et al.  Finding relevant documents using top ranking sentences: an evaluation of two alternative schemes , 2002, SIGIR '02.

[46]  C. J. van Rijsbergen,et al.  Probabilistic Retrieval Revisited , 1992, Comput. J..

[47]  Jock D. Mackinlay,et al.  The impact of fluid documents on reading and browsing: an observational study , 2000, CHI.

[48]  Ryen W. White,et al.  A Simulated Study of Implicit Feedback Models , 2004, ECIR.