INFORMATION VALUE THEORY IN QUERY AUGMENTATION

Information retrieval (IR) systems interact with users by re turning a ranked list of relevant documents in response to a query. Through feedback mechanisms such as relevance feedback and automated keyword expansion, IR systems attempt to guide users in constructing search queries which better represent their informa ti n needs. These mechanisms, however, do not offer the user more insight into the content of the documents in the IR database nor do they provide direction as to which sea r t rms might yield better search results in terms of relevance and ce rtainty that the retrieved document contains the information the user intended to retrieve. This pape r pres nts a methodology based on the decision-analytic concept of expected value of per fect information for controlling query augmentation in information retrieval. The system dynamically learns the content of the documents in the database to com pute the utility (measured in terms of relevance) of retrieving certain documents in response to queries, where the words in the queries represent the random variables. By c omputing the expected value of perfect information for each query term, the syst m either suggests new search terms or suggests that the user terminate the search.

[1]  Vijay V. Raghavan,et al.  A critical analysis of vector space model for information retrieval , 1986, J. Am. Soc. Inf. Sci..

[2]  Michael McGill,et al.  A performance evaluation of similarity measures, document term weighting schemes and representations in a Boolean environment , 1980, SIGIR '80.

[3]  Alice M. Agogino,et al.  Text analysis for constructing design representations , 1997, Artif. Intell. Eng..

[4]  William S. Cooper,et al.  On selecting a measure of retrieval effectiveness , 1973, J. Am. Soc. Inf. Sci..

[5]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[6]  Gerda Ruge,et al.  Human memory models and term association , 1995, SIGIR '95.

[7]  Padmini Srinivasan,et al.  Fuzzy versus probabilistic models for user relevance judgments , 1990, J. Am. Soc. Inf. Sci..

[8]  David Craig,et al.  The importance of drawing in the mechanical design process , 1990, Comput. Graph..

[9]  Robert M. Fung,et al.  Applying Bayesian networks to information retrieval , 1995, CACM.

[10]  Bruce R. Hartsough,et al.  A streamlined approach for calculating expected utility and expected value of perfect information , 1990, Decis. Support Syst..

[11]  Michael D. Gordon,et al.  A utility theoretic examination of the probability ranking principle in information retrieval , 1991, J. Am. Soc. Inf. Sci..

[12]  Michael Woodroofe,et al.  Probability with applications , 1976 .

[13]  Andrew B. Whinston,et al.  A decision theoretic approach to information retrieval , 1990, TODS.

[14]  William S. Cooper,et al.  On selecting a measure of retrieval effectiveness part II. Implementation of the philosophy , 1973, J. Am. Soc. Inf. Sci..

[15]  Joel L Fagan,et al.  Experiments in Automatic Phrase Indexing For Document Retrieval: A Comparison of Syntactic and Non-Syntactic Methods , 1987 .

[16]  William S. Cooper The paradoxical role 0f unexamined documents in the evaluation of retrieval effectiveness , 1976, Inf. Process. Manag..

[17]  Stuart J. Russell,et al.  On Optimal Game-Tree Search using Rational Meta-Reasoning , 1989, IJCAI.

[18]  Thomas G. Dietterich,et al.  A model of the mechanical design process based on empirical data , 1988, Artificial Intelligence for Engineering Design, Analysis and Manufacturing.

[19]  Alice M. Agogino,et al.  Intelligent real time design: application to prototype selection , 1991 .

[20]  Fredric C. Gey,et al.  Experiments in the Probabilistic Retrieval of Full Text Documents , 1994, TREC.

[21]  Thorsten Brants,et al.  Natural Language Processing in Information Retrieval , 2003, CLIN.