Adapting a diagnostic problem-solving model to information retrieval

Abstract In this paper, a competition-based connectionist model for diagnostic problem-solving is adapted to information retrieval. In this model, we treat documents as ‘disorders’ and user information needs as ‘manifestations’, and a competitive activation mechanism is used which converges to a set of documents that best explain the given user information needs. By combining the ideas of Bayesian inferencing and diagnostic inferencing using parsimonious covering theory, this model removes many difficulties of direct application of Bayesian inference, such as the unrealistically large number of conditional probabilities required in the knowledge base, the computational complexity, and certain unreasonable independence assumptions. Also, Bayesian inference strengthens the parsimonious covering theory by providing a likelihood measure which can be used to rank documents as well as to guide the retrieval to the most likely set of documents. We also incorporate thesaurus information to provide semantic relevance among the index terms. Our experimental results using 4 standard document collections demonstrate the efficiency and the retrieval effectiveness of the thesaurus-based model, comparable to or better than that of various information retrieval models reported in the literature.

[1]  James R. Driscoll,et al.  Incorporating Semantics Within a Connectionist Model and a Vector Processing Model , 1993, TREC.

[2]  James A. Reggia,et al.  A formal model of diagnostic inference. I. Problem formulation and decomposition , 1985, Inf. Sci..

[3]  Yiyu Yao,et al.  Computation of term associations by a neural network , 1993, SIGIR.

[4]  Jacques Savoy,et al.  Bayesien Inference networks in hypertext , 1991, RIAO Conference.

[5]  Robert M. Fung,et al.  Applying Bayesian networks to information retrieval , 1995, CACM.

[6]  W. Bruce Croft Approaches to Intelligent Information Retrieval , 1987, Inf. Process. Manag..

[7]  W. Robertson,et al.  A neural algorithm for document clustering , 1991, Inf. Process. Manag..

[8]  W. Bruce Croft,et al.  Efficient probabilistic Inference for text retrieval , 1991, RIAO.

[9]  W. Bruce Croft,et al.  Evaluation of an inference network-based retrieval model , 1991, TOIS.

[10]  James R. Driscoll,et al.  The QA System , 1992, TREC.

[11]  Paul Thagard Book review: Abduetive Inference Models for Diagnostic Problem Solving by Y. Peng and J. Reggia (Springer Verlag New York 1990) , 1990 .

[12]  Kostas Tzeras,et al.  Automatic indexing based on Bayesian inference networks , 1993, SIGIR.

[13]  Brian T. Bartell,et al.  Optimizing ranking functions: a connectionist approach to adaptive information retrieval , 1994 .

[14]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[15]  J. Reggia,et al.  Abductive Inference Models for Diagnostic Problem-Solving , 1990, Symbolic Computation.

[16]  Richard E. Neapolitan,et al.  Probabilistic reasoning in expert systems - theory and algorithms , 2012 .

[17]  Donna K. Harman,et al.  Overview of the first TREC conference , 1993, SIGIR.

[18]  Yun Peng,et al.  A Probabilistic Causal Model for Diagnostic Problem Solving Part II: Diagnostic Strategy , 1987, IEEE Transactions on Systems, Man, and Cybernetics.

[19]  Proceedings of The First Text REtrieval Conference, TREC 1992, Gaithersburg, Maryland, USA, November 4-6, 1992 , 1992, TREC.

[20]  Yun Peng,et al.  A Probabilistic Causal Model for Diagnostic Problem Solving Part I: Integrating Symbolic Causal Inference with Numeric Probabilistic Inference , 1987, IEEE Transactions on Systems, Man, and Cybernetics.

[21]  Norbert Fuhr,et al.  AIR/X - A rule-based multistage indexing system for Iarge subject fields , 1991, RIAO.

[22]  James A. Reggia,et al.  A connectionist model for diagnostic problem solving , 1989, IEEE Trans. Syst. Man Cybern..