Using context information in structured document retrieval: an approach based on influence diagrams

In this paper we present an Information Retrieval System (IRS) which is able to work with structured document collections. The model is based on the influence diagrams formalism: a generalization of Bayesian Networks that provides a visual representation of a decision problem. These offer an intuitive way to identify and display the essential elements of the domain (the structured document components and their usefulness) and also how these are related to each other. They have also associated quantitative knowledge that measures the strength of the interactions. By means of this approach, we shall present structured retrieval as a decision-making problem. Two different models have been designed: SID (Simple Influence Diagram) and CID (Context-based Influence Diagram). The main difference between these two models is that the latter also takes into account influences provided by the context in which each structural component is located.

[1]  Luis M. de Campos,et al.  An information retrieval model based on simple Bayesian networks , 2003, Int. J. Intell. Syst..

[2]  Sung-Hyon Myaeng,et al.  A flexible model for retrieval of SGML documents , 1998, SIGIR '98.

[3]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[4]  Benjamin Piwowarski,et al.  Bayesian Networks and INEX , 2002, INEX Workshop.

[5]  Patrick Brézillon,et al.  Lecture Notes in Artificial Intelligence , 1999 .

[6]  Ross D. Shachter Evaluating Influence Diagrams , 1986, Oper. Res..

[7]  Nevin Lianwen Zhang,et al.  Probabilistic Inference in Influence Diagrams , 1998, Comput. Intell..

[8]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[9]  Luis M. de Campos,et al.  Two term-layers: an alternative topology for representing term relationships in the Bayesian Network Retrieval Model , 2003 .

[10]  Luis M. de Campos,et al.  The BNR model: foundations and performance of a Bayesian network-based retrieval model , 2003, Int. J. Approx. Reason..

[11]  Rajkumar Roy,et al.  Advances in Soft Computing: Engineering Design and Manufacturing , 1998 .

[12]  Luis M. de Campos,et al.  A Multi-layered Bayesian Network Model for Structured Document Retrieval , 2003, ECSQARU.

[13]  W. Bruce Croft,et al.  Inference networks for document retrieval , 1989, SIGIR '90.

[14]  Berthier A. Ribeiro-Neto,et al.  A belief network model for IR , 1996, SIGIR '96.

[15]  Donna Harman,et al.  Information Processing and Management , 2022 .

[16]  Mounia Lalmas,et al.  Video retrieval using an MPEG-7 based inference network , 2002, SIGIR '02.

[17]  Ross D. Shachter Probabilistic Inference and Influence Diagrams , 1988, Oper. Res..

[18]  Finn V. Jensen,et al.  Bayesian Networks and Decision Graphs , 2001, Statistics for Engineering and Information Science.

[19]  Gregory F. Cooper,et al.  The Computational Complexity of Probabilistic Inference Using Bayesian Belief Networks , 1990, Artif. Intell..

[20]  Yves Chiaramella,et al.  Information Retrieval and Structured Documents , 2000, ESSIR.