A probabilistic relational model for the integration of IR and databases

In this paper, a probabilistic relational model is presented which combines relational algebra with probabilistic retrieval. Based on certain independence assumptions, the operators of the relational algebra are redefined such that the probabilistic algebra is a generalization of the standard relational algebra. Furthermore, a special join operator implementing probabilistic retrieval is proposed. When applied to typical document databases, queries can not only ask for documents, but for any kind of object in the database. In addition, an implicit ranking of these objects is provided in case the query relates to probabilistic indexing or uses the probabilistic join operator. The proposed algebra is intended as a standard interface to combined database and IR systems, as a basis for implementing user-friendly interfaces.

[1]  Chris Buckley,et al.  Optimizing Document Indexing and Search Term Weighting Based on Probabilistic Models , 1992, TREC.

[2]  C. J. Date An Introduction to Database Systems , 1975 .

[3]  Michael Pittarelli,et al.  The Theory of Probabilistic Databases , 1987, VLDB.

[4]  Norbert Fuhr,et al.  Integration of probabilistic fact and text retrieval , 1992, SIGIR '92.

[5]  Henri Prade,et al.  Generalizing Database Relational Algebra for the Treatment of Incomplete/Uncertain Information and Vague Queries , 1984, Inf. Sci..

[6]  Norbert Fuhr,et al.  Aufwandsabschätzung für die Prozessierung vager Anfragen auf der Basis des Datenstrom-Ansatzes , 1993, BTW.

[7]  Bipin C. Desai,et al.  Non-first normal form universal relations: an application to information retrieval systems , 1987, Inf. Syst..

[8]  B. Buckles,et al.  A fuzzy representation of data for relational databases , 1982 .

[9]  Hans-Jörg Schek,et al.  Data Structures for an Integrated Data Base Management and Information Retrieval System , 1982, VLDB.

[10]  Suk Kyoon Lee,et al.  An Extended Relational Database Model for Uncertain and Imprecise Information , 1992, VLDB.

[11]  Abraham Bookstein,et al.  Outline of a General Probabilistic Retrieval Model , 1983, J. Documentation.

[12]  Judea Pearl,et al.  Reasoning with belief functions: An analysis of compatibility , 1990, Int. J. Approx. Reason..

[13]  Norbert Fuhr,et al.  Models for retrieval with probabilistic indexing , 1989, Inf. Process. Manag..

[14]  Hector Garcia-Molina,et al.  A Probalilistic Relational Data Model , 1990, EDBT.

[15]  Norbert Fuhr,et al.  A Probabilistic Framework for Vague Queries and Imprecise Information in Databases , 1990, VLDB.

[16]  C. J. Date An Introduction to Database Systems, Volume I, 5th Edition , 1986 .

[17]  Ian A. Macleod Text retrieval and the relational model , 1991, J. Am. Soc. Inf. Sci..

[18]  David C. Blair,et al.  An extended relational document retrieval model , 1988, Inf. Process. Manag..