A loosely-coupled integration of a text retrieval system and an object-oriented database system

Document management systems are needed for many business applications. This type of system would combine the functionality of a database system, (for describing, storing and maintaining documents with complex structure and relationships) with a text retrieval system (for effective retrieval based on full text). The retrieval model for a document management system is complicated by the variety and complexity of the objects that are represented. In this paper, we describe an approach to complex object retrieval using a probabilistic inference net model, and an implementation of this approach using a loose coupling of an object-oriented database system (IRIS) and a text retrieval system based on inference nets (INQUERY). The resulting system is used to store long, structured documents and can retrieve document components (sections, figures, etc.) based on their contents or the contents of related components. The lessons learnt from the implementation are discussed.

[1]  Norbert Fuhr,et al.  A Probabilistic Framework for Vague Queries and Imprecise Information in Databases , 1990, VLDB.

[2]  W. Bruce Croft,et al.  Evaluation of an inference network-based retrieval model , 1991, TOIS.

[3]  Kevin Wilkinson,et al.  The Iris Architecture and Implementation , 1990, IEEE Trans. Knowl. Data Eng..

[4]  David Maier,et al.  Readings in Object-Oriented Database Systems , 1989 .

[5]  Jay Banerjee,et al.  Data model issues for object-oriented applications , 1987, TOIS.

[6]  Hans-Jörg Schek,et al.  Methods for the administration of textual data in database systems , 1980, SIGIR '80.

[7]  David C. Blair,et al.  An extended relational document retrieval model , 1988, Inf. Process. Manag..

[8]  W. Bruce Croft,et al.  Interactive retrieval of complex documents , 1990, Inf. Process. Manag..

[9]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[10]  G Salton,et al.  Global Text Matching for Information Retrieval , 1991, Science.

[11]  Michael Stonebraker,et al.  Extended User-Defined Indexing with Application to Textual Databases , 1988, VLDB.

[12]  David Maier,et al.  Development and Implementation of an Object-Oriented DBMS , 1987, Research Directions in Object-Oriented Programming.

[13]  W. Bruce Croft,et al.  The use of phrases and structured queries in information retrieval , 1991, SIGIR '91.

[14]  Amihai Motro,et al.  VAGUE: a user interface to relational databases that permits vague queries , 1988, TOIS.

[15]  Hector Garcia-Molina,et al.  A Probalilistic Relational Data Model , 1990, EDBT.

[16]  Alberto O. Mendelzon,et al.  Expressing structural hypertext queries in graphlog , 1989, Hypertext.

[17]  Kevin Wilkinson,et al.  Overview of the Iris DBMS , 1989, Research Foundations in Object-Oriented and Semantic Database Systems.

[18]  W. Bruce Croft,et al.  The INQUERY Retrieval System , 1992, DEXA.

[19]  Jurgen Annevelink Database programming languages: a functional approach , 1991, SIGMOD '91.

[20]  W. Bruce Croft,et al.  Retrieval of Complex Objects , 1992, EDBT.