Versioning a full-text information retrieval system

In this paper, we present an approach to the incorporation of object versioning into a distributed full-text information retrieval system. We propose an implementation based on “partially versioned” index sets, arguing that its space overhead and query-time performance make it suitable for full-text IR, with its heavy dependence on inverted indexing. We develop algorithms for computing both historical queries and time range queries and show how these algorithms can be applied to a number of problems in distributed information management, such as data replication, caching, transactional consistency, and hybrid media repositories.

[1]  Klaus R. Dittrich,et al.  Version Support for Engineering Database Systems , 1988, IEEE Trans. Software Eng..

[2]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[3]  Shashi K. Gadia,et al.  A homogeneous relational model and query languages for temporal databases , 1988, TODS.

[4]  Shmuel Tomi Klein,et al.  Construction of optimal graphs for bit-vector compression , 1989, SIGIR '90.

[5]  W. Bruce Croft,et al.  I3R: A new approach to the design of document retrieval systems , 1987, J. Am. Soc. Inf. Sci..

[6]  W. Bruce Croft,et al.  I 3 R: a new approach to the design of document retrieval systems , 1987 .

[7]  Peter G. Anick,et al.  A direct manipulation interface for boolean information retrieval via natural language query , 1989, SIGIR '90.

[8]  James H. Coombs Hypertext, full text, and automatic linking , 1989, SIGIR '90.

[9]  Michael Stonebraker,et al.  The POSTGRES Data Model , 1987, Research Foundations in Object-Oriented and Semantic Database Systems.

[10]  Michael Stonebraker,et al.  The Design of the POSTGRES Storage System , 1988, VLDB.

[11]  Forbes J. Burkowski,et al.  An Algebra for Hierarchically Organized Text-Dominate Databases , 1992, Inf. Process. Manag..

[12]  Christer Hulten,et al.  An architecture for object management in OIS , 1984, TOIS.

[13]  Antonin Guttman,et al.  R-trees: a dynamic index structure for spatial searching , 1984, SIGMOD '84.

[14]  Randy H. Katz,et al.  Database Support for Versions and Alternatives of Large Design Files , 1984, IEEE Transactions on Software Engineering.

[15]  Brewster Kahle,et al.  An information system for corporate users: wide area information servers , 1991 .

[16]  Curtis P. Kolovson Indexing techniques for historical databases , 1989, [1989] Proceedings. Fifth International Conference on Data Engineering.

[17]  A. Guttman,et al.  A Dynamic Index Structure for Spatial Searching , 1984, SIGMOD 1984.

[18]  Setrag Khoshafian,et al.  Object identity , 1986, OOPLSA '86.

[19]  Richard T. Snodgrass,et al.  Temporal databases status and research directions , 1990, SGMD.

[20]  David Robson,et al.  Smalltalk-80: The Language and Its Implementation , 1983 .

[21]  Peter G. Anick,et al.  Addressing the requirements of a dynamic corporate textual information base , 1991, SIGIR '91.

[22]  William Kent,et al.  An Overview of the Versioning Problem , 1989, SIGMOD Conference.