Efficient Management of Multiversion Documents by Object Referencing

Traditional approaches to versioning documents are edit-based, and represent successive versions using edit scripts. This paper proposes a reference-based version-ing scheme that preserves the rich logical structure of the evolving document via object references. This approach produces better support for queries, and reconciles the storage-level and transport-level representations of multiversioned XML documents. In particular , we present eecient algorithms for supporting projection and selection queries, and for querying the document evolution history. Then, we show that our representation is also eecient at the transport level, where XML documents are exchanged between remote parties. In fact, with the reference-based scheme, an XML document's history can also be viewed and processed as yet another XML document. Finally, we demonstrate the eeectiveness of the new scheme at the storage level, for which we deene a usefulness-based page management policy, adapted from transaction-time databases, to ensure eecient temporal clustering between versions. The experimental evaluation of the new scheme against previous representations used in temporal databases and persistent-object managers shows the performance advantages of the new approach.

[1]  David B. Lomet,et al.  Access methods for multiversion data , 1989, SIGMOD '89.

[2]  Steven J. DeRose,et al.  XML Path Language (XPath) Version 1.0 , 1999 .

[3]  Malcolm C. Easton,et al.  Key-Sequence Data Sets on Inedible Storage , 1986, IBM J. Res. Dev..

[4]  Rakesh M. Verma,et al.  An Efficient Multiversion Access STructure , 1997, IEEE Trans. Knowl. Data Eng..

[5]  Christos Faloutsos,et al.  Access Methods for Bi-Temporal Databases , 1995, Temporal Databases.

[6]  Randy H. Katz,et al.  Managing Change in a Computer-Aided Design Database , 1987, Research Foundations in Object-Oriented and Semantic Database Systems.

[7]  Vassilis J. Tsotras,et al.  Comparison of access methods for time-evolving data , 1999, CSUR.

[8]  Won Kim,et al.  A Unifying Framework for Version Control in a CAD Environment , 1986, VLDB.

[9]  Gultekin Özsoyoglu,et al.  Temporal and Real-Time Databases: A Survey , 1995, IEEE Trans. Knowl. Data Eng..

[10]  Bernhard Seeger,et al.  An asymptotically optimal multiversion B-tree , 1996, The VLDB Journal.

[11]  David B. Leblang The CM challenge: configuration management that works , 1995 .

[12]  Carlo Zaniolo,et al.  Version Management of XML Documents , 2000, WebDB.

[13]  Kaizhong Zhang,et al.  Algorithms for the constrained editing distance between ordered labeled trees and related problems , 1995, Pattern Recognit..

[14]  Robert E. Tarjan,et al.  Making data structures persistent , 1986, STOC '86.

[15]  David J. DeWitt,et al.  Object and File Management in the EXODUS Extensible Database System , 1986, VLDB.

[16]  David Beech,et al.  Generalized version control in an object-oriented database , 1988, Proceedings. Fourth International Conference on Data Engineering.

[17]  Vassilis J. Tsotras,et al.  The Snapshot Index: An I/O-optimal access method for timeslice queries , 1995, Inf. Syst..

[18]  Amélie Marian,et al.  Change-Centric Management of Versions in an XML Warehouse , 2001, VLDB.

[19]  F. Warren Burton,et al.  Implementation of Overlapping B-Trees for Time and Space Efficient Representation of Collections of Similar Files , 1990, Comput. J..

[20]  Jennifer Widom,et al.  Representing and querying changes in semistructured data , 1998, Proceedings 14th International Conference on Data Engineering.

[21]  Ramez Elmasri,et al.  A consensus glossary of temporal database concepts , 1994, SGMD.

[22]  Jennifer Widom,et al.  Change detection in hierarchically structured information , 1996, SIGMOD '96.

[23]  Walter F. Tichy,et al.  Rcs — a system for version control , 1985, Softw. Pract. Exp..

[24]  Marc J. Rochkind,et al.  The source code control system , 1975, IEEE Transactions on Software Engineering.

[25]  Kyoungro Yoon,et al.  Version Management in Structured Document Retrieval Systems , 1996, SEKE.