Core technologies for native XML database management systems

This work investigates the core technologies required to build Database Management Systems (DBMSs) for large collections of XML documents. We call such systems XML Base Management Systems (XBMSs). We identify requirements, and analyze how they can be met using a conventional DBMS. Our conclusion is that an XML support layer on top of an existing conventional DBMS does not address the requirements for XBMSs. Hence, we built a Native XBMS, called Natix. Natix has been developed completely from scratch, incorporating optimizations for high-performance XML processing in those places where they are most effective.

[1]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[2]  C. Mohan,et al.  ARIES/NT: A Recovery Method Based on Write-Ahead Logging for Nested Transactions , 1989, VLDB.

[3]  Donald D. Chamberlin,et al.  A Complete Guide to DB2 Universal Database , 1998 .

[4]  Daniela Florescu,et al.  Storing and Querying XML Data using an RDMBS , 1999, IEEE Data Eng. Bull..

[5]  Dan Suciu,et al.  Programming Constructs for Unstructured Data , 1995, DBPL.

[6]  Guido Moerkotte Incorporating XSL Processing into Database Engines , 2002, VLDB.

[7]  Sven Helmer,et al.  Optimized translation of XPath into algebraic expressions parameterized by programs containing navigational primitives , 2002, Proceedings of the Third International Conference on Web Information Systems Engineering, 2002. WISE 2002..

[8]  E. James Whitehead,et al.  HTTP Extensions for Distributed Authoring - WEBDAV , 1999, RFC.

[9]  Guido Moerkotte,et al.  Evaluating Queries on Structure with eXtended Access Support Relations , 2000, WebDB.

[10]  Roy T. Fielding,et al.  Uniform Resource Identifiers (URI): Generic Syntax , 1998, RFC.

[11]  Alexandros Biliris An efficient database storage structure for large dynamic objects , 1992, [1992] Eighth International Conference on Data Engineering.

[12]  Henry F. Korth,et al.  Locking Primitives in a Database System , 1983, JACM.

[13]  Sukhamay Kundu,et al.  A Linear Tree Partitioning Algorithm , 1977, SIAM J. Comput..

[14]  C. Mohan,et al.  ARIES/IM: an efficient and high concurrency index management method using write-ahead logging , 1992, SIGMOD '92.

[15]  Alin Deutsch,et al.  Storing semistructured data with STORED , 1999, SIGMOD '99.

[16]  Menzo Windhouwer,et al.  Efficient Relational Storage and Retrieval of XML Documents , 2000, WebDB.

[17]  C. Mohan Commit_LSN: A Novel and Simple Method for Reducing Locking and Latching in Transaction Processing Systems , 1996, Performance of Concurrency Control Mechanisms in Centralized Database Systems.

[18]  Jennifer Widom,et al.  Object exchange across heterogeneous information sources , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[19]  Irving L. Traiger,et al.  Granularity of Locks in a Large Shared Data Base. , 1975, VLDB 1975.

[20]  Michael Stonebraker,et al.  Readings in Database Systems , 1988 .

[21]  Norbert Ritter,et al.  XML content management based on object-relational database technology , 2000, Proceedings of the First International Conference on Web Information Systems Engineering.

[22]  Erhard Rahm,et al.  Datenbanksysteme: Konzepte und Techniken der Implementierung , 1999 .

[23]  Peter Buneman,et al.  Semistructured data , 1997, PODS.

[24]  Ralph Johnson,et al.  design patterns elements of reusable object oriented software , 2019 .

[25]  Gerhard Weikum,et al.  Principles and realization strategies of multilevel transaction management , 1991, TODS.

[26]  Erich J. Neuhold,et al.  Structured document storage and refined declarative and navigational access mechanisms in HyperStorM , 1997, The VLDB Journal.

[27]  Irving L. Traiger,et al.  The Recovery Manager of the System R Database Manager , 1981, CSUR.

[28]  Andreas Reuter,et al.  Principles of transaction-oriented database recovery , 1983, CSUR.

[29]  Roy Goldman,et al.  Lore: a database management system for semistructured data , 1997, SGMD.

[30]  Guido Moerkotte,et al.  Algebraic XML construction in Natix , 2001, Proceedings of the Second International Conference on Web Information Systems Engineering.

[31]  Andreas Reuter,et al.  Group Commit Timers and High Volume Transaction Systems , 1987, HPTS.

[32]  Irving L. Traiger,et al.  System R: relational approach to database management , 1976, TODS.

[33]  C. Mohan,et al.  ARIES/CSA: a method for database recovery in client-server architectures , 1994, SIGMOD '94.

[34]  Gottfried Vossen,et al.  Transactional Information Systems: Theory, Algorithms, and the Practice of Concurrency Control and Recovery , 2002 .

[35]  Roy T. Fielding,et al.  Hypertext Transfer Protocol - HTTP/1.0 , 1996, RFC.

[36]  Luca Padovani,et al.  The Gnome DOM Engine , 2001, Extreme Markup Languages®.

[37]  Marvin H. Solomon,et al.  Towards effective and efficient free space management , 1996, SIGMOD '96.

[38]  Michael Stonebraker,et al.  Operating system support for database management , 1981, CACM.

[39]  Joseph A. Lukes Efficient Algorithm for the Partitioning of Trees , 1974, IBM J. Res. Dev..

[40]  C. M. Sperberg-McQueen,et al.  eXtensible Markup Language (XML) 1.0 (Second Edition) , 2000 .

[41]  Guido Moerkotte,et al.  Efficient Storage of XML Data , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[42]  Bjarne Stroustrup,et al.  C++ : programovací jazyk : The C++ programming language (Orig.) , 1997 .

[43]  C. Mohan,et al.  ARIES/LHS: A concurrency control and recovery method using write-ahead logging for linear hashing with separators , 1993, Proceedings of IEEE 9th International Conference on Data Engineering.

[44]  Hamid Pirahesh,et al.  ARIES: a transaction recovery method supporting fine-granularity locking and partial rollbacks using write-ahead logging , 1998 .

[45]  C. Mohan,et al.  Algorithms for Flexible Space Management in Transaction Systems Supporting Fine-Granularity Locking , 1994, EDBT.

[46]  Mario Schkolnick,et al.  A clustering algorithm for hierarchical structures , 1977, TODS.

[47]  C. Mohan,et al.  Repeating History Beyond ARIES , 1999, VLDB.

[48]  Arnaud Le Hors,et al.  Document Object Model (DOM) Level 2 Core Specification - Version 1.0 , 2000 .

[49]  David J. DeWitt,et al.  Object and File Management in the EXODUS Extensible Database System , 1986, VLDB.

[50]  Holger Meyer,et al.  XML and Object-Relational Database Systems - Enhancing Structural Mappings Based on Statistics , 2000, WebDB.

[51]  Gio Wiederhold File organization for database design , 1987 .

[52]  Ioana Manolescu,et al.  XMark: A Benchmark for XML Data Management , 2002, VLDB.

[53]  Steven J. DeRose,et al.  XML Path Language (XPath) Version 1.0 , 1999 .

[54]  Donald Kossmann,et al.  A Performance Evaluation of OID Mapping Techniques , 1995, VLDB.

[55]  Alfred Z. Spector,et al.  Synchronizing shared abstract types , 1984, TOCS.

[56]  Bruce G. Lindsay,et al.  The Starburst Long Field Manager , 1989, VLDB.

[57]  Serge Abiteboul,et al.  Querying and Updating the File , 1993, VLDB.

[58]  James Clark,et al.  XSL Transformations (XSLT) Version 1.0 , 1999 .

[59]  Hamid Pirahesh,et al.  ARIES-RRH: restricted repeating of history in the ARIES transaction recovery method , 1991, [1991] Proceedings. Seventh International Conference on Data Engineering.

[60]  Sven Helmer,et al.  Isolation in XML Bases , 2001 .

[61]  C. Mohan,et al.  Disk read-write optimizations and data integrity in transaction systems using write-ahead logging , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[62]  Andreas Reuter,et al.  Transaction Processing: Concepts and Techniques , 1992 .

[63]  C. Mohan,et al.  ARIES/KVL: A Key-Value Locking Method for Concurrency Control of Multiaction Transactions Operating on B-Tree Indexes , 1990, VLDB.

[64]  C. A. R. Hoare,et al.  Monitors: an operating system structuring concept , 1974, CACM.

[65]  David J. DeWitt,et al.  Relational Databases for Querying XML Documents: Limitations and Opportunities , 1999, VLDB.

[66]  Michael Stonebraker,et al.  The design and implementation of INGRES , 1976, TODS.

[67]  Wolfgang Effelsberg,et al.  Principles of database buffer management , 1984, TODS.

[68]  Arvind Malhotra,et al.  Xml schema part 2: datatypes , 1999 .

[69]  A. N. Wilschut,et al.  Modelling and Querying Semistructured Data with MOA , 1999 .

[70]  Goetz Graefe,et al.  Query evaluation techniques for large databases , 1993, CSUR.

[71]  Dennis Shasha,et al.  Concurrent search structure algorithms , 1988, TODS.

[72]  Sven Helmer,et al.  Anatomy of a native XML base management system , 2002, The VLDB Journal.