DeltaNI: an efficient labeling scheme for versioned hierarchical data

Main-memory database systems are emerging as the new backbone of business applications. Besides flat relational data, these applications also depend on hierarchical data; we therefore devise a new indexing and versioning approach for hierarchies that is deeply integrated into the relational kernel. We propose the DeltaNI index as a versioned counterpart of the nested intervals (NI) labeling scheme. The index is space- and time-efficient and yields a gapless, fixed-size integer NI labeling for each version while also supporting branching histories. In contrast to a naive NI labeling, it accommodates even complex updates of the tree structure. Because many query processing techniques that work on top of the NI labeling have already been proposed, our index can serve as a building block for processing various kinds of queries. We evaluate the performance of the index on large inputs consisting of millions of nodes and thousands of versions. The results show that DeltaNI scales well and can deliver satisfactory performance for large business scenarios.
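To make the term "gapless integer NI labeling" concrete, the following minimal sketch labels a small tree with plain nested intervals and tests ancestorship by interval containment. It illustrates only the naive, single-version NI scheme that the abstract contrasts with DeltaNI, not the DeltaNI index itself. The function names `label_nested_intervals` and `is_ancestor` and the dictionary-based tree encoding are assumptions made for this illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical tree encoding for this sketch: node name -> ordered list of children.
Tree = Dict[str, List[str]]
Labels = Dict[str, Tuple[int, int]]


def label_nested_intervals(tree: Tree, root: str) -> Labels:
    """Assign each node a (lower, upper) integer interval by depth-first traversal.

    Bounds are handed out consecutively, so n nodes use exactly the
    integers 0 .. 2n-1 with no gaps.
    """
    labels: Labels = {}
    counter = 0

    def visit(node: str) -> None:
        nonlocal counter
        lower = counter              # bound assigned when entering the node
        counter += 1
        for child in tree.get(node, []):
            visit(child)
        labels[node] = (lower, counter)  # bound assigned when leaving the node
        counter += 1

    visit(root)
    return labels


def is_ancestor(labels: Labels, ancestor: str, descendant: str) -> bool:
    """A node is an ancestor iff its interval strictly contains the other one."""
    a_lower, a_upper = labels[ancestor]
    d_lower, d_upper = labels[descendant]
    return a_lower < d_lower and d_upper < a_upper


if __name__ == "__main__":
    tree = {"A": ["B", "C"], "B": ["D"]}
    labels = label_nested_intervals(tree, "A")
    print(labels)
    print(is_ancestor(labels, "A", "D"))
    print(is_ancestor(labels, "C", "D"))
```

Because the bounds are consecutive integers, inserting or moving a node in this naive form shifts every bound that follows the change, which is the update cost that a versioned index has to avoid.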
