Storing XML Data - The ExDB and CellStore Way in the Context of Current Approaches

In this paper we describe possible approaches how to store XML data, which is a key aspect for their further processing. One popular technique for managing XML data is to map the data to an existing database system, e.g., to the relational or object-relational database management system. We describe possible ways how to store XML data in relational databases, because relational systems are still widely used for various purposes, including XML data management. But XML data are trees, not tables, so the main focus of this article is oriented to native XML databases. We describe general properties of such kind of databases and, in particular, explain possible solutions on two experimental native XML database management systems – ExDB and CellStore. Both have been proposed, implemented and optimized in our research groups in recent years for experimental purposes.

[1]  G. Broll,et al.  Microsoft Corporation , 1999 .

[2]  Yannis Papakonstantinou,et al.  Storing and querying XML data using denormalized relational databases , 2005, The VLDB Journal.

[3]  Jeffrey F. Naughton,et al.  Covering indexes for branching path queries , 2002, SIGMOD '02.

[4]  Sihem Amer-Yahia Storage Techniques and Mapping Schemas for XML , 2003 .

[5]  David J. DeWitt,et al.  On supporting containment queries in relational database management systems , 2001, SIGMOD '01.

[6]  C. M. Sperberg-McQueen,et al.  eXtensible Markup Language (XML) 1.0 (Second Edition) , 2000 .

[7]  I. Mlynkova A Journey towards More Efficient Processing of XML Data in (O)RDBMS , 2007, 7th IEEE International Conference on Computer and Information Technology (CIT 2007).

[8]  Paul F. Dietz Maintaining order in a linked list , 1982, STOC '82.

[9]  Jignesh M. Patel,et al.  Structural joins: a primitive for efficient XML query pattern matching , 2002, Proceedings 18th International Conference on Data Engineering.

[10]  Scott Boag,et al.  XQuery 1.0 : An XML Query Language , 2007 .

[11]  Divesh Srivastava,et al.  Holistic twig joins: optimal XML pattern matching , 2002, SIGMOD '02.

[12]  Jan Vraný,et al.  The Debuggable Interpreter Design Pattern , 2016, ICSOFT.

[13]  Mounia Lalmas,et al.  Overview of the INitiative for the evaluation of XML retrieval (INEX) 2003 , 2014 .

[14]  Jiang Li,et al.  TwigBuffer: Avoiding Useless Intermediate Solutions Completely in Twig Joins , 2008, DASFAA.

[15]  Theo Härder,et al.  taDOM: A Tailored Synchronization Concept with Tunable Lock Granularity for the DOM API , 2003, ADBIS.

[16]  Tok Wang Ling,et al.  Efficient processing of XML twig patterns with parent child edges: a look-ahead approach , 2004, CIKM '04.

[17]  C. M. Sperberg-McQueen,et al.  Extensible Markup Language (XML) , 1997, World Wide Web J..

[18]  Beng Chin Ooi,et al.  XR-tree: indexing XML data for efficient structural joins , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[19]  Gabriella Kazai,et al.  Overview of the Initiative for the Evaluation of XML retrieval (INEX) 2002 , 2002, INEX Workshop.

[20]  Holger Meyer,et al.  XML and Object-Relational Database Systems - Enhancing Structural Mappings Based on Statistics , 2000, WebDB.

[21]  Michal Krátký,et al.  Benchmarking the Compression of XML Node Streams , 2010, DASFAA Workshops.

[22]  Hua-Gang Li,et al.  Twig2Stack: bottom-up processing of generalized-tree-pattern queries over XML documents , 2006, VLDB.

[23]  Charles Kim,et al.  Oracle Database 11g , 2004 .

[24]  Michal Valenta,et al.  On Benchmarking Transaction Managers , 2009, DASFAA Workshops.

[25]  Edith Cohen,et al.  Labeling dynamic XML trees , 2002, PODS '02.

[26]  Carlo Zaniolo,et al.  Efficient Structural Joins on Indexed XML Documents , 2002, VLDB.

[27]  Steven J. DeRose,et al.  XML Path Language (XPath) Version 1.0 , 1999 .

[28]  Gabriella Kazai INitiative for the Evaluation of XML Retrieval , 2009, Encyclopedia of Database Systems.

[29]  Juliana Freire,et al.  From XML schema to relations: a cost-based approach to XML storage , 2002, Proceedings 18th International Conference on Data Engineering.

[30]  Daniela Florescu,et al.  Storing and Querying XML Data using an RDMBS , 1999, IEEE Data Eng. Bull..

[31]  Patrick E. O'Neil,et al.  ORDPATHs: insert-friendly XML node labels , 2004, SIGMOD '04.

[32]  Tok Wang Ling,et al.  On boosting holism in XML twig pattern matching using structural indexing techniques , 2005, SIGMOD '05.

[33]  C. M. Sperberg-McQueen,et al.  Extensible markup language , 1997 .

[34]  Ralph Krieger,et al.  Efficient Structure Oriented Storage of XML Documents Using ORDBMS , 2002, EEXTT.

[35]  Jignesh M. Patel,et al.  Structural join order selection for XML query optimization , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[36]  Kyuseok Shim,et al.  APEX: an adaptive path index for XML data , 2002, SIGMOD '02.

[37]  David J. DeWitt,et al.  Relational Databases for Querying XML Documents: Limitations and Opportunities , 1999, VLDB.

[38]  Sihem Amer-Yahia,et al.  A comprehensive solution to the XML-to-relational mapping problem , 2004, WIDM '04.

[39]  Vishu Krishnamurthy,et al.  Performance Challenges in Object-Relational DBMSs , 1999, IEEE Data Eng. Bull..

[40]  Robert Richards,et al.  Document Object Model (DOM) , 2006 .

[41]  Ioana Manolescu,et al.  The XML benchmark project , 2001 .

[42]  Quanzhong Li,et al.  Indexing and Querying XML Data for Regular Path Expressions , 2001, VLDB.

[43]  Jignesh M. Patel,et al.  Storing and Querying XML Data in Object-Relational DBMSs , 2002, EDBT Workshops.

[44]  Hongjun Lu,et al.  Holistic Twig Joins on Indexed XML Documents , 2003, VLDB.

[45]  Karel Richta,et al.  CellStore: Educational and Experimental XML-Native DBMS , 2007, ISD.

[46]  Kamil Toman Storing XML Data In a Native Repository , 2004, DATESO.

[47]  Ruprecht Raatz,et al.  Microsoft SQL Server 2008 , 2008 .

[48]  Chun Zhang,et al.  Storing and querying ordered XML using a relational database system , 2002, SIGMOD '02.

[49]  Hongjun Lu,et al.  Cost-driven storage schema selection for XML , 2003, Eighth International Conference on Database Systems for Advanced Applications, 2003. (DASFAA 2003). Proceedings..

[50]  Sihem Amer-Yahia,et al.  ShreX: Managing XML Documents in Relational Databases , 2004, VLDB.

[51]  Wen-Chi Hou,et al.  Efficient Processing of XML Twig Pattern: A Novel One-Phase Holistic Solution , 2007, DEXA.

[52]  Jeffrey Xu Yu,et al.  TwigList : Make Twig Pattern Matching Fast , 2007, DASFAA.

[53]  Xiaoling Wang,et al.  An Adaptable and Adjustable Mapping from XML Data to Tables in RDB , 2002, EEXTT.

[54]  Jiang Li,et al.  Fast Matching of Twig Patterns , 2008, DEXA.