Advantages of complex SQL types in storing XML documents

Abstract This contribution looks at and evaluates new ways to map XML documents to relational database systems preserving their hierarchical structure. In particular we take advantage of the advanced features available following the SQL:2003 standard, which defines complex structure and collection types. Starting out with a survey of existing methods, basic questions concerning appropriate and efficient forms of mapping XML documents to SQL-compliant data types are addressed. This leads to a reversible conversion method, which is subsequently implemented in a prototype application. The design of the mapping is geared towards existing, mature relational database management systems (DBMS). It turns out that among the products considered, IBM Informix seems to offer the best support for complex structure and collection types in the context of our application. To better assess the performance of the mapping process, the observed running time and the required in-memory and database storage of the implementation are measured and evaluated.

[1]  Vincent Quint,et al.  Impact of XML Schema Evolution , 2011, TOIT.

[2]  Kyuseok Shim,et al.  Storing XML (with XSD) in SQL Databases: Interplay of Logical and Physical Designs , 2005, IEEE Trans. Knowl. Data Eng..

[3]  Mong-Li Lee,et al.  A Prime Number Labeling Scheme for Dynamic Ordered XML Trees , 2004, ICDE.

[4]  Kyoungro Yoon,et al.  Index structures for structured documents , 1996, DL '96.

[5]  Hongjun Lu,et al.  Query translation from XPath to SQL in the presence of recursive DTDs , 2009, The VLDB Journal.

[6]  Tok Wang Ling,et al.  QED: a novel quaternary encoding to completely avoid re-labeling in XML updates , 2005, CIKM '05.

[7]  Tok Wang Ling,et al.  DDE: from dewey to a fully dynamic XML labeling scheme , 2009, SIGMOD Conference.

[8]  Krishna G. Kulkarni,et al.  The SQL Standard , 2014 .

[9]  Toshiyuki Amagasa,et al.  XRel: a path-based approach to storage and retrieval of XML documents using relational databases , 2001, ACM Trans. Internet Techn..

[10]  Richi Nayak,et al.  XML data clustering: An overview , 2011, CSUR.

[11]  Kai Schweinsberg Abbildung von XML-Dokumenten auf SQL:2003-konforme Datentypen , 2012 .

[12]  Chun Zhang,et al.  Storing and querying ordered XML using a relational database system , 2002, SIGMOD '02.

[13]  Dongwook Shin,et al.  BUS: an effective indexing and retrieval scheme in structured documents , 1998, DL '98.

[14]  Mark Roantree,et al.  Desirable properties for XML update mechanisms , 2010, EDBT '10.

[15]  Klemens Böhm Proceedings of the 31st International conference on Very Large Data Bases, August 30 - September 2, 2005, Trondheim, Norway , 2005 .

[16]  Peter Dadam,et al.  A DBMS prototype to support extended NF2 relations: an integrated view on flat tables and hierarchies , 1986, SIGMOD '86.

[17]  Wilfred Ng,et al.  Querying XML data by the nested relational sequence database system , 2003, Seventh International Database Engineering and Applications Symposium, 2003. Proceedings..

[18]  Lubos Kulic Adaptability in XML-to-relational mapping strategies , 2010, SAC '10.

[19]  Hongjun Lu,et al.  Path Materialization Revisited: An Efficient Storage Model for XML Data , 2002, Australasian Database Conference.

[20]  Jim Melton,et al.  Advancements in SQL/XML , 2004, SGMD.

[21]  Yannis Papakonstantinou,et al.  Incremental validation of XML documents , 2003, TODS.

[22]  Ge Yu,et al.  What makes the differences: benchmarking XML database implementations , 2005, TOIT.

[23]  Denilson Barbosa,et al.  Designing Information-Preserving Mapping Schemes for XML , 2005, VLDB.

[24]  Elliotte Rusty Harold XML 1.1 Bible , 2004 .

[25]  Jim Melton,et al.  Querying XML,: XQuery, XPath, and SQL/XML in context (The Morgan Kaufmann Series in Data Management Systems) (The Morgan Kaufmann Series in Data Management Systems) , 2006 .

[26]  Byron Choi,et al.  What are real DTDs like? , 2002, WebDB.

[27]  Philip Wadler,et al.  XQuery from the Experts: A Guide to the W3C XML Query Language , 2003 .

[28]  Hans-Jörg Schek,et al.  The relational model with relation-valued attributes , 1986, Inf. Syst..