EXRT: Towards a Simple Benchmark for XML Readiness Testing

As we approach the ten-year anniversary of the first working draft of the XQuery language, one finds XML storage and query support in a number of commercial database systems. For many XML use cases, database vendors now recommend storing and indexing XML natively and using XQuery or SQL/XML to query and update XML directly. If the complexity of the XML data allows, shredding and reconstructing XML to/from relational tables is still an alternative as well, and might in fact outperform native XML processing. In this paper we report on an effort to evaluate these basic XML data management trade-offs for current commercial systems. We describe EXRT (Experimental XML Readiness Test), a simple micro-benchmark that methodically evaluates the impact of query characteristics on the comparison of shredded and native XML. We describe our experiences and preliminary results from EXRT'ing pressure on the XML data management facilities offered by two relational databases and one XML database system.

[1]  Jim Melton,et al.  Advancements in SQL/XML , 2004, SGMD.

[2]  Ioana Manolescu,et al.  MemBeR: A Micro-benchmark Repository for XQuery , 2005, XSym.

[3]  M. Tamer Özsu,et al.  XBench - A Family of Benchmarks for XML DBMSs , 2002, EEXTT.

[4]  Harald Schöning Tamino - A DBMS designed for XML , 2001, ICDE.

[5]  Michael Rys XML and relational database management systems: inside Microsoft® SQL Server™ 2005 , 2005, SIGMOD '05.

[6]  Karsten Schmidt,et al.  Benchmarking Performance-Critical Components in a Native XML Database System , 2009, DASFAA Workshops.

[7]  Loredana Afanasiev,et al.  An analysis of XQuery benchmarks , 2008, Inf. Syst..

[8]  Vikas Arora,et al.  Towards an enterprise XML architecture , 2005, SIGMOD '05.

[9]  Zhen Hua Liu,et al.  A Decade of XML Data Management: An Industrial Experience Report from Oracle , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[10]  Stéphane Bressan,et al.  Efficiency and Effectiveness of XML Tools and Techniques and Data Integration over the Web , 2003, Lecture Notes in Computer Science.

[11]  Jignesh M. Patel,et al.  The Michigan benchmark: towards XML query performance diagnostics , 2006, Inf. Syst..

[12]  Erhard Rahm,et al.  Multi-user Evaluation of XML Data Management Systems with XMach-1 , 2002, EEXTT.

[13]  Jeffrey F. Naughton,et al.  A general technique for querying XML documents using a relational database system , 2001, SGMD.

[14]  Mary Holstege,et al.  Big, Fast XQuery: Enabling Content Applications , 2008, IEEE Data Eng. Bull..

[15]  David J. DeWitt,et al.  The Wisconsin Benchmark: Past, Present, and Future , 1991, The Benchmark Handbook.

[16]  Massimo Franceschet XPathMark: An XPath Benchmark for the XMark Generated Data , 2005, XSym.

[17]  Dan Suciu,et al.  Database and XML Technologies , 2004, Lecture Notes in Computer Science.

[18]  Matthias Nicola Lessons Learned from DB2 pureXML Applications: A Practitioner's Perspective , 2010, XSym.

[19]  Ioana Manolescu,et al.  XMark: A Benchmark for XML Data Management , 2002, VLDB.

[20]  David J. DeWitt,et al.  The 007 Benchmark , 1993, SIGMOD '93.

[21]  Matthias Nicola,et al.  An XML transaction processing benchmark , 2007, SIGMOD '07.

[22]  Stéphane Bressan,et al.  XOO7: applying OO7 benchmark to XML query processing tool , 2001, CIKM '01.