An empirical study of XML data management in business information systems

Due the popularity of XML, an increasingly large amount of business transactions encoded in XML have been exchanged on-line. Currently there are two approaches to process and manage these XML data. One is to store them in relational databases, and the other is to store them in recently-developed native XML databases. There is no conclusion yet as to which approach suits better for contemporary business information systems. Also, the effectiveness of native XML databases used in daily operational systems has not been completely investigated. Therefore, in this paper, we provide: (1) a complete and systematic survey of the current development and challenges of processing XML data in relational and native XML databases, (2) a useful benchmark for IT practitioners who need to process XML data effectively, (3) experimental results and detailed analysis which reveal several interesting tips that can be helpful to XML document designers, and (4) a conclusion, based on the findings of using native XML databases in EDI processes, that it is practical to use native XML databases for daily operations although our experimental results showed that relational database systems outperform native XML databases in processing XML data.

[1]  Alin Deutsch,et al.  XML-QL: A Query Language for XML , 1998 .

[2]  Jeffrey F. Naughton,et al.  A general technique for querying XML documents using a relational database system , 2001, SGMD.

[3]  Jignesh M. Patel,et al.  The Michigan Benchmark: A Microbenchmark for XML Query Processing Systems , 2002, EEXTT.

[4]  Elisa Bertino,et al.  Integrating XML and databases , 2001, IEEE Internet Computing.

[5]  Hyoung-Joo Kim,et al.  XML query processing using document type definitions , 2002, J. Syst. Softw..

[6]  Stéphane Bressan,et al.  The XOO7 Benchmark , 2002, EEXTT.

[7]  Erhard Rahm,et al.  XMach-1: A Benchmark for XML Data Management , 2001, BTW.

[8]  Daniela Florescu,et al.  Storing and Querying XML Data using an RDMBS , 1999, IEEE Data Eng. Bull..

[9]  Michael J. Carey,et al.  XPERANTO: Middleware for Publishing Object-Relational Data as XML Documents , 2000, VLDB.

[10]  Michael J. Franklin,et al.  A Fast Index for Semistructured Data , 2001, VLDB.

[11]  Vishu Krishnamurthy,et al.  Oracle8i-the XML enabled data management system , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[12]  Christof Bornhövd,et al.  A generic load/extract utility for data transfer between XML documents and relational databases , 2000, Proceedings Second International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems. WECWIS 2000.

[13]  Michael Rys Bringing the Internet to your database: using SQL server 2000 and XML to build loosely-coupled systems , 2001, Proceedings 17th International Conference on Data Engineering.

[14]  Dan Suciu,et al.  On database theory and XML , 2001, SGMD.

[15]  Michael Rys Query Languages for XML Documents: A QL '98 Position Paper , 1998, QL.

[16]  M. Tamer Özsu,et al.  XBench - A Family of Benchmarks for XML DBMSs , 2002, EEXTT.

[17]  David J. DeWitt,et al.  Relational Databases for Querying XML Documents: Limitations and Opportunities , 1999, VLDB.

[18]  Donald D. Chamberlin,et al.  XQuery: a query language for XML , 2003, SIGMOD '03.

[19]  Ioana Manolescu,et al.  XMark: A Benchmark for XML Data Management , 2002, VLDB.

[20]  Alin Deutsch,et al.  Storing semistructured data with STORED , 1999, SIGMOD '99.

[21]  Daniela Florescu,et al.  A Performance Evaluation of Alternative Mapping Schemes for Storing XML Data in a Relational Database , 1999 .

[22]  Stéphane Bressan,et al.  Current Approaches to XML Management , 2002, IEEE Internet Comput..

[23]  Menzo Windhouwer,et al.  Efficient Relational Storage and Retrieval of XML Documents , 2000, WebDB.

[24]  Evangelos Kotsakis,et al.  XSD: A Hierarchical Access Method for Indexing XML Schemata , 2002, Knowledge and Information Systems.

[25]  Dan Suciu,et al.  SilkRoute: trading between relations and XML , 2000, Comput. Networks.

[26]  Mary Fernandez XML Query Languages: Experiences and Exemplars , 2001 .

[27]  Shihyu Chou,et al.  An empirical study of XML/EDI , 2001, J. Syst. Softw..

[28]  Ioana Manolescu,et al.  Why and how to benchmark XML databases , 2001, SGMD.

[29]  Eric Jui-Lin Lu,et al.  XDSearch: an efficient search engine for XML document schemata , 2003, Expert Syst. Appl..

[30]  W. Glas Xml and Databases , 2002 .

[31]  Paolo Atzeni,et al.  XML AND DATABASES , 2004 .

[32]  Stéphane Bressan,et al.  Efficient XML Data Management: An Analysis , 2002, EC-Web.

[33]  Michael Rys,et al.  Bringing the Internet to Your Database: Using SQLServer 2000 and XML to Build Loosely-Coupled Systems , 2001, BTW.

[34]  Michael J. Carey,et al.  XPERANTO: Publishing Object-Relational Data as XML , 2000, WebDB.

[35]  Stéphane Bressan,et al.  The XOO7 XML Management System Benchmark , 2001 .

[36]  Hamid Pirahesh,et al.  Efficiently publishing relational data as XML documents , 2001, The VLDB Journal.

[37]  Alin Deutsch,et al.  A Query Language for XML , 1999, Comput. Networks.