A Join Index for XML Data Warehouses

XML data warehouses form an interesting basis for decision-support applications that exploit complex data. However, native XML database management systems (DBMSs) currently offer limited performance, and ways to optimize them must be investigated. In this paper, we propose a new join index that is specifically adapted to the multidimensional architecture of XML warehouses. It eliminates join operations while preserving the information contained in the original warehouse. A theoretical study and experimental results demonstrate the efficiency of our join index. They also show that native XML DBMSs can compete with XML-compatible relational DBMSs when warehousing and analyzing XML data.
