Integrating XML data in the TARGIT OLAP system

We present work on logical integration of OLAP and XML data sources, carried out in cooperation between TARGIT, a Danish OLAP client vendor, and Aalborg University. A prototype has been developed that allows XML data on the WWW to be used as dimensions and measures in the OLAP system in the same way as ordinary dimensions and measures, providing a powerful and flexible way to handle unexpected or short-term data requirements as well as rapidly changing data. Compared to earlier work, we present several major extensions that resulted from TARGIT's requirements. These include the ability to use XML data as measures, as well as a novel multigranular data model and query language that formalizes and extends the TARGIT data model and query language.

[1]  C. M. Sperberg-McQueen,et al.  eXtensible Markup Language (XML) 1.0 (Second Edition) , 2000 .

[2]  Yvan Bédard,et al.  Handling evolutions in multidimensional structures , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[3]  Tony Bain,et al.  Professional SQL Server 2000 Data Warehousing with Analysis Services , 2001 .

[4]  Christian S. Jensen,et al.  A foundation for capturing and querying complex multidimensional data , 2001, Inf. Syst..

[5]  Qiang Zhu,et al.  Global Query Processing and Optimization in the CORDS Multidatabase System , 1996 .

[6]  Torben Bach Pedersen,et al.  XML-extended OLAP querying , 2002, Proceedings 14th International Conference on Scientific and Statistical Database Management.

[7]  Jennifer Widom,et al.  Ozone: Integrating Structured and Semistructured Data , 1999, DBPL.

[8]  Forouzan Golshani,et al.  Proceedings of the Eighth International Conference on Data Engineering , 1992 .

[9]  Chin-Wan Chung,et al.  Exploiting Versions for On-line Data Warehouse Maintenance in MOLAP Servers , 2002, VLDB.

[10]  Arie Shoshani,et al.  Summarizability in OLAP and statistical data bases , 1997, Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150).

[11]  Arie Shoshani,et al.  Extending OLAP querying to external object databases , 2000, CIKM '00.

[12]  Hongjun Lu,et al.  The Fittest Survives: An Adaptive Approach to Query Optimization , 1995, VLDB.

[13]  Michael Stonebraker,et al.  Independent, Open Enterprise Data Integration , 1999, IEEE Data Eng. Bull..

[14]  Roy Goldman,et al.  WSQ/DSQ: a practical approach for combined querying of databases and the Web , 2000, SIGMOD '00.

[15]  Peter Gluchowski,et al.  Data Warehouse , 1997, Informatik-Spektrum.

[16]  Torben Bach Pedersen,et al.  Integrating XML Data in the TARGITOLAP System , 2004, ICDE.

[17]  Jennifer Widom,et al.  The TSIMMIS Project: Integration of Heterogeneous Information Sources , 1994, IPSJ.

[18]  Torben Bach Pedersen,et al.  Cost Modeling and Estimation for OLAP-XML Federations , 2002, DaWaK.

[19]  Torben Bach Pedersen,et al.  A Powerful and SQL-Compatible Data Model and Query Language for OLAP , 2002, Australasian Database Conference.

[20]  Goetz Graefe,et al.  Hash Joins and Hash Teams in Microsoft SQL Server , 1998, VLDB.

[21]  Johann Eder,et al.  Changes of Dimension Data in Temporal Data Warehouses , 2001, DaWaK.

[22]  Laura M. Haas,et al.  Optimizing Queries Across Diverse Data Sources , 1997, VLDB.

[23]  Laura M. Haas,et al.  The Garlic project , 1996, SIGMOD '96.

[24]  Alberto O. Mendelzon,et al.  Maintaining data cubes under dimension updates , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[25]  Torben Bach Pedersen,et al.  Evaluating XML-extended OLAP queries based on a physical algebra , 2004, DOLAP '04.

[26]  Erik Thomsen,et al.  OLAP Solutions - Building Multidimensional Information Systems , 1997 .

[27]  Laks V. S. Lakshmanan,et al.  nD-SQL: A Multi-Dimensional Language for Interoperability and OLAP , 1998, VLDB.