The decoration operator: a foundation for on-line dimensional data integration

The changing data requirements of today's dynamic business environments are not handled well by current on-line analytical processing (OLAP) systems. Physically integrating unexpected, external data into OLAP cubes, i.e., the data warehousing approach, is a long and time-consuming process, making logical, on-the-fly, integration the better choice in many situations. However, OLAP systems have no operations for integrating existing multidimensional cube data with external data. In This work we present a novel multidimensional algebra operator, the decoration operator, which allows external data to be integrated in OLAP cubes as new dimensions, i.e., the cube is "decorated" with new dimensions which can subsequently be used just as the regular dimensions. We formally specify the semantics of the decoration operator, ensuring that semantic problems do not occur in the data integration process. We also provide a comprehensive set of algebraic rewrite rules, specifying how the decoration operator interacts with other operators in the cube algebra.

[1]  Christian S. Jensen,et al.  A foundation for capturing and querying complex multidimensional data , 2001, Inf. Syst..

[2]  Jennifer Widom,et al.  The TSIMMIS Project: Integration of Heterogeneous Information Sources , 1994, IPSJ.

[3]  Chin-Wan Chung,et al.  Exploiting Versions for On-line Data Warehouse Maintenance in MOLAP Servers , 2002, VLDB.

[4]  Erick Thomsen,et al.  Microsoft? OLAP Solutions , 1999 .

[5]  Arie Shoshani,et al.  Summarizability in OLAP and statistical data bases , 1997, Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150).

[6]  Peter Gluchowski,et al.  Data Warehouse , 1997, Informatik-Spektrum.

[7]  Torben Bach Pedersen,et al.  XML-extended OLAP querying , 2002, Proceedings 14th International Conference on Scientific and Statistical Database Management.

[8]  Roy Goldman,et al.  WSQ/DSQ: a practical approach for combined querying of databases and the Web , 2000, SIGMOD 2000.

[9]  Roy Goldman,et al.  WSQ/DSQ: a practical approach for combined querying of databases and the Web , 2000, SIGMOD '00.

[10]  Paul W. P. J. Grefen,et al.  A multi-set extended relational algebra: a formal approach to a practical issue , 1994, Proceedings of 1994 IEEE 10th International Conference on Data Engineering.

[11]  Arie Shoshani,et al.  Extending OLAP querying to external object databases , 2000, CIKM '00.

[12]  Johann Eder,et al.  Evolution of Dimension Data in Temporal Data Warehouses , 2000 .

[13]  Klaus R. Dittrich,et al.  An overview and classification of mediated query systems , 1999, SGMD.

[14]  Proceedings International Database Engineering and Applications Symposium , 2003, Seventh International Database Engineering and Applications Symposium, 2003. Proceedings..

[15]  Yvan Bédard,et al.  Handling evolutions in multidimensional structures , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[16]  Laura M. Haas,et al.  The Garlic project , 1996, SIGMOD '96.

[17]  Alberto O. Mendelzon,et al.  Maintaining data cubes under dimension updates , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[18]  Torben Bach Pedersen,et al.  A Powerful and SQL-Compatible Data Model and Query Language for OLAP , 2002, Australasian Database Conference.

[19]  Jennifer Widom,et al.  Ozone: Integrating Structured and Semistructured Data , 1999, DBPL.

[20]  Ashish Gupta,et al.  Aggregate-Query Processing in Data Warehousing Environments , 1995, VLDB.

[21]  Hamid Pirahesh,et al.  Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals , 1996, Data Mining and Knowledge Discovery.

[22]  Laks V. S. Lakshmanan,et al.  nD-SQL: A Multi-Dimensional Language for Interoperability and OLAP , 1998, VLDB.

[23]  Michael Stonebraker,et al.  Independent, Open Enterprise Data Integration , 1999, IEEE Data Eng. Bull..