FIESTA: A Framework for Schema Evolution in Multidimensional Databases (Abstract)

New application areas for databases like data warehousing and OLAP (Online Analytical Proc-essing) deploy the multidimensional data model in order to describe the application domain. Consequently, OLAP systems are represented by a multidimensional database schema to adequately reflect the application semantics. FIESTA presents a methodology for the evolution of such multidimensional schemas. Core of the thesis is a schema evolution algebra which comprehends a formal multidimensional data model together with corresponding schema evolution operations. Since OLAP systems are typically implemented as additional layer for relational database systems, the processing of sequences of schema evolution operations in a relational database system is presented. To this end, we formally describe how a multidimensional schema can be mapped to a corresponding relational database schema. In order to fully maintain the multidimensional semantics during this transformation, a corresponding meta schema is introduced as extension of the relational system catalogue. For a consistent processing of evolution operation sequences, a transformation to corresponding relational evolution commands is performed. These relational evolution commands adapt the relational database schema together with the instances and update the contents of the meta schema accordingly. A prototype for FIESTA has been implemented as part of a graphical design environment for data warehouses. In this environment, multidimensional schemas are presented at the user interface by means of a specialized graphical notation. This notation is an extension of the well-known Entity/Relationship approach. For internal processing the graphical representation is transformed to an algebraic description of the multidimensional schema.

[1]  Umeshwar Dayal,et al.  An Overview of Repository Technology , 1994, VLDB.

[2]  Bernhard Thalheim,et al.  Modelling Data Warehouses and OLAP Applications by Means of Dialogue Objects , 1999, ER.

[3]  Elke A. Rundensteiner,et al.  Psweep: Parallel View Maintenance under Concurrent Data Updates of Distributed Sources Psweep: Parallel View Maintenance under Concurrent Data Updates of Distributed Sources , 1999 .

[4]  Alberto O. Mendelzon,et al.  Updating OLAP dimensions , 1999, DOLAP '99.

[5]  Sunita Sarawagi,et al.  Modeling multidimensional databases , 1997, Proceedings 13th International Conference on Data Engineering.

[6]  Luca Cabibbo,et al.  A Logical Approach to Multidimensional Databases , 1998, EDBT.

[7]  Elke A. Rundensteiner,et al.  PVM: Parallel View Maintenance under Concurrent Data Updates of Distributed Sources , 2001, DaWaK.

[8]  Christoph F. Eick A Methodology for the Design and Transformation of Conceptual Schemas , 1991, VLDB.

[9]  Ralph Kimball,et al.  The Data Warehouse Toolkit: Practical Techniques for Building Dimensional Data Warehouses , 1996 .

[10]  Barbara Dinter,et al.  Extending the E/R Model for the Multidimensional Paradigm , 1998, ER Workshops.

[11]  Luca Cabibbo,et al.  Querying Multidimensional Databases , 1997, DBPL.

[12]  Christoph Quix,et al.  Repository Support for Data Warehouse Evolution , 1999, DMDW.

[13]  Laks V. S. Lakshmanan,et al.  A Foundation for Multi-dimensional Databases , 1997, VLDB.

[14]  Wolfgang Lehner,et al.  CROSS-DB: a feature-extended multidimensional data model for statistical and scientific databases , 1996, CIKM '96.

[15]  Elke A. Rundensteiner,et al.  Evolvable View Environment EVE: A Data Warehouse System Handling Schema and Data Changes of Distribu , 1999 .

[16]  Matteo Golfarelli,et al.  A methodological framework for data warehouse design , 1998, DOLAP '98.

[17]  Volker Markl,et al.  Improving OLAP performance by multidimensional hierarchical clustering , 1999, Proceedings. IDEAS'99. International Database Engineering and Applications Symposium (Cat. No.PR00265).

[18]  Elke A. Rundensteiner,et al.  On Preserving Views in Evolving Environments , 1997, KRDB.

[19]  Peter Chamoni,et al.  Modellierung temporaler multidimensionaler Daten in Analytischen Informationssystemen , 1998, Workshop Data Mining und Data Warehousing.

[20]  Carlo Zaniolo,et al.  A formal approach to the definition and the design of conceptual schemata for databased systems , 1982, TODS.

[21]  Setrag Khoshafian Object orientation , 1990 .

[22]  Peter P. Chen The Entity-Relationship Model: Towards a unified view of Data , 1976 .

[23]  Kenneth A. Ross,et al.  Adapting materialized views after redefinitions , 1995, SIGMOD '95.

[24]  Surajit Chaudhuri,et al.  Maintenance of Materialized Views: Problems, Techniques, and Applications. , 1995 .

[25]  Elke A. Rundensteiner,et al.  View maintenance after view synchronization , 1999, Proceedings. IDEAS'99. International Database Engineering and Applications Symposium (Cat. No.PR00265).

[26]  Peter P. Chen The entity-relationship model: toward a unified view of data , 1975, VLDB '75.

[27]  Matthias Jarke,et al.  Architecture and Quality in Data Warehouses: An Extended Repository Approach , 1999, Information Systems.

[28]  Wolfgang Lehner,et al.  Modelling Large Scale OLAP Scenarios , 1998, EDBT.

[29]  Barbara Dinter,et al.  The OLAP market: state of the art and research issues , 1998, DOLAP '98.

[30]  Chang Li,et al.  A data model for supporting on-line analytical processing , 1996, CIKM '96.

[31]  Olaf Herden A design methodology for data warehouses , 2000 .

[32]  O. Lehrmann Madsen,et al.  ECOOP ’92 European Conference on Object-Oriented Programming , 1992, Lecture Notes in Computer Science.

[33]  Surajit Chaudhuri,et al.  An overview of data warehousing and OLAP technology , 1997, SGMD.

[34]  Elke A. Rundensteiner,et al.  Data warehouse maintenance under concurrent schema and data updates , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[35]  Peter Chamoni,et al.  Temporal Structures in Data Warehousing , 1999, DaWaK.

[36]  Panos Vassiliadis,et al.  Modeling multidimensional databases, cubes and cube operations , 1998, Proceedings. Tenth International Conference on Scientific and Statistical Database Management (Cat. No.98TB100243).

[37]  Carsten Sapia,et al.  An Overview of Multidimensional Data Models for OLAP , 1999 .

[38]  Torben Bach Pedersen,et al.  Multidimensional data modeling for complex data , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[39]  Wolfgang Lehner,et al.  The Cube-Query-Languages (CQL) for Multidimensional Statistical and Scientific Database Systems , 1997, DASFAA.

[40]  Matteo Golfarelli,et al.  Conceptual design of data warehouses from E/R schemes , 1998, Proceedings of the Thirty-First Hawaii International Conference on System Sciences.

[41]  Elke A. Rundensteiner,et al.  Integrating the rewriting and ranking phases of view synchronization , 1998, DOLAP '98.

[42]  Jennifer Widom,et al.  Research problems in data warehousing , 1995, CIKM '95.

[43]  H. V. Jagadish,et al.  Data Integration using Self-Maintainable Views , 1996, EDBT.

[44]  W. H. Inmon,et al.  Building the data warehouse (2nd ed.) , 1996 .

[45]  Alberto O. Mendelzon,et al.  Maintaining data cubes under dimension updates , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[46]  Gabriele Höfling Schema-Evolution in objekt-orientierten Datenbanksystemen , 1996 .

[47]  Yannis Vassiliou,et al.  On Modeling and Predicting User Behavior in OLAP Systems , 1999 .

[48]  Vijay Kumar,et al.  Recent Advances and Research Problems in Data Warehousing , 1998, ER Workshops.

[49]  Elke A. Rundensteiner,et al.  Enhancing Existing Incremental View Maintenance Algorithms Using the Multi-Relation Encapsulation Wrapper , 1999 .

[50]  Elke A. Rundensteiner,et al.  View evolution support for information integration systems over dynamic distributed information spaces , 1998 .

[51]  Elke A. Rundensteiner,et al.  Optimizing Performance of Schema Evolution Sequences , 2000, Objects and Databases.

[52]  Jennifer Widom,et al.  Making views self-maintainable for data warehousing , 1996, Fourth International Conference on Parallel and Distributed Information Systems.

[53]  Rudolf Bayer,et al.  The Universal B-Tree for Multidimensional Indexing: general Concepts , 1997, WWCA.

[54]  Sunny Marche,et al.  Measuring the stability of data models , 1993 .

[55]  Inderpal Singh Mumick,et al.  Maintenance of data cubes and summary tables in a warehouse , 1997, SIGMOD '97.

[56]  Rudolf Bayer The Universal B-Tree for multidimensional Indexing , 1996 .

[57]  Carsten Sapia,et al.  PROMISE: modeling and predicting user behavior for online analytical processing applications , 2001 .

[58]  Carsten Sapia,et al.  GraMMi: using a standard repository management system to build a generic graphical modeling tool , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[59]  Andy Schürr,et al.  Defining and Parsing Visual Languages with Layered Graph Grammars , 1997, J. Vis. Lang. Comput..

[60]  Jay Banerjee,et al.  Semantics and implementation of schema evolution in object-oriented databases , 1987, SIGMOD '87.

[61]  Alan R. Simon Strategic Database Technology: Management for the Year 2000 , 1995 .

[62]  Bernhard Schiefer Eine Umgebung zur Unterstützung von Schemaänderungen und Sichten in objektorientierten Datenbanksystemen , 1993, GI Datenbank Rundbrief.

[63]  Hartmut Ehrig,et al.  Introduction to the Algebraic Theory of Graph Grammars (A Survey) , 1978, Graph-Grammars and Their Application to Computer Science and Biology.

[64]  Hamid Pirahesh,et al.  Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals , 1996, Proceedings of the Twelfth International Conference on Data Engineering.

[65]  Gottfried Vossen,et al.  Datenmodelle, Datenbanksprachen und Datenbank-Management-Systeme , 1990 .

[66]  Barbara Dinter,et al.  Finding your way through multidimensional data models , 1998, Proceedings Ninth International Workshop on Database and Expert Systems Applications (Cat. No.98EX130).

[67]  Matthias Jarke,et al.  Fundamentals of Data Warehouses , 2000, Springer Berlin Heidelberg.

[68]  Karl J. Lieberherr,et al.  Incremental Class Dictionary Learning and Optimization , 1991, ECOOP.

[69]  Barry Devlin,et al.  Data Warehouse: From Architecture to Implementation , 1996 .

[70]  Eduardo Casais,et al.  An Incremental Class Reorganization Approach , 1992, ECOOP.

[71]  Carsten Sapia,et al.  On Schema Evolution in Multidimensional Databases , 1999, DaWaK.

[72]  Mukesh K. Mohania,et al.  Algorithms for Adapting Materialised Views in Data Warehouses , 1996, CODAS.

[73]  Markus Tresch Evolution in Objekt-Datenbanken , 1995 .

[74]  D. Sjøberg,et al.  Quantifying schema evolution , 1993, Inf. Softw. Technol..

[75]  Timos K. Sellis,et al.  A survey of logical models for OLAP databases , 1999, SGMD.

[76]  Volker Markl,et al.  Mistral - Processing Relational Queries using a Multidimensional Access Technique , 1999, Datenbank Rundbr..

[77]  Elke A. Rundensteiner,et al.  The SDCC framework for integrating existing algorithms for diverse data warehouse maintenance tasks , 1999, Proceedings. IDEAS'99. International Database Engineering and Applications Symposium (Cat. No.PR00265).

[78]  Für Wissensbasierte An Overview of Multidimensional Data Models for OLAP , 1999 .

[79]  Zohra Bellahsene View Adaptation in Data Warehousing Systems , 1998, DEXA.