Repository Support for Data Warehouse Evolution

Data warehouses are complex systems consisting of many components which store highlyaggregated data for decision support. Due to the role of the data warehouses in the daily business work of an enterprise, the requirements for the design and the implementation are dynamic and subjective. Therefore, data warehouse design is a continuous process which has to reflect the changing environment of a data warehouse, i.e. the data warehouse must evolve in reaction to the enterprise’s evolution. Based on existing meta models for the architecture and quality of a data warehouse, we propose in this paper a data warehouse process model to capture the dynamics of a data warehouse. The evolution of a data warehouse is represented as a special process and the evolution operators are linked to the corresponding architecture components and quality factors they affect. We show the application of our model on schema evolution in data warehouses and its consequences on data warehouse views. The models have been implemented in the metadata repository ConceptBase which can be used to analyze the result of evolution operations and to monitor the quality of a data warehouse.

[1]  Zohra Bellahsene Structural view maintenance in data warehousing systems , 1998, BDA.

[2]  Gang Zhou,et al.  A framework for supporting data integration using the materialized and virtual approaches , 1996, SIGMOD '96.

[3]  Elke A. Rundensteiner,et al.  On Preserving Views in Evolving Environments , 1997, KRDB.

[4]  Kenneth A. Ross,et al.  Adapting materialized views after redefinitions , 1995, SIGMOD '95.

[5]  Panos Vassiliadis,et al.  Towards Quality-oriented Data Warehouse Usage and Evolution , 2000, Inf. Syst..

[6]  Manfred A. Jeusfeld,et al.  View maintenance and change notification for application program views , 1998, SAC '98.

[7]  Matthias Jarke,et al.  Design and Analysis of Quality Information for Data Warehouses , 1998, ER.

[8]  Surajit Chaudhuri,et al.  Maintenance of Materialized Views: Problems, Techniques, and Applications. , 1995 .

[9]  Giri Kumar Tayi,et al.  Examining data quality , 1998, CACM.

[10]  Matthias Jarke,et al.  Architecture and Quality in Data Warehouses: An Extended Repository Approach , 1999, Information Systems.

[11]  Inderpal Singh Mumick,et al.  The Stanford Data Warehousing Project , 1995 .

[12]  Matthias Jarke,et al.  Improving OLTP data quality using data warehouse mechanisms , 1999, SIGMOD '99.

[13]  Alberto O. Mendelzon,et al.  Maintaining data cubes under dimension updates , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[14]  Victor R. Basili,et al.  Representing Software Engineering Models: The TAME Goal Oriented Approach , 1992, IEEE Trans. Software Eng..

[15]  W. Alex Gray,et al.  Organising Knowledge of a Federated Database System to Support Multiple View Generation , 1998, KRDB.

[16]  Elke A. Rundensteiner,et al.  Optimizing Performance of Schema Evolution Sequences , 2000, Objects and Databases.