Schema Evolution for Stars and Snowflakes

The most common implementation platform for multidimensional data warehouses is RDBMSs storing data in relational star and snowflake schemas. DW schemas evolve over time, which may invalidate existing analysis queries used for reporting purposes. However, the evolution properties of star and snowflake schemas have not previously been investigated systematically. This paper systematically investigates the evolution properties of star and snowflake schemas. Eight evolution operations are considered, covering insertion and deletion of dimensions, levels, dimension attributes, and measure attributes. For each operation, the formal semantics of the changes for star and snowflake schemas are given, and instance adaption and impact on existing queries are described. Finally, we compare the evolution properties of star and snowflake schemas, concluding that the star schema is considerably more robust towards schema changes than the snowflake schema.

[1]  Peter Gluchowski,et al.  Data Warehouse , 1997, Informatik-Spektrum.

[2]  Alberto O. Mendelzon,et al.  Temporal Queries in OLAP , 2000, VLDB.

[3]  Carsten Sapia,et al.  On Schema Evolution in Multidimensional Databases , 1999, DaWaK.

[4]  Christian S. Jensen,et al.  A foundation for capturing and querying complex multidimensional data , 2001, Inf. Syst..

[5]  Mark Levene,et al.  Why is the snowflake schema a good data warehouse design? , 2003, Inf. Syst..

[6]  Zoubida Kedad,et al.  A Logical Model for Data Warehouse Design and Evolution , 2000, DaWaK.

[7]  John F. Roddick,et al.  Schema evolution in database systems: an annotated bibliography , 1992, SGMD.

[8]  Panos Vassiliadis,et al.  Gulliver in the land of data warehousing: practical experiences and observations of a researcher , 2000, DMDW.

[9]  Markus Blaschka,et al.  FIESTA: A Framework for Schema Evolution in Multidimensional Databases (Abstract) , 2000, Datenbank Rundbr..

[10]  Yvan Bédard,et al.  A multidimensional and multiversion structure for OLAP applications , 2002, DOLAP '02.

[11]  Il-Yeol Song,et al.  Multidimensional Modeling with UML Package Diagrams , 2002, ER.

[12]  Barbara Dinter,et al.  Extending the E/R Model for the Multidimensional Paradigm , 1998, ER Workshops.

[13]  Johann Eder,et al.  Changes of Dimension Data in Temporal Data Warehouses , 2001, DaWaK.

[14]  Robert Wrembel,et al.  Modeling a Multiversion Data Warehouse: A Formal Approach , 2003, ICEIS.

[15]  Torben Bach Pedersen,et al.  Multidimensional Database Technology , 2001, Computer.

[16]  Panos Vassiliadis,et al.  Towards Quality-oriented Data Warehouse Usage and Evolution , 2000, Inf. Syst..

[17]  Gottfried Vossen,et al.  Consistency in data warehouse dimensions , 2002, Proceedings International Database Engineering and Applications Symposium.

[18]  Alberto O. Mendelzon,et al.  Updating OLAP dimensions , 1999, DOLAP '99.