Schema Evolution for Databases and Data Warehouses

Like all software systems, databases are subject to evolution as time passes. The impact of this evolution is tremendous as every change to the schema of a database affects the syntactic correctness and the semantic validity of all the surrounding applications and de facto necessitates their maintenance in order to remove errors from their source code. This survey provides a walk-through on different approaches to the problem of handling database and data warehouse schema evolution. The areas covered include (a) published case studies with statistical information on database evolution, (b) techniques for managing schema and view evolution, (c) techniques pertaining to the area of data warehouses, and, (d) prospects for future research.

[1]  Gottfried Vossen,et al.  Schema versioning in data warehouses: Enabling cross-version querying via schema augmentation , 2006, Data Knowl. Eng..

[2]  Carlo Curino,et al.  Graceful database schema evolution: the PRISM workbench , 2008, Proc. VLDB Endow..

[3]  John F. Roddick,et al.  A survey of schema versioning issues for database systems , 1995, Inf. Softw. Technol..

[4]  George Papastefanatos,et al.  Language Extensions for the Automation of Database Schema Evolution , 2008, ICEIS.

[5]  Apostolos V. Zarras,et al.  Open-Source Databases: Within, Outside, or Beyond Lehman's Laws of Software Evolution? , 2014, CAiSE.

[6]  Erhard Rahm,et al.  Recent Advances in Schema and Ontology Evolution , 2011, Schema Matching and Mapping.

[7]  Christoph Quix,et al.  Repository Support for Data Warehouse Evolution , 1999, DMDW.

[8]  Apostolos V. Zarras,et al.  Growing up with stability: How open-source relational databases evolve , 2015, Inf. Syst..

[9]  George Papastefanatos,et al.  HECATAEUS: Regulating schema evolution , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[10]  Dewayne E. Perry,et al.  Metrics and laws of software evolution-the nineties view , 1997, Proceedings Fourth International Software Metrics Symposium.

[11]  Carlo Curino,et al.  Update Rewriting and Integrity Constraint Maintenance in a Schema Evolution Support System: PRISM++ , 2010, Proc. VLDB Endow..

[12]  Johann Eder,et al.  Analysing Slices of Data Warehouses to Detect Structural Modifications , 2004, CAiSE.

[13]  Carsten Sapia,et al.  On Schema Evolution in Multidimensional Databases , 1999, DaWaK.

[14]  Robert Wrembel A Survey of Managing the Evolution of Data Warehouses , 2009, Int. J. Data Warehous. Min..

[15]  Jesús M. González-Barahona,et al.  The evolution of the laws of software evolution , 2013, ACM Comput. Surv..

[16]  George Papastefanatos,et al.  Policy-Regulated Management of ETL Evolution , 2009, J. Data Semant..

[17]  Elke A. Rundensteiner,et al.  The CVS Algorithm for View Synchronization in Evolvable Large-Scale Information Systems , 1998, EDBT.

[18]  Alberto O. Mendelzon,et al.  Updating OLAP dimensions , 1999, DOLAP '99.

[19]  Kenneth A. Ross,et al.  Adapting materialized views after redefinitions: techniques and a performance study , 2001, Inf. Syst..

[20]  Johann Eder,et al.  Automatic Detection of Structural Changes in Data Warehouses , 2003, DaWaK.

[21]  A. Maule,et al.  Impact analysis of database schema changes , 2008, 2008 ACM/IEEE 30th International Conference on Software Engineering.

[22]  Carlo Curino,et al.  Schema Evolution in Wikipedia - Toward a Web Information System Benchmark , 2008, ICEIS.

[23]  Iulian Neamtiu,et al.  Collateral evolution of applications and databases , 2009, IWPSE-Evol '09.

[24]  Matteo Golfarelli,et al.  A Survey on Temporal Data Warehousing , 2009, Int. J. Data Warehous. Min..

[25]  Robert Wrembel,et al.  Metadata Management in a Multiversion Data Warehouse , 2005, OTM Conferences.

[26]  George Papastefanatos,et al.  Impact Analysis and Policy-Conforming Rewriting of Evolving Data-Intensive Ecosystems , 2015, Journal on Data Semantics.

[27]  Zohra Bellahsene Schema Evolution in Data Warehouses , 2002, Knowledge and Information Systems.

[28]  Torben Bach Pedersen,et al.  Schema Evolution for Stars and Snowflakes , 2004, ICEIS.

[29]  Alberto O. Mendelzon,et al.  Maintaining data cubes under dimension updates , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[30]  Zohra Bellahsene View Adaptation in Data Warehousing Systems , 1998, DEXA.

[31]  D. Sjøberg,et al.  Quantifying schema evolution , 1993, Inf. Softw. Technol..

[32]  Salima Benbernou,et al.  A survey on service quality description , 2013, CSUR.

[33]  George Papastefanatos,et al.  Design Metrics for Data Warehouse Evolution , 2008, ER.

[34]  Carlo Curino,et al.  Automating the database schema evolution process , 2012, The VLDB Journal.

[35]  George Papastefanatos,et al.  Automating the Adaptation of Evolving Data-Intensive Ecosystems , 2013, ER.

[36]  Philip A. Bernstein,et al.  Worry-free database upgrades: automated model-driven evolution of schemas and complex mappings , 2010, SIGMOD Conference.

[37]  Meir M. Lehman,et al.  Rules and Tools for Software Evolution Planning and Management , 2001, Ann. Softw. Eng..

[38]  Meir M. Lehman,et al.  A Model of Large Program Development , 1976, IBM Syst. J..

[39]  Dong Qiu,et al.  An empirical analysis of the co-evolution of schema and code in database applications , 2013, ESEC/FSE 2013.

[40]  Iulian Neamtiu,et al.  Schema evolution analysis for embedded databases , 2011, 2011 IEEE 27th International Conference on Data Engineering Workshops.

[41]  George Papastefanatos,et al.  Metrics for the Prediction of Evolution Impact in ETL Ecosystems: A Case Study , 2012, Journal on Data Semantics.

[42]  Apostolos V. Zarras,et al.  How is Life for a Table in an Evolving Relational Schema? Birth, Death and Everything in Between , 2015, ER.

[43]  Johann Eder,et al.  Changes of Dimension Data in Temporal Data Warehouses , 2001, DaWaK.