Graceful database schema evolution: the PRISM workbench

Supporting graceful schema evolution represents an unsolved problem for traditional information systems that is further exacerbated in web information systems, such as Wikipedia and public scientific databases: in these projects based on multiparty cooperation the frequency of database schema changes has increased while tolerance for downtimes has nearly disappeared. As of today, schema evolution remains an error-prone and time-consuming undertaking, because the DB Administrator (DBA) lacks the methods and tools needed to manage and automate this endeavor by (i) predicting and evaluating the effects of the proposed schema changes, (ii) rewriting queries and applications to operate on the new schema, and (iii) migrating the database. Our PRISM system takes a big first step toward addressing this pressing need by providing: (i) a language of Schema Modification Operators to express concisely complex schema changes, (ii) tools that allow the DBA to evaluate the effects of such changes, (iii) optimized translation of old queries to work on the new schema version, (iv) automatic data migration, and (v) full documentation of intervened changes as needed to support data provenance, database flash back, and historical queries. PRISM solves these problems by integrating recent theoretical advances on mapping composition and invertibility, into a design that also achieves usability and scalability. Wikipedia and its 170+ schema versions provided an invaluable testbed for validating PRISM tools and their ability to support legacy queries.

[1]  Alin Deutsch,et al.  MARS: A System for Publishing XML from Mixed and Redundant Storage , 2003, VLDB.

[2]  Carlo Curino,et al.  Managing and querying transaction-time databases under schema evolution , 2008, Proc. VLDB Endow..

[3]  Anthony Cleve,et al.  Co-transformations in Database Applications Evolution , 2005, GTTSE.

[4]  Denilson Barbosa,et al.  Designing Information-Preserving Mapping Schemes for XML , 2005, VLDB.

[5]  H. V. Jagadish,et al.  Scalable versioning in distributed databases with commuting updates , 1997, Proceedings 13th International Conference on Data Engineering.

[6]  Jagadish Chatarji Pulakhandam ODP.NET Developer's Guide: Oracle Database 10g Development with Visual Studio 2005 and the Oracle Data Provider for .NET A practical guide for developers ... Developer Tools for Visual Studio 2005 , 2007 .

[7]  John F. Roddick,et al.  A survey of schema versioning issues for database systems , 1995, Inf. Softw. Technol..

[8]  Erhard Rahm,et al.  Rondo: a programming platform for generic model management , 2003, SIGMOD '03.

[9]  Matteo Golfarelli,et al.  X-Time: Schema Versioning and Cross-Version Querying in Data Warehouses , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[10]  Renée J. Miller,et al.  Mapping Adaptation under Evolving Schemas , 2003, VLDB.

[11]  Jean-Luc Hainaut,et al.  Database application evolution: A transformational approach , 2006, Data Knowl. Eng..

[12]  Ronald Fagin,et al.  Quasi-inverses of schema mappings , 2007, PODS '07.

[13]  Ronald Fagin,et al.  Composing schema mappings: second-order dependencies to the rescue , 2004, PODS '04.

[14]  Ronald Fagin,et al.  Inverting schema mappings , 2006, TODS.

[15]  Jeffrey D. Ullman,et al.  Principles of Database Systems , 1980 .

[16]  Philip A. Bernstein,et al.  Composition of mappings given by embedded dependencies , 2005, PODS '05.

[17]  Jayant Madhavan,et al.  Composing Mappings Among Data Sources , 2003, VLDB.

[18]  Philip A. Bernstein,et al.  Implementing mapping composition , 2007, The VLDB Journal.

[19]  Young-Gook Ra Relational Schema Evolution for Program Independency , 2004, CIT.

[20]  Alin Deutsch,et al.  Optimization Properties for Classes of Conjunctive Regular Path Queries , 2001, DBPL.

[21]  Rachel Pottinger,et al.  Schema Repository for Database Schema Evolution , 2006, 17th International Workshop on Database and Expert Systems Applications (DEXA'06).

[22]  Carlo Curino,et al.  Managing the History of Metadata in Support for DB Archiving and Schema Evolution , 2008, ER Workshops.

[23]  Philip A. Bernstein,et al.  Applying Model Management to Classical Meta Data Problems , 2003, CIDR.

[24]  Erhard Rahm,et al.  Data Warehouse Scenarios for Model Management , 2000, ER.

[25]  Gottfried Vossen,et al.  Schema Versioning in Data Warehouses , 2004, ER.

[26]  Cong Yu,et al.  Semantic Adaptation of Schema Mappings when Schemas Evolve , 2005, VLDB.

[27]  Lipyeow Lim,et al.  Preserving XML queries during schema evolution , 2007, WWW '07.

[28]  Álvaro F. Moreira,et al.  Temporal and versioning model for schema evolution in object-oriented databases , 2005, Data Knowl. Eng..

[29]  Carlo Curino,et al.  Schema Evolution in Wikipedia - Toward a Web Information System Benchmark , 2008, ICEIS.