Synchronization of Queries and Views Upon Schema Evolutions

One problem that arises when a database schema evolves is that queries and views defined on the previous schema version might no longer work properly. Evolving a database schema therefore entails redefining those queries and views to adapt them to the new schema. Although this problem has mainly been raised in the context of traditional information systems, solutions to it are also advocated in other database-related areas, such as Data Integration, Web Data Integration, and Data Warehouses. The problem is critical, since industrial organizations often need to adapt their databases and data warehouses to frequent changes in the real world. In this article, we survey existing approaches and tools for adapting queries and views upon a database schema evolution; we also propose a classification framework that enables a uniform comparison among many heterogeneous approaches and tools.
