Rule-Based Management of Schema Changes at ETL Sources

In this paper, we visit the problem of the management of inconsistencies emerging on ETL processes as results of evolution operations occurring at their sources. We abstract Extract-Transform-Load (ETL) activities as queries and sequences of views. ETL activities and its sources are uniformly modeled as a graph that is annotated with rules for the management of evolution events. Given a change at an element of the graph, our framework detects the parts of the graph that are affected by this change and highlights the way they are tuned to respond to it. We then present the system architecture of a tool called Hecataeus that implements the main concepts of the proposed framework.

[1]  Zoubida Kedad,et al.  A Logical Model for Data Warehouse Design and Evolution , 2000, DaWaK.

[2]  Robert Wrembel,et al.  Managing Evolution of Data Warehouses by Means of Nested Transactions , 2006, ADVIS.

[3]  Carlo Curino,et al.  Managing and querying transaction-time databases under schema evolution , 2008, Proc. VLDB Endow..

[4]  George Papastefanatos,et al.  What-If Analysis for Data Warehouse Evolution , 2007, DaWaK.

[5]  Elke A. Rundensteiner,et al.  The CVS Algorithm for View Synchronization in Evolvable Large-Scale Information Systems , 1998, EDBT.

[6]  Robert Wrembel,et al.  Metadata Management in a Multiversion Data Warehouse , 2005, OTM Conferences.

[7]  Renée J. Miller,et al.  Preserving mapping consistency under schema changes , 2004, The VLDB Journal.

[8]  Γιώργος Παπαστεφανάτος Policy Regulated Management Of Schema Evolution In Database-centric Environments , 2009 .

[9]  George Papastefanatos,et al.  Policy-Regulated Management of ETL Evolution , 2009, J. Data Semant..

[10]  Matteo Golfarelli,et al.  A Survey on Temporal Data Warehousing , 2009, Int. J. Data Warehous. Min..

[11]  Carsten Sapia,et al.  On Schema Evolution in Multidimensional Databases , 1999, DaWaK.

[12]  Mukesh K. Mohania,et al.  Algorithms for Adapting Materialised Views in Data Warehouses , 1996, CODAS.

[13]  Kenneth A. Ross,et al.  Adapting materialized views after redefinitions: techniques and a performance study , 2001, Inf. Syst..

[14]  Zohra Bellahsene Schema Evolution in Data Warehouses , 2002, Knowledge and Information Systems.

[15]  Torben Bach Pedersen,et al.  Schema Evolution for Stars and Snowflakes , 2004, ICEIS.

[16]  Gottfried Vossen,et al.  Schema Versioning in Data Warehouses , 2004, ER.

[17]  Isidro Ramos,et al.  Advances in Database Technology — EDBT'98 , 1998, Lecture Notes in Computer Science.