Efficient Update of Data Warehouse Views with Generalised Referential Integrity Differential Files

Data warehouse (DW) views provide an efficient access to information integrated from source data. When changes are made to the source data, the corresponding views may be outdated. Thus, the maintenance of DW views is crucial for the currency of information. Recently, a method was proposed to use referential integrity differential files (RIDFs) to self-maintain DW views that contain select-project-joins over relations modelled in a star schema. However, it is not uncommon for applications to have relations that are modelled in other schemas such as a snowflake schema or a galaxy schema. In this paper, we generalise the concept of RIDFs; we propose a method that uses generalised RIDFs to self-maintain the DW views that contain joins over relations modelled in the star schema as well as non-star schemas. Our method computes new views by using only the old materialised views and files that keep the truly relevant tuples in the “delta”. Consequently, it avoids accessing the underlying source data, and hence leads to efficient update of DW views.

[1]  Wookey Lee,et al.  On the Independence of Data Warehouse from Databases in Maintaining Join Views , 1999, DaWaK.

[2]  Nick Roussopoulos,et al.  A case for dynamic view management , 2001, ACM Trans. Database Syst..

[3]  Gottfried Vossen,et al.  Monotonic complements for independent data warehouses , 2001, The VLDB Journal.

[4]  Jennifer Widom,et al.  Making views self-maintainable for data warehousing , 1996, Fourth International Conference on Parallel and Distributed Information Systems.

[5]  Gio Wiederhold,et al.  Incremental Recomputation of Active Relational Expressions , 1991, IEEE Trans. Knowl. Data Eng..

[6]  Leonid Libkin,et al.  An Improved Algorithm for the Incremental Recomputation of Active Relational Expressions , 1997, IEEE Trans. Knowl. Data Eng..

[7]  Dimitri Theodoratos,et al.  Constructing search spaces for materialized view selection , 2004, DOLAP '04.

[8]  Peter Brezany,et al.  On-line analytical processing on large databases managed by computational grids , 2004 .

[9]  Frank Wm. Tompa,et al.  Efficiently updating materialized views , 1986, SIGMOD '86.

[10]  W. H. Inmon,et al.  Building the data warehouse (2nd ed.) , 1996 .

[11]  Wookey Lee,et al.  Exploitation of Referential Integrity Constraints for Efficient Update of Data Warehouse Views , 2005, BNCOD.

[12]  Malcolm P. Atkinson Databases and the Grid: Who Challenges Whom? , 2003, BNCOD.

[13]  Peter L. Mott,et al.  LeedsCQ : A Scalable Continual Queries System , 2002, DEXA.

[14]  Henrik Engström,et al.  Evaluating Maintenance Policies for Externally Materialised Multi-source Views , 2003, BNCOD.

[15]  Inderpal Singh Mumick,et al.  Selection of Views to Materialize in a Data Warehouse , 2005, IEEE Trans. Knowl. Data Eng..

[16]  Mukesh K. Mohania,et al.  Making Aggregate Views Self-maintainable , 2000, Data Knowl. Eng..

[17]  Wenfei Fan,et al.  Keys with Upward Wildcards for XML , 2001, DEXA.

[18]  W. H. Inmon,et al.  Building the data warehouse , 1992 .

[19]  Jennifer Widom,et al.  View maintenance in a warehousing environment , 1995, SIGMOD '95.

[20]  David Nelson,et al.  Database: Enterprise, Skills and Innovation, 22nd British National Conference on Databases, BNCOD 22, Sunderland, UK, July 5-7, 2005, Proceedings , 2005, BNCOD.

[21]  A Min Tjoa,et al.  Managing Time Consistency for Active Data Warehouse Environments , 2001, DaWaK.

[22]  Nam Huyn,et al.  Multiple-View Self-Maintenance in Data Warehousing Environments , 1997, VLDB.