A User-centric View of Data Warehouse Maintenance Issues

Data in warehouses need to be updated in a timely manner from underlying operational data sources. This is referred to as warehouse maintenance. Not all of the data in the warehouse has the same requirement in terms of staleness (how old can it be with respect to the actual data), or its inverse freshness, and consistency (combining data from autonomous sources may give rise to some inconsistency). Given the requirements and schema information of a data warehouse, identifying policies for change detection and warehouse maintenance is a complex task. In this paper we identify a problem with current specification of user requirements, and suggest a specification scheme that is more general and user-oriented than extant suggestions. We also survey various policies that have been proposed for data propagation and analyse how change detection capabilities of sources influence user, as well as system requirements.

[1]  Eric N. Hanson,et al.  A performance analysis of view materialization strategies , 1987, SIGMOD '87.

[2]  Jennifer Widom,et al.  On-line warehouse view maintenance , 1997, SIGMOD '97.

[3]  Gang Zhou,et al.  A framework for supporting data integration using the materialized and virtual approaches , 1996, SIGMOD '96.

[4]  Jennifer Widom,et al.  Maintenance of Materialized Views: Problems, Techniques, and Applications , 1999, IEEE Data Eng. Bull..

[5]  Jennifer Widom,et al.  Maintaining Temporal Views over Non-Temporal Information Sources for Data Warehousing , 1998, EDBT.

[6]  Inderpal Singh Mumick,et al.  The Stanford Data Warehousing Project , 1995 .

[7]  Peter C. Lockemann,et al.  Distributed Events in Active Database Systems: Letting the Genie out of the Bottle , 1998, Data Knowl. Eng..

[8]  Surajit Chaudhuri,et al.  An overview of data warehousing and OLAP technology , 1997, SGMD.

[9]  Yue Zhuge,et al.  Incremental maintenance of consistent data warehouses , 1999 .

[10]  Scarlet Schwiderski,et al.  Monitoring the behaviour of distributed systems , 1996 .

[11]  Jeffrey D. Ullman,et al.  Implementing data cubes efficiently , 1996, SIGMOD '96.

[12]  Abraham Silberschatz,et al.  View maintenance issues for the chronicle data model (extended abstract) , 1995, PODS.

[13]  Sharma Chakravarthy,et al.  Formal semantics of composite events for distributed environments , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[14]  Minsoo Lee,et al.  Speeding up Warehouse Physical Design Using a Randomized Algorithm , 1999, DMDW.

[15]  Jaideep Srivastava,et al.  Analytical modeling of materialized view maintenance , 1988, PODS '88.

[16]  Umeshwar Dayal,et al.  On the Updatability of Relational Views , 1978, VLDB.

[17]  Arie Segev,et al.  Currency-based updates to distributed materialized views , 1990, [1990] Proceedings. Sixth International Conference on Data Engineering.

[18]  Jennifer Widom,et al.  Research problems in data warehousing , 1995, CIKM '95.

[19]  Alejandro P. Buchmann,et al.  Research Issues in Data Warehousing , 1997, BTW.

[20]  Klaus R. Dittrich,et al.  The active database management system manifesto: a rulebase of ADBMS features , 1995, SGMD.

[21]  Gang Zhou,et al.  Towards the Study of Performance Trade-offs Between Materialized and Virtual Integrated Views , 1996, VIEWS.

[22]  Ambuj K. Singh,et al.  Efficient view maintenance at data warehouses , 1997, SIGMOD '97.

[23]  Latha S. Colby,et al.  Algorithms for deferred view maintenance , 1996, SIGMOD '96.

[24]  Arie Segev,et al.  Optimal update policies for distributed materialized views , 1991 .

[25]  Yue Zhuge,et al.  The Strobe algorithms for multi-source warehouse consistency , 1996, Fourth International Conference on Parallel and Distributed Information Systems.

[26]  Inderpal Singh Mumick,et al.  Selection of Views to Materialize Under a Maintenance Cost Constraint , 1999, ICDT.

[27]  Nick Roussopoulos,et al.  Principles and Techniques in the Design of ADMS± , 1986, Computer.