Issues in Developing Very Large Data Warehouses

The size of The Boeing Company posts some stringent requirements on data warehouse design and implementation. We summarize four interesting and challenging issues in developing very large scale data warehouses, namely failure recovery, incremental update maintenance, cost model for schema design and query optimization, and metadata definition and management. For each issue, we give the reasons we think it is important but not well-addressed in research literature and commercial products, and our current research to solve it.

[1]  Jennifer Widom,et al.  View maintenance in a warehousing environment , 1995, SIGMOD '95.

[2]  Jennifer Widom,et al.  Making views self-maintainable for data warehousing , 1996, Fourth International Conference on Parallel and Distributed Information Systems.

[3]  Patrick Valduriez,et al.  Principles of Distributed Database Systems , 1990 .

[4]  Yue Zhuge,et al.  The Strobe algorithms for multi-source warehouse consistency , 1996, Fourth International Conference on Parallel and Distributed Information Systems.

[5]  Jennifer Widom,et al.  On-line warehouse view maintenance , 1997, SIGMOD '97.