Data Warehouse Benchmarking with DWEB

Performance evaluation is a key issue for designers and users of Database Management Systems (DBMSs). Performance is generally assessed with software benchmarks that help, e.g., test architectural choices, compare different technologies or tune a system. In the particular context of data warehousing and On-Line Analytical Processing (OLAP), although the Transaction Processing Performance Council (TPC) aims at issuing standard decision-support benchmarks, few benchmarks do actually exist. We present in this chapter the Data Warehouse Engineering Benchmark (DWEB), which allows generating various ad-hoc synthetic data warehouses and workloads. DWEB is fully parameterized to fulfill various data warehouse design needs. However, two levels of parameterization keep it relatively easy to tune. We also expand on our previous work on DWEB by presenting its new Extract, Transform, and Load (ETL) feature as well as its new execution protocol. A Java implementation of DWEB is freely available on-line, which can be interfaced with most existing relational DMBSs. To the best of our knowledge, DWEB is the only easily available, up-to-date benchmark for data warehouses.

[1]  Michel Schneider,et al.  Benchmarking OODBs with a Generic Tool , 2000, J. Database Manag..

[2]  W. H. Inmon,et al.  Building the Data Warehouse,3rd Edition , 2002 .

[3]  Ramesh Bhashyam TPC-D—the challenges, issues and results , 1996, SGMD.

[4]  Goetz Graefe,et al.  Multi-table joins through bitmapped join indices , 1995, SGMD.

[5]  Sergio Luján-Mora,et al.  Towards a Metric Suite for Conceptual Models of Datawarehouse , 2004 .

[6]  Shirley Ann Becker,et al.  Effective Databases for Text & Document Management , 2003 .

[7]  Kamalakar Karlapalem,et al.  Some issues in design of data warehousing systems , 2001 .

[8]  Meikel Pöss,et al.  New TPC benchmarks for decision support and web commerce , 2000, SGMD.

[9]  Carrie Ballinger TPC-D: Benchmarking for Decision Support , 1993, The Benchmark Handbook.

[10]  Ralph Kimball,et al.  The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling , 1996 .

[11]  Roderic G. G. Cattell The benchmark handbook for database and transaction processing systems , 1991 .

[12]  Nick Roussopoulos,et al.  A Performance Evaluation of Online Warehouse Update Algorithms , 1998 .

[13]  Mario Piattini,et al.  Towards a Metrics Suite for Conceptual Models of Datawarehouses , 2004, Software Audit and Metrics.

[14]  Omar Boussaïd,et al.  Benchmarking data warehouses , 2007, Int. J. Bus. Intell. Data Min..

[15]  Omar Boussaïd,et al.  DWEB: A Data Warehouse Engineering Benchmark , 2005, DaWaK.

[16]  Mario Piattini,et al.  Metrics for Data Warehouse Quality , 2005, Encyclopedia of Information Science and Technology.

[17]  John Matthew Poole Common Warehouse Metamodel Developer's Guide , 2003 .

[18]  Meikel Pöss,et al.  TPC-DS, taking decision support benchmarking to the next level , 2002, SIGMOD '02.