论文信息 - Adaptive Virtual Partitioning for OLAP Query Processing in a Database Cluster

Adaptive Virtual Partitioning for OLAP Query Processing in a Database Cluster

OLAP queries are typically heavy-weight and ad-hoc thus requiring high storage capacity and processing power. In this paper, we address this problem using a database cluster which we see as a cost-effective alternative to a tightly-coupled multiprocessor. We propose a solution to efficient OLAP query processing using a simple data parallel processing technique called adaptive virtual partitioning which dynamically tunes partition sizes, without requiring any knowledge about the database and the DBMS. To validate our solution, we implemented a Java prototype on a 32 node cluster system and ran experiments with typical queries of the TPC-H benchmark. The results show that our solution yields linear, and sometimes super-linear, speedup. In many cases, it outperforms traditional virtual partitioning by factors superior to 10.

Marta Mattoso | Patrick Valduriez | Alexandre A. B. Lima | P. Valduriez | M. Mattoso

[1] GraefeGoetz. Query evaluation techniques for large databases , 1993 .

[2] Fuat Akal,et al. OLAP Query Evaluation in a Database Cluster: A Performance Study on Intra-Query Parallelism , 2002, ADBIS.

[3] Patrick Valduriez,et al. Parallel database systems: Open problems and new issues , 1993, Distributed and Parallel Databases.

[4] Klemens Böhm,et al. OLAP Query Routing and Physical Design in a Database Cluster , 2000, EDBT.

[5] Erhard Rahm,et al. Multi-Dimensional Database Allocation for Parallel Data Warehouses , 2000, VLDB.

[6] Marta Mattoso,et al. OLAP Query Processing in a Database Cluster , 2004, Euro-Par.

[7] Patrick Valduriez,et al. Principles of distributed database systems (2nd ed.) , 1999 .

[8] Patrick Valduriez,et al. Scaling Up the Preventive Replication of Autonomous Databases in Cluster Systems , 2004, VECPAR.

[9] Patrick Valduriez,et al. Principles of Distributed Database Systems , 1990 .

[10] Goetz Graefe,et al. Query evaluation techniques for large databases , 1993, CSUR.

[11] Narasimhaiah Gorla,et al. Features to consider in a data warehousing system , 2003, CACM.