论文信息 - Adaptive parallel aggregation algorithms

Adaptive parallel aggregation algorithms

Aggregation and duplicate removal are common in SQL queries. However, in the parallel query processing literature, aggregate processing has received surprisingly little attention; furthermore, for each of the traditional parallel aggregation algorithms, there is a range of grouping selectivities where the algorithm performs poorly. In this work, we propose new algorithms that dynamically adapt, at query evaluation time, in response to observed grouping selectivities. Performance analysis via analytical modeling and an implementation on a workstation-cluster shows that the proposed algorithms are able to perform well for all grouping selectivities. Finally, we study the effect of data skew and show that for certain data sets the proposed algorithms can even outperform the best of traditional approaches.

Jeffrey F. Naughton | Ambuj Shatdal | J. Naughton | A. Shatdal

[1] Stanley Y. W. Su,et al. Parallel Algorithms and Their Implementation in MICRONET , 1982, VLDB.

[2] David J. DeWitt,et al. Parallel algorithms for the execution of relational database operations , 1983, TODS.

[3] Donovan A. Schneider,et al. The Gamma Database Machine Project , 1990, IEEE Trans. Knowl. Data Eng..

[4] Alfred G. Dale,et al. A Taxonomy and Performance Model of Data Skew Effects in Parallel Joins , 1991, VLDB.

[5] S. Seshadri. Probabilistic methods in query processing , 1992 .

[6] Miron Livny,et al. Managing Memory to Meet Multiclass Workload Response Time Goals , 1993, VLDB.

[7] Jack Dongarra,et al. Pvm 3 user's guide and reference manual , 1993 .

[8] Goetz Graefe,et al. Query evaluation techniques for large databases , 1993, CSUR.

[9] J. Bunge,et al. Estimating the Number of Species: A Review , 1993 .