论文信息 - A cluster architecture for parallel data warehousing

A cluster architecture for parallel data warehousing

Describes the parallel, cluster-based implementation of an algorithm for the computation of a database operator known as the datacube. Though a number of efficient sequential algorithms have recently been proposed for this problem, very little research effort has been expended upon cost-effective parallelization techniques. Our approach builds directly upon the existing sequential proposals and is designed to be both load-balanced and communication-efficient. We also provide experimental results that demonstrate the viability of our technique under a variety of test conditions. Ultimately, we show that parallel performance relative to the underlying sequential algorithm (speedup) is near-optimal.

[1] Nick Roussopoulos,et al. Direct spatial search on pictorial databases using packed R-trees , 1985, SIGMOD Conference.

[2] Stephen R. Schach,et al. A Shifting Algorithm for Min-Max Tree Partitioning , 1980, JACM.

[3] Kenneth A. Ross,et al. Fast Computation of Sparse Datacubes , 1997, VLDB.

[4] Ronald L. Rivest,et al. Introduction to Algorithms , 1990 .

[5] Christos Faloutsos,et al. On packing R-trees , 1993, CIKM '93.

[6] Philippe Flajolet,et al. Probabilistic Counting Algorithms for Data Base Applications , 1985, J. Comput. Syst. Sci..

[7] Nick Roussopoulos,et al. Cubetree: organization of and bulk incremental updates on the data cube , 1997, SIGMOD '97.

[8] Jeffrey F. Naughton,et al. An array-based algorithm for simultaneous multidimensional aggregates , 1997, SIGMOD '97.

[9] Sunita Sarawagi,et al. On computing the data cube , 1996 .

[10] Jeffrey F. Naughton,et al. Storage Estimation for Multidimensional Aggregates in the Presence of Hierarchies , 1996, VLDB.

[11] A. Guttmma,et al. R-trees: a dynamic index structure for spatial searching , 1984 .

[12] Jeffrey D. Ullman,et al. Implementing data cubes efficiently , 1996, SIGMOD '96.

[13] Susanne E. Hambrusch,et al. Parallelizing the Data Cube , 2001, ICDT.

[14] Raghu Ramakrishnan,et al. Bottom-up computation of sparse and Iceberg CUBE , 1999, SIGMOD '99.

[15] Jeffrey F. Naughton,et al. On the Computation of Multidimensional Aggregates , 1996, VLDB.