Data Warehousing and OLAP in a Cluster Computer Environment

Decision oriented technologies, like data warehousing and on-line analytical processing (OLAP) systems store and handle very large volumes of data, requiring more efficient ways of dealing with them. Recent advances in parallel computing and high-speed networks using a cluster of PCs or workstations (COWs) offer a low cost solution for providing this scale up in performance by parallelism of data, and it’s processing, in the data warehouse. This paper investigates how the star join and data cube operations can be performed in parallel on a cluster of Pcs.

[1]  Jeffrey F. Naughton,et al.  Simultaneous optimization and evaluation of multiple dimensional queries , 1998, SIGMOD '98.

[2]  Rajkumar Buyya Cluster Computing : The Commodity Supercomputing , 1988 .

[3]  Yue Zhuge,et al.  Distributed and Parallel Computing Issues in Data Warehousing (Invited Talk) , 1998 .

[4]  Peter M. G. Apers,et al.  Parallel Evaluation of Multi-join Queries , 1996, ACPC.

[5]  Surajit Chaudhuri,et al.  An overview of data warehousing and OLAP technology , 1997, SGMD.

[6]  Alok N. Choudhary,et al.  Design and implementation of a scalable parallel system for multidimensional analysis and OLAP , 1999, Proceedings 13th International Parallel Processing Symposium and 10th Symposium on Parallel and Distributed Processing. IPPS/SPDP 1999.

[7]  Bongki Moon,et al.  A case for parallelism in data warehousing and OLAP , 1998, Proceedings Ninth International Workshop on Database and Expert Systems Applications (Cat. No.98EX130).

[8]  Gregory F. Pfister,et al.  In Search of Clusters , 1995 .

[9]  Krithi Ramamritham,et al.  Indexing and Compression in Data Warehouses , 1999, DMDW.

[10]  Jehoshua Bruck,et al.  Efficient Message Passing Interface (MPI) for Parallel Computing on Clusters of Workstations , 1997, J. Parallel Distributed Comput..

[11]  Patrick E. O'Neil,et al.  Improved query performance with variant indexes , 1997, SIGMOD '97.

[12]  Rajkumar Buyya,et al.  Cluster computing: the commodity supercomputer , 1999 .

[13]  Goetz Graefe,et al.  Query evaluation techniques for large databases , 1993, CSUR.

[14]  Inderpal Singh Mumick,et al.  Maintenance of data cubes and summary tables in a warehouse , 1997, SIGMOD '97.