Organization of Parallel Query Processing in Multiprocessor Database Machines with Hierarchical Architecture

The development of database systems with hierarchical hardware architecture is currently a perspective trend in the field of parallel database machines. Hierarchical architectures have been suggested with the aim to combine advantages of shared-nothing architectures and architectures with shared memory and disks. A commonly accepted way of construction of hierarchical systems is to combine shared-memory (shared-everything) clusters in a unique system without shared resources. However, such architectures cannot ensure data accessibility under hardware failures on the processor cluster level, which limits their use in systems with high fault-tolerance requirements. In this paper, an alternative approach to construction of hierarchical systems is suggested. In accordance with this approach, the systems is constructed as an assembly of processor clusters with shared disks, with each cluster being a two-level multiprocessor structure with a standard strongly connected topology of interprocessor connections. A stream model for organization of parallel query processing in systems with the hierarchical architecture suggested is described. This model has been implemented in a prototype parallel database management system Omega designed for Russian multiprocessor computational systems MBC-100/1000. Our experiments show that the total performance of the processor clusters in the Omega system is comparable with that of the processor clusters with shared resources even in the case of great data skew. At the same time, the clusters of the Omega system are capable of ensuring a higher degree of data availability compared to the clusters with shared-memory architectures.

[1]  Divesh Srivastava,et al.  Fault Tolerance Issues in Data Declustering for Parallel Database Systems. , 1994 .

[2]  Goetz Graefe,et al.  Query evaluation techniques for large databases , 1993, CSUR.

[3]  Sivarama P. Dandamudi,et al.  Performance Evaluation of a Two-Level Hierarchical Parallel Database System , 1997 .

[4]  Leonid B. Sokolinsky Operating System Support for a Parallel DBMS with an Hierarchical Shared-Nothing Architecture , 1999, ADBIS.

[5]  Anupam Bhide,et al.  An Analysis of Three Transaction Processing Architectures , 1988, VLDB.

[6]  Michael Stonebraker,et al.  Parallel Database Systems , 1990, Lecture Notes in Computer Science.

[7]  Leonid B. Sokolinsky,et al.  Implementation Principles of File Management System for Omega Parallel DBMS , 2000 .

[8]  Philip S. Yu,et al.  Effectiveness of Parallel Joins , 1990, IEEE Trans. Knowl. Data Eng..

[9]  M. Howard Williams,et al.  Data Placement in Parallel Database Systems , 1996 .

[10]  Michael Stonebraker,et al.  The Case for Shared Nothing , 1985, HPTS.

[11]  Shreekant S. Thakkar,et al.  Performance of an OLTP application on symmetry multiprocessor system , 1990, ISCA '90.

[12]  Hamid Pirahesh,et al.  Parallelism in Relational Database Management Systems , 1994, IBM Syst. J..

[13]  Goetz Graefe,et al.  Encapsulation of parallelism in the Volcano query processing system , 1990, SIGMOD '90.

[14]  Leonid B. Sokolinsky Interprocessor Communication Support in the Omega Parallel Database System , 1999 .

[15]  Randy H. Katz,et al.  A case for redundant arrays of inexpensive disks (RAID) , 1988, SIGMOD '88.

[16]  Michael Stonebraker,et al.  Operating system support for database management , 1981, CACM.

[17]  Dina Bitton,et al.  Disk Shadowing , 1988, VLDB.

[18]  Patrick Valduriez,et al.  Parallel database systems: The case for shared-something , 1993, Proceedings of IEEE 9th International Conference on Data Engineering.

[19]  Donovan A. Schneider,et al.  The Gamma Database Machine Project , 1990, IEEE Trans. Knowl. Data Eng..

[20]  Patrick Valduriez,et al.  Prototyping Bubba, A Highly Parallel Database System , 1990, IEEE Trans. Knowl. Data Eng..

[21]  Kien A. Hua,et al.  Interconnecting shared-everything systems for efficient parallel query processing , 1991, [1991] Proceedings of the First International Conference on Parallel and Distributed Information Systems.

[22]  Luc Bouganim,et al.  Dynamic Load Balancing in Hierarchical Parallel Database Systems , 1996, VLDB.