Implementing relational database operations in a cube-connected multicomputer system

Parallel architectures for database processing should incorporate parallel CPU as well as parallel I/O (disk access) capability. The need to support parallel I/O gives rise to two important issues - data combination and non-uniform data distribution. Strategies for performing database operations in a cube-connected multicomputer system with parallel I/O are presented in this paper. The cube interconnection subsumes many other structures such as the tree, ring, etc. This property is exploited to efficiently support database operations such as Select, Aggregate, Join, and Project. The strategies presented here are unique in that they account for the non-uniform distribution of data across parallel paths by incorporating data redistribution steps as part of the overall algorithm. The two main data redistribution operations used are tuple balancing and merging. A simple analysis of the join and project operations is carried out assuming non-uniform data distributions. A more detailed simulation and study of issues related to query processing will be carried out as part of the future work.

[1]  John P. Hayes,et al.  A Microprocessor-based Hypercube Supercomputer , 1986, IEEE Micro.

[2]  Chaitanya K. Baru,et al.  Dynamically partitionable multicomputers with switchable memory , 1984, J. Parallel Distributed Comput..

[3]  A. Gottleib,et al.  The nyu ultracomputer- designing a mimd shared memory parallel computer , 1983 .

[4]  Keki B. Irani,et al.  A Database Machine for Very Large Relational Databases , 1985, IEEE Transactions on Computers.

[5]  O. Frieder Database processing on a cube-connected multicomputer , 1987 .

[6]  M. Kitsuregawa,et al.  Architecture and performance of relational algebra machine GRACE , 1989 .

[7]  Karen A. Frenkel,et al.  Evaluating two massively parallel machines , 1986, CACM.

[8]  Chaitanya K. Baru,et al.  The Architecture of SM3: A Dynamically Partitionable Multicomputer System , 1986, IEEE Transactions on Computers.

[9]  David J. DeWitt,et al.  A PERFORMANCE EVALUATION OF DATABASE MACHINE ARCHITECTURES , 1981 .

[10]  P. Bruce Berra,et al.  The Role of Associative Array Processers in Data Base Machine Architecture , 1979, Computer.

[11]  David J. DeWitt,et al.  GAMMA - A High Performance Dataflow Database Machine , 1986, VLDB.

[12]  John C. Peterson,et al.  The Mark III Hypercube-Ensemble Concurrent Computer , 1985, International Conference on Parallel Processing.

[13]  John P. Hayes,et al.  Architecture of a Hypercube Supercomputer , 1986, ICPP.

[14]  David Elliot Shaw,et al.  NON-VON's performance on certain database benchmarks , 1986, IEEE Transactions on Software Engineering.

[15]  Chaitanya K. Baru,et al.  Implementing Relational Database Operations in a Cube-Connected Multicomputer System , 1987, ICDE.

[16]  David J. DeWitt,et al.  Parallel algorithms for the execution of relational database operations , 1983, TODS.

[17]  David J. DeWitt,et al.  DIRECT - A Multiprocessor Organization for Supporting Relational Database Management Systems , 1979, IEEE Trans. Computers.