A Parallel Distributive Join Algorithm for Cube-Connected Multiprocessors

This paper presents a parallel distributive join algorithm for cube-connected multiprocessors. The performance analysis shows that the proposed algorithm has an almost linear speedup over the sequential distributive join algorithm as the number of processors increases, and its performance is comparable to that of the parallel hybrid-hash join algorithm. A big advantage of the proposed algorithm over hash-based join algorithms is that it does not have the bucket overflow problem caused by nonuniform hashing of the smaller operand relation. Moreover, the proposed algorithm can easily support the nonequijoin operation, which is very hard to implement by using hash-based join algorithms.

[1]  Giuseppe Pelagatti,et al.  Distributive join: a new algorithm for joining relations , 1991, TODS.

[2]  David J. DeWitt,et al.  A performance evaluation of four parallel join algorithms in a shared-nothing multiprocessor environment , 1989, SIGMOD '89.

[3]  Calvin C. Gotlieb,et al.  Order-preserving key transformations , 1986, TODS.

[4]  David Rabinowitz Database computers , 1993 .

[5]  G. C. Fox,et al.  Solving Problems on Concurrent Processors , 1988 .

[6]  Patrick Valduriez,et al.  Join and Semijoin Algorithms for a Multiprocessor Database Machine , 1984, TODS.

[7]  Optical database machine using spatial light rebroadcasters , 1991, [1991] Proceedings. The Fifth International Parallel Processing Symposium.

[8]  Michael Stonebraker,et al.  Future trends in data base systems , 1988, Proceedings. Fourth International Conference on Data Engineering.

[9]  Esen A. Ozkarahan Database machines and database management , 1986 .

[10]  P. Bruce Berra,et al.  A database machine based on concatenated code words for very large databases , 1990 .

[11]  Ophir Frieder,et al.  Multiprocessor algorithms for relational-database operators on hypercube systems , 1990, Computer.

[12]  Kjell Bratbergsengen,et al.  Hashing Methods and Relational Algebra Operations , 1984, VLDB.

[13]  Chiaki Sakama,et al.  Parallel control techniques for dedicated relational database engines , 1987, 1987 IEEE Third International Conference on Data Engineering.

[14]  Soon Myoung Chung Parallel main memory database system , 1992, SAC '92.

[15]  David J. DeWitt,et al.  Database Machines: An Idea Whose Time Passed? A Critique of the Future of Database Machines , 1989, IWDM.

[16]  Edward Omiecinski,et al.  Hash-Based and Index-Based Join Algorithms for Cube and Ring Connected Multicomputers , 1989, IEEE Trans. Knowl. Data Eng..

[17]  David J. DeWitt,et al.  Multiprocessor Hash-Based Join Algorithms , 1985, VLDB.