An Improved Generalization of Mesh-Connected Computers with Multiple Buses

Mesh-connected computers (MCCs) are a class of important parallel architectures due to their simple and regular interconnections. However, their performances are restricted by their large diameters. Various augmenting mechanisms have been proposed to enhance the communication efficiency of MCCs. One major approach is to add nonconfigurable buses for improved broadcasting. A typical example is the mesh-connected computer with multiple buses (MMB). We propose a new class of generalized MMBs, the improved generalized MMBs (IMMBs). We compare IMMBs with MMBs and a class of previously proposed generalized MMBs (GMMBs). We show the power of IMMBs by considering semigroup and prefix computations. Specifically, as our main result we show that for any constant 0</spl epsiv/<1, one can construct an N/sup 1/2 //spl times/N/sup 1/2 / square IMMB using which semigroup and prefix computations on N operands can be carried out in O(N/sup /spl epsiv//) time, while maintaining O(1) broadcasting time. Compared with the previous best complexities O(N/sup 1/8 /) and O(N/sup 1/16 /) achieved on a rectangular MMB and GMMB, respectively, for the same computations, our results show that IMMBs are more powerful than MMBs and GMMBs.

[1]  Russ Miller,et al.  Meshes with reconfigurable buses , 1988 .

[2]  David A. Carlson,et al.  Solving Linear Recurrence Systems on Mesh-Connected Computers with Multiple Global Buses , 1990, J. Parallel Distributed Comput..

[3]  Si-Qing Zheng,et al.  An Abstract Model for Optical Interconnection Networks , 1998 .

[4]  Behrooz Parhami,et al.  Optimal Architectures and Algorithms for Mesh-Connected Parallel Computers with Separable Row/Column Buses , 1993, IEEE Trans. Parallel Distributed Syst..

[5]  David Peleg,et al.  Square Meshes are not always Optimal , 1991, IEEE Trans. Computers.

[6]  Selim G. Akl Parallel computation: models and methods , 1997 .

[7]  Kuo-Liang Chung Prefix Computations on a Generalized Mesh-Connected Computer with Multiple Buses , 1995, IEEE Trans. Parallel Distributed Syst..

[8]  Shahid H. Bokhari,et al.  Finding Maximum on an Array Processor with a Global Bus , 1984, IEEE Transactions on Computers.

[9]  Alok Aggarwal,et al.  Optimal Bounds for Finding Maximum on Array of Processors with k Global Buses , 1986, IEEE Transactions on Computers.

[10]  David A. Carlson,et al.  Modified Mesh-Connected Parallel Computers , 1988, IEEE Trans. Computers.

[11]  Viktor K. Prasanna,et al.  Array Processor with Multiple Broadcasting , 1985, ISCA.

[12]  Jang-Ping Sheu,et al.  Designing Efficient Parallel Algorithms on Mech-Connected Computers with Multiple Broadcasting , 1990, IEEE Trans. Parallel Distributed Syst..

[13]  Amotz Bar-Noy,et al.  Square meshes are not always optimal , 1989, SPAA '89.

[14]  Wen-Tsuen Chen,et al.  Efficient Medain Finding and Its Application to Two-Variable Linear Programming on Mesh-Connected Computers with Multiple Broadcasting , 1992, J. Parallel Distributed Comput..

[15]  Guy E. Blelloch,et al.  Prefix sums and their applications , 1990 .

[16]  Stephan Olariu,et al.  Selection on rectangular meshes with multiple broadcasting , 1993, BIT Comput. Sci. Sect..

[17]  Quentin F. Stout,et al.  Mesh-Connected Computers with Broadcasting , 1983, IEEE Transactions on Computers.

[18]  F. Leighton,et al.  Introduction to Parallel Algorithms and Architectures: Arrays, Trees, Hypercubes , 1991 .

[19]  Quentin F. Stout,et al.  Meshes with multiple buses , 1986, 27th Annual Symposium on Foundations of Computer Science (sfcs 1986).

[20]  Charles E. Leiserson,et al.  Fat-trees: Universal networks for hardware-efficient supercomputing , 1985, IEEE Transactions on Computers.

[21]  Kang G. Shin,et al.  Implementation of Decentralized Load Sharing in Networked Workstations Using the Condor Package , 1997, J. Parallel Distributed Comput..

[22]  Si-Qing Zheng,et al.  The Bus-Connected Ringed Tree: A Versatile Interconnection Network , 1996, J. Parallel Distributed Comput..

[23]  S. Lakshmivarahan,et al.  Parallel computing using the prefix problem , 1994 .