From dual-cube to metacube: efficient low-degree alternatives to hypercube

The hypercube has been widely used as the interconnection network in parallel computers. However, when dealing with the parallel computers of very large scale, the port limitation due to the technology greatly forbid the use of hypercube networks. The hypercube-based SGI Origin2000, a newly developed multiprocessor system, tried to solve this problem by introducing a Cray router. In this paper we first describe a hypercube-like network, called dual-cube, that was motivated by the structure of Origin2000. A dual-cube DC(m) has m+1 links per node where m is the degree of a cluster (m-cube), one more link is used for connecting to a node in another cluster. The dual-cube mitigates the problem of port limitation in the large-scale hypercube network while keeps most of the topological properties of the hypercube network. Then, we describe an interconnection network that extends dual-cube into a more general network called metacube. The metacube has a two-level cube structure with two parameters representing the dimensions of the two-level cubes. Metacube is much more flexible than dual-cube and can solve the port limitation problem completely. The dual-cube and metacube networks can be applied to SGI Origin2000 to connect large number of processors without using Cray router.

[1]  Shietung Peng,et al.  DUAL-CUBES: A NEW INTERCONNECTION NETWORK FOR HIGH-PERFORMANCE COMPUTER CLUSTERS , 2000 .

[2]  Franco P. Preparata,et al.  The cube-connected-cycles: A versatile network for parallel computation , 1979, 20th Annual Symposium on Foundations of Computer Science (sfcs 1979).

[3]  Hyeong-Ah Choi,et al.  Circuit-Switched Broadcasting in Torus and Mesh Networks , 1996, IEEE Trans. Parallel Distributed Syst..

[4]  Leonard Kleinrock,et al.  Virtual Cut-Through: A New Computer Communication Switching Technique , 1979, Comput. Networks.

[5]  Shahram Latifi,et al.  Properties and Performance of Folded Hypercubes , 1991, IEEE Trans. Parallel Distributed Syst..

[6]  Kanad Ghose,et al.  Hierarchical Cubic Networks , 1995, IEEE Trans. Parallel Distributed Syst..

[7]  D. Lenoski,et al.  The SGI Origin: A ccnuma Highly Scalable Server , 1997, Conference Proceedings. The 24th Annual International Symposium on Computer Architecture.

[8]  J. Laudon,et al.  The SGI Origin 2000: A CCNUMA highly Scaleble Server , 1997, ISCA 1997.

[9]  Arthur M. Farley Minimum-time line broadcast networks , 1980, Networks.

[10]  Shietung Peng,et al.  Optimal Algorithms for Node-to-Node Fault Tolerant Routing in Hypercubes , 1996, Comput. J..

[11]  Joseph G. Peters,et al.  Circuit-Switched Broadcasting in Torus Networks , 1996, IEEE Trans. Parallel Distributed Syst..

[12]  Lionel M. Ni,et al.  Multicast in hypercube multiprocessors , 1988, Seventh Annual International Phoenix Conference on Computers an Communications. 1988 Conference Proceedings.

[13]  Trevor Mudge,et al.  Hypercube supercomputers , 1989, Proc. IEEE.

[14]  L. W. Tucker,et al.  Architecture and applications of the Connection Machine , 1988, Computer.

[15]  Philip K. McKinley,et al.  Communication issues in parallel computing across ATM networks , 1994, IEEE Parallel & Distributed Technology: Systems & Applications.

[16]  Kemal Efe,et al.  The Crossed Cube Architecture for Parallel Computation , 1992, IEEE Trans. Parallel Distributed Syst..

[17]  George Karypis,et al.  Introduction to Parallel Computing , 1994 .

[18]  Philip K. McKinley,et al.  Collective Communication in Wormhole-Routed Massively Parallel Computers , 1995, Computer.

[19]  Lionel M. Ni,et al.  A survey of wormhole routing techniques in direct networks , 1993, Computer.

[20]  Sudhakar Yalamanchili,et al.  Interconnection Networks: An Engineering Approach , 2002 .

[21]  S. Lennart Johnsson,et al.  Optimum Broadcasting and Personalized Communication in Hypercubes , 1989, IEEE Trans. Computers.

[22]  Shietung Peng,et al.  Efficient Collective Communications in Dual-Cube , 2004, The Journal of Supercomputing.

[23]  Sotirios G. Ziavras RH: A Versatile Family of Reduced Hypercube Interconnection Networks , 1994, IEEE Trans. Parallel Distributed Syst..

[24]  E. Barszcz,et al.  Profiling the communication workload of an iPSC/860 , 1994, Proceedings of IEEE Scalable High Performance Computing Conference.