Free dimensions-an effective approach to achieving fault tolerance in hypercube

In the n-dimensional hypercube, Q/sub n/, for large n, faults can occur with relatively high probability. How to use the inherent redundancy present in the hypercube to obtain fault tolerance is discussed, along with computing in faulty hypercubes. The authors study the fault tolerance independently present in hypercubes by defining and using the concept of free dimensions. Briefly, in Q/sub n/, a dimension is said to be free if no pair of nodes across the dimension link are both faulty. Efficient algorithms are presented for finding free dimensions, given a set of faulty nodes, and it is shown that at least n-f+1 free dimensions exist with f<or=n faulty nodes. Free dimensions can be used to partition Q/sub n/ into subcubes such that each subcube contains at most one fault. Such a partitioning helps in achieving fault tolerance via emulation, embedding, and reconfiguration. It also helps in designing efficient routing and broadcasting algorithms in faulty hypercubes.<<ETX>>

[1]  Frank Thomson Leighton,et al.  Reconfiguring a hypercube in the presence of faults , 1987, STOC.

[2]  W. Daniel Hillis,et al.  The connection machine , 1985 .

[3]  Frank Thomson Leighton,et al.  Coding theory, hypercube embeddings, and fault tolerance , 1991, SPAA '91.

[4]  Ming-Syan Chen,et al.  Depth-First Search Approach for Fault-Tolerant Routing in Hypercube , 1990, IEEE Trans. Parallel Distributed Syst..

[5]  Prithviraj Banerjee Strategies for reconfiguring hypercubes under faults , 1990, [1990] Digest of Papers. Fault-Tolerant Computing: 20th International Symposium.

[6]  Bernd Becker,et al.  How robust is the n-cube? , 1986, 27th Annual Symposium on Foundations of Computer Science (sfcs 1986).

[7]  Rami Melhem,et al.  Distributed Fault Tolerant Embedding of Binary Trees and Rings in Hypercubes , 1989 .

[8]  John P. Hayes,et al.  An automorphic approach to the design of fault-tolerant multiprocessors , 1989, [1989] The Nineteenth International Symposium on Fault-Tolerant Computing. Digest of Papers.

[9]  Wei-Tek Tsai,et al.  An efficient multi-dimensional grids reconfiguration algorithm on hypercube , 1988, [1988] The Eighteenth International Symposium on Fault-Tolerant Computing. Digest of Papers.

[10]  Parameswaran Ramanathan,et al.  Reliable Broadcast in Hypercube Multicomputers , 1988, IEEE Trans. Computers.

[11]  Tze Chiang Lee Quick Recovery of Embedded Structures in Hypercube Computers , 1990, Proceedings of the Fifth Distributed Memory Computing Conference, 1990..

[12]  Krzysztof Diks,et al.  Optimal broadcasting in faulty hypercubes , 1991, [1991] Digest of Papers. Fault-Tolerant Computing: The Twenty-First International Symposium.

[13]  Jehoshua Bruck,et al.  Running algorithms efficiently on faulty hypercubes , 1990, SPAA '90.

[14]  Mee Yee Chan,et al.  Fault-Tolerant Embedding of Complete Binary Trees in Hypercubes , 1993, IEEE Trans. Parallel Distributed Syst..

[15]  John P. Hayes,et al.  Architecture of a Hypercube Supercomputer , 1986, ICPP.

[16]  Sartaj Sahni,et al.  Optimal BPC Permutations on a Cube Connected SIMD Computer , 1982, IEEE Transactions on Computers.

[17]  Cauligi S. Raghavendra,et al.  Embedding and reconfiguration of binary trees in faulty hypercubes , 1992, Proceedings Sixth International Parallel Processing Symposium.

[18]  J. P. Hayes,et al.  Routing and broadcasting in faulty hypercube computers , 1988, C3P.

[19]  Cauligi S. Raghavendra,et al.  Embedding of Multidimensional Meshes on to Faulty Hypercubes , 1991, ICPP.

[20]  F. Harary,et al.  A survey of the theory of hypercube graphs , 1988 .

[21]  S. F. Nugent,et al.  The iPSC/2 direct-connect communications technology , 1988, C3P.

[22]  M. H. Schultz,et al.  Topological properties of hypercubes , 1988, IEEE Trans. Computers.