An adaptive system-level diagnosis approach for hypercube multiprocessors

This paper proposes a hierarchical adaptive system-level diagnosis approach for hypercube systems. Three measures of diagnosis cost (diagnosis time, number of tests, and number of test links) are analyzed for the proposed algorithm. The diagnosis cost of the algorithm is proved to be lower than that of previous diagnosis algorithms in most fault cases. The cost is shown to depend on the number of faulty units in the system, and it is extremely low when only a small number of units are faulty. The algorithm is also shown to cost less than even a pessimistic diagnosis algorithm, which accepts a lower degree of accuracy in exchange for lower diagnosis cost.
