System-level fault diagnosis: A survey

Abstract Due to recent advances in technology, the potential applications of the Theory of System-Level Fault Diagnosis have grown. Furthermore, recent breakthroughs on open problems in the theory itself have broadened its applicability. In this paper, we describe the motivations for studying the theory, its applicability to computer systems for achieving fault-tolerance, and the current results.

[1]  Kyung-Yong Chwa,et al.  Schemes for Fault-Tolerant Computing: A Comparison of Modularly Redundant and t-Diagnosable Systems , 1981, Inf. Control..

[2]  Gregory F. Sullivan,et al.  A Polynomial Time Algorithm for Fault Diagnosability , 1984, FOCS.

[3]  Oded Goldreich,et al.  On the np-completeness of certain network testing problems , 1984, Networks.

[4]  Omri Serlin Fault-Tolerant Systems in Commercial Applications , 1984, Computer.

[5]  Arthur D. Friedman,et al.  System-Level Fault Diagnosis , 1980, Computer.

[6]  S. Louis Hakimi,et al.  An Adaptive Algorithm for System Level Diagnosis , 1984, J. Algorithms.

[7]  Krishan K. Sabnani,et al.  The Comparison Approach to Multiprocessor Fault Diagnosis , 1987, IEEE Transactions on Computers.

[8]  S. Louis Hakimi,et al.  Characterization of Connection Assignment of Diagnosable Systems , 1974, IEEE Transactions on Computers.

[9]  Robert S. Swarz,et al.  The theory and practice of reliable system design , 1982 .

[10]  Gerald M. Masson,et al.  An 0(n2.5) Fault Identification Algorithm for Diagnosable Systems , 1984, IEEE Transactions on Computers.

[11]  Gerard G. L. Meyer,et al.  A Diagnosis Algorithm for the BGM System Level Fault Model , 1984, IEEE Transactions on Computers.

[12]  Fabrizio Grandoni,et al.  A Theory of Diagnosability of Digital Systems , 1976, IEEE Transactions on Computers.

[13]  GERNOT METZE,et al.  On the Connection Assignment Problem of Diagnosable Systems , 1967, IEEE Trans. Electron. Comput..

[14]  J. Goldberg,et al.  SIFT: Design and analysis of a fault-tolerant computer for aircraft control , 1978, Proceedings of the IEEE.

[15]  Pavel M. Blecher,et al.  On a logical problem , 1983, Discret. Math..

[16]  Sudhakar M. Reddy,et al.  Distributed fault-tolerance for large multiprocessor systems , 1980, ISCA '80.

[17]  Jon Gregory Kuhl Fault diagnosis in computing networks , 1980 .

[18]  Gerald M. Masson,et al.  Diagnosable Systems for Intermittent Faults , 1978, IEEE Transactions on Computers.

[19]  S. Louis Hakimi,et al.  On Models for Diagnosable Systems and Probabilistic Fault Diagnosis , 1976, IEEE Transactions on Computers.

[20]  Charles R. Kime,et al.  System Fault Diagnosis: Masking, Exposure, and Diagnosability Without Repair , 1975, IEEE Transactions on Computers.

[21]  S. Louis Hakimi,et al.  On Adaptive System Diagnosis , 1984, IEEE Transactions on Computers.

[22]  Charles R. Kime An Analysis Model for Digital System Diagnosis , 1970, IEEE Transactions on Computers.

[23]  Kyung-Yong Chwa,et al.  On Fault Identification in Diagnosable Systems , 1981, IEEE Transactions on Computers.

[24]  James E. Smith,et al.  Diagnosis of Systems with Asymmetric Invalidation , 1981, IEEE Transactions on Computers.

[25]  Che-Liang Yang,et al.  A Fault Identification Algorithm for ti-Diagnosable Systems , 1986, IEEE Transactions on Computers.

[26]  Charles R. Kime,et al.  System Fault Diagnosis: Closure and Diagnosability with Repair , 1975, IEEE Transactions on Computers.