Dynamic fault-tree models for fault-tolerant computer systems

Reliability analysis of fault-tolerant computer systems for critical applications is complicated by several factors. Systems designed to achieve high levels of reliability frequently employ high levels of redundancy, dynamic redundancy management, and complex fault and error recovery techniques. This paper describes dynamic fault-tree modeling techniques for handling these difficulties. Three advanced fault-tolerant computer systems are described: a fault-tolerant parallel processor, a mission avionics system, and a fault-tolerant hypercube. Fault-tree models for their analysis are presented. HARP (Hybrid Automated Reliability Predictor) is a software package developed at Duke University and NASA Langley Research Center that can solve those fault-tree models. >

[1]  Joanne Bechta Dugan,et al.  Fault trees and imperfect coverage , 1989 .

[2]  M. A. Boyd,et al.  Fault tree models for fault tolerant hypercube multiprocessors , 1991, Annual Reliability and Maintainability Symposium. 1991 Proceedings.

[3]  Kishor S. Trivedi,et al.  Coverage Modeling for Dependability Analysis of Fault-Tolerant Systems , 1989, IEEE Trans. Computers.

[4]  Arthur L. Liestman,et al.  A proposal for a fault-tolerant binary hypercube architecture , 1989, [1989] The Nineteenth International Symposium on Fault-Tolerant Computing. Digest of Papers.

[5]  Dirk Grunwald,et al.  Hyperswitch network for the hypercube computer , 1988, ISCA '88.

[6]  Kishor S. Trivedi,et al.  The hybrid automated reliability predictor , 1986 .

[7]  J.B. Fussell,et al.  On the Quantitative Analysis of Priority-AND Failure Logic , 1976, IEEE Transactions on Reliability.

[8]  Salvatore J. Bavuso,et al.  Fault trees and sequence dependencies , 1990, Annual Proceedings on Reliability and Maintainability Symposium.

[9]  M. Smotherman,et al.  A non-homogeneous Markov model for phased-mission reliability analysis , 1989 .

[10]  M. Thomason,et al.  Boolean Difference Techniques for Time-Sequence and Common-Cause Analysis of Fault-Trees , 1984, IEEE Transactions on Reliability.

[11]  James Daniel. Esary,et al.  Reliability analysis of phased missions. , 1975 .

[12]  Jaynarayan H. Lala,et al.  Fault tolerant parallel processor architecture overview , 1988, [1988] The Eighteenth International Symposium on Fault-Tolerant Computing. Digest of Papers.

[13]  Richard E. Harper RELIABILITY ANALYSIS OF PARALLEL PROCESSING SYSTEMS , 1988 .

[14]  Kishor S. Trivedi,et al.  Analysis of Typical Fault-Tolerant Architectures using HARP , 1987, IEEE Transactions on Reliability.