Fault Injection: A Method for Validating Computer-System Dependability

A fault tolerant computer system's dependability must be validated to ensure that its redundancy has been correctly implemented and the system will provide the desired level of reliable service. Fault injection-the deliberate insertion of faults into an operational system to determine its response offers an effective solution to this problem. We survey several fault injection studies and discuss tools such as React (Reliable Architecture Characterization Tool) that facilitate its application. >

[1]  Volkmar Sieh,et al.  Combining software-implemented and simulation-based fault injection into a single fault injection method , 1995, Twenty-Fifth International Symposium on Fault-Tolerant Computing. Digest of Papers.

[2]  Daniel P. Siewiorek,et al.  FIAT-fault injection based automated testing environment , 1988, [1988] The Eighteenth International Symposium on Fault-Tolerant Computing. Digest of Papers.

[3]  Johan Karlsson,et al.  TWO FAULT INJECTION TECHNIQUES FOR TEST OF FAULT HANDLING MECHANISMS , 1991, 1991, Proceedings. International Test Conference.

[4]  Ravishankar K. Iyer,et al.  DEPEND: a design environment for prediction and evaluation of system dependability , 1990, 9th IEEE/AIAA/NASA Conference on Digital Avionics Systems.

[5]  Ann C. Merenda,et al.  Recovery/serviceability system test improvements for the IBM ES/9000 520 based models , 1992, [1992] Digest of Papers. FTCS-22: The Twenty-Second International Symposium on Fault-Tolerant Computing.

[6]  Daniel P. Siewiorek,et al.  Effects of transient gate-level faults on program behavior , 1990, [1990] Digest of Papers. Fault-Tolerant Computing: 20th International Symposium.

[7]  Ravishankar K. Iyer,et al.  A Simulation-Based Study of a Triple Modular Redundant System Using DEFEND , 1991, Fault-Tolerant Computing Systems.

[8]  Jean Arlat,et al.  Fault Injection for Dependability Validation: A Methodology and Some Applications , 1990, IEEE Trans. Software Eng..

[9]  Jacob A. Abraham,et al.  Dependability evaluation using hybrid fault/error injection , 1995, Proceedings of 1995 IEEE International Computer Performance and Dependability Symposium.

[10]  Farnam Jahanian,et al.  Testing of fault-tolerant and real-time distributed systems via protocol fault injection , 1996, Proceedings of Annual Symposium on Fault Tolerant Computing.

[11]  Ravishankar K. Iyer,et al.  DEPEND: A Simulation-Based Environment for System Level Dependability Analysis , 1997, IEEE Trans. Computers.

[12]  Ravishankar K. Iyer,et al.  FINE: A Fault Injection and Monitoring Environment for Tracing the UNIX System Behavior under Faults , 1993, IEEE Trans. Software Eng..

[13]  J. A. Clark,et al.  REACT: a synthesis and evaluation tool for fault-tolerant multiprocessor architectures , 1993, Annual Reliability and Maintainability Symposium 1993 Proceedings.

[14]  Ravishankar K. Iyer,et al.  Measurement-Based Analysis of Error Latency , 1987, IEEE Transactions on Computers.

[15]  Barry W. Johnson,et al.  System-level modeling in the ADEPT environment of a distributed computer system for real-time applications , 1995, Proceedings of 1995 IEEE International Computer Performance and Dependability Symposium.

[16]  Jacob A. Abraham,et al.  FERRARI: a tool for the validation of system dependability properties , 1992, [1992] Digest of Papers. FTCS-22: The Twenty-Second International Symposium on Fault-Tolerant Computing.

[17]  Ravishankar K. Iyer,et al.  FOCUS: An Experimental Environment for Fault Sensitivity Analysis , 1992, IEEE Trans. Computers.

[18]  Ram Chillarege,et al.  Understanding large system failures-a fault injection experiment , 1989, [1989] The Nineteenth International Symposium on Fault-Tolerant Computing. Digest of Papers.

[19]  Ravishankar K. Iyer,et al.  An Experimental Study of Memory Fault Latency , 1989, IEEE Trans. Computers.

[20]  Daniel P. Siewiorek,et al.  A Methodology for the Rapid Injection of Transient Hardware Errors , 1996, IEEE Trans. Computers.

[21]  Johan Karlsson,et al.  Evaluation of error detection schemes using fault injection by heavy-ion radiation , 1989, [1989] The Nineteenth International Symposium on Fault-Tolerant Computing. Digest of Papers.