Hierarchical approach to accurate fault modeling for system evaluation

This paper presents a hierarchical simulation methodology that enables accurate system evaluation under realistic faults and conditions. In this methodology effects of low-level (i.e., transistor or circuit levels) faults are propagated to higher levels (i.e., system level) using fault dictionaries. The primary fault model is obtained via simulation of the transistor-level effect of a radiation particle penetrating the device. The resulting current burst is used as a fault model in the circuit-level simulation and is injected into the nodes of a circuit/subcircuit. The latched outputs are collected in a fault dictionary and applied in conducting fault injection at the chip level under a selected workload. Faults injected at the chip level result in memory corruption, which is used as a fault model in the system-level simulation. When an application terminates, either normally or abnormally, the overall fault impact on the software behavior is quantified and analyzed. The simulation method is demonstrated and validated in the case study of Myrinet, a commercial, high-speed network. The study shows that the proposed approach offers a high confidence in the evaluation results, as the system is analyzed in presence of realistic fault conditions. It also demonstrates that the conducted analysis can be used to improve system dependability by identifying recovery mechanisms for failures observed during the experiments.

[1]  Ravishankar K. Iyer,et al.  Device-level transient fault modeling , 1994, Proceedings of IEEE 24th International Symposium on Fault- Tolerant Computing.

[2]  Ravishankar K. Iyer,et al.  FAMAS: FAult Modeling via Adaptive Simulation , 1997, Proceedings Tenth International Conference on VLSI Design.

[3]  Ravishankar K. Iyer,et al.  FOCUS: An Experimental Environment for Fault Sensitivity Analysis , 1992, IEEE Trans. Computers.

[4]  Ravishankar K. Iyer,et al.  Experimental analysis of computer system dependability , 1996 .

[5]  G. C. Messenger,et al.  Collection of Charge on Junction Nodes from Ion Tracks , 1982, IEEE Transactions on Nuclear Science.

[6]  Johan Karlsson,et al.  Fault injection into VHDL models: the MEFISTO tool , 1994 .

[7]  Ravishankar K. Iyer,et al.  DEPEND: A Simulation-Based Environment for System Level Dependability Analysis , 1997, IEEE Trans. Computers.

[8]  Fred L. Yang,et al.  Simulation of faults causing analog behavior in digital circuits , 1992 .

[9]  Barry W. Johnson,et al.  System-level modeling in the ADEPT environment of a distributed computer system for real-time applications , 1995, Proceedings of 1995 IEEE International Computer Performance and Dependability Symposium.

[10]  R. R. O'Brien,et al.  Dynamics of Charge Collection from Alpha-Particle Tracks in Integrated Circuits , 1981, 19th International Reliability Physics Symposium.

[11]  J. A. Clark,et al.  REACT: a synthesis and evaluation tool for fault-tolerant multiprocessor architectures , 1993, Annual Reliability and Maintainability Symposium 1993 Proceedings.