The Byzantine hardware fault model

A new fault model for temporary failures is presented. This model is motivated and supported by recent experimental studies on types of temporary failures which cannot be explained by existing models. This new fault is called a Byzantine fault by analogy with the well-known Byzantine Generals problem in distributed systems. An example of an important type of Byzantine fault called a short transient is analyzed. The effects of Byzantine faults on concurrent error checking circuits are discussed. Design techniques to eliminate the effects of Byzantine faults are presented. >

[1]  Zvonko G. Vranesic,et al.  On Fault Detection in CMOS Logic Networks , 1983, 20th Design Automation Conference Proceedings.

[2]  Edward J. McCluskey,et al.  An Experiment on Intermittent-Failure Mechanisms , 1986, ITC.

[3]  Carl V. Page,et al.  Intermittent Faults: A Model and a Detection Procedure , 1974, IEEE Transactions on Computers.

[4]  Y. Savaria,et al.  Soft-error filtering: A solution to the reliability problem of future VLSI digital circuits , 1986, Proceedings of the IEEE.

[5]  Melvin A. Breuer,et al.  Testing for Intermittent Faults in Digital Circuits , 1973, IEEE Transactions on Computers.

[6]  John M. Acken Testing for Bridging Faults (Shorts) in CMOS Circuits , 1983, 20th Design Automation Conference Proceedings.

[7]  M. Y. Hsiao,et al.  Model for Transient and Permanent Error-Detection and Fault-Isolation Coverage , 1982, IBM J. Res. Dev..

[8]  Pramod K. Varshney,et al.  On Analytical Modeling of Intermittent Faults in Digital Systems , 1979, IEEE Transactions on Computers.

[9]  Probability of error in combinational logic systems containing soft fails , 1983 .

[10]  W. W. Peterson,et al.  Error-Correcting Codes. , 1962 .

[11]  Leonard R. Marino,et al.  General theory of metastable operation , 1981, IEEE Transactions on Computers.

[12]  M. Ball,et al.  Effects and detection of intermittent failures in digital systems , 1969, AFIPS '69 (Fall).

[13]  Charles F. Hawkins,et al.  Electrical Characteristics and Testing Considerations for Gate Oxide Shorts in CMOS ICs , 1985, ITC.

[14]  James E. Smith,et al.  Strongly Fault Secure Logic Networks , 1978, IEEE Transactions on Computers.

[15]  Gernot Metze,et al.  Design of Totally Self-Checking Check Circuits for m-Out-of-n Codes , 1973, IEEE Transactions on Computers.

[16]  W. C. Carter Hardware fault tolerance , 1986 .

[17]  Samir Kamal,et al.  An Approach to the Diagnosis of Intermittent Faults , 1975, IEEE Transactions on Computers.

[18]  F. Joel Ferguson Book Review: Logic Design Principles by Edward J. McCluskey: Prentice-Hall Publishers, Englewood Cliffs, New Jersey, 549 pp., $39.95 , 1988, CARN.

[19]  Israel Koren,et al.  A Continuous-Parameter Markov Model and Detection Procedures for Intermittent Faults , 1978, IEEE Transactions on Computers.

[20]  Leslie Lamport,et al.  The Byzantine Generals Problem , 1982, TOPL.