Modeling of common-mode failures in digital embedded systems

This paper demonstrates how to accurately model the effects of common mode failures for digital embedded systems. By modeling the system's information flow, the integrated nature of the software and hardware components contained within such a system is represented. This modeling scheme allows for the system to be partitioned into error containment regions (ECRs), which are an extension of the fault containment region (FCR) concept. These ECRs are defined such that an error at their boundary results in system failure. If two or more ECRs produce errors at their boundaries and the underlying cause of these errors is identical, then the identification of common mode failures is achieved.

[1]  George Apostolakis,et al.  Demonstration of the Dynamic Flowgraph Methodology using the Titan II Space Launch Vehicle Digital Flight Control System , 1993 .

[2]  Salvatore J. Bavuso,et al.  Dynamic fault-tree models for fault-tolerant computer systems , 1992 .

[3]  E. W. Hagen,et al.  Common-mode/common-cause failure: A review , 1980 .

[4]  G. Romanski,et al.  Dynamic modeling and verification of safe-set architectures , 1996, Wescon/96.

[5]  Chris Garrett,et al.  Assessing the Dependability of Embedded Software Systems Using the Dynamic Flowgraph Methodology , 1995 .

[6]  J. H. Lala,et al.  Architectural principles for safety-critical real-time applications , 1994, Proc. IEEE.

[7]  M. Modarres What every engineer should know about reliability and risk analysis , 1992 .

[8]  Sergio B. Guarro,et al.  The use of prime implicants in dependability analysis of software controlled systems , 1998 .

[9]  Sergio Guarro,et al.  The logic flowgraph: A new approach to process failure modeling and diagnosis for disturbance analysis applications , 1984 .

[10]  Barry W. Johnson Design & analysis of fault tolerant digital systems , 1988 .

[11]  Michael Yau,et al.  Development of tools for safety analysis of control software in advanced reactors , 1996 .

[12]  Edward J. McCluskey,et al.  Common-mode failures in redundant VLSI systems: a survey , 2000, IEEE Trans. Reliab..

[13]  Jeffrey M. Voas,et al.  Reducing uncertainty about common-mode failures , 1997, Proceedings The Eighth International Symposium on Software Reliability Engineering.

[14]  G. Apostolakis,et al.  The Use of the Dynamic Flowgraph Methodology in Modeling Human Performance and Team Effects , 1996 .

[15]  George E. Apostolakis,et al.  The dynamic flowgraph methodology for assessing the dependability of embedded software systems , 1995, IEEE Trans. Syst. Man Cybern..

[16]  W.C. Gangloff,et al.  Common mode failure analysis , 1975, IEEE Transactions on Power Apparatus and Systems.

[17]  B. Dhillion,et al.  Reliability engineering in systems design and operation , 1983 .