Integrating selected fault masking and self-diagnosis mechanisms

Fault Detection and Isolation (FDI) using self-diagnosing elements is used in distributed systems to facilitate system monitoring and maintenance. Voting algorithms are used to increase reliability by masking faults. This research addresses the integration of self-diagnosing elements and voting algorithms in a Triple Modular Redundant (TMR) configuration. A self-diagnosing element model is presented and the problem of masking element faults is considered. A range of integrated (or hybrid) voters which handle diagnosis information are introduced. Experimental results show that the appropriate use of diagnosis in a fault masking system enables the voter to select more correct results than voting on element results values alone. Three categories of architecture are proposed which have ability to mask certain self-diagnosing element common error modes. This approach results in a modest improvement in the safety and reliability levels of the application. The hybrid voters proposed capture certain advantages of both fault masking and fault detection and isolation features.

[1]  Manus P. Henry,et al.  The self-validating sensor: rationale, definitions and examples , 1993 .

[2]  Rolf Isermann,et al.  Supervision, fault-detection and fault-diagnosis methods — An introduction , 1997 .

[3]  Peter J. Fleming,et al.  Implementation of a Smart Sensor Using Analytical Redundancy Techniques , 1997 .

[4]  Manus P. Henry,et al.  A STANDARD INTERFACE FOR SELF-VALIDATING SENSORS , 1992 .

[5]  J.L. Gersting,et al.  A comparison of voting algorithms for n-version programming , 1991, Proceedings of the Twenty-Fourth Annual Hawaii International Conference on System Sciences.

[6]  David Clarke,et al.  Local Sensor Validation , 1989 .

[7]  Richard W. Buskens,et al.  Distributed on-line diagnosis in the presence of arbitrary faults , 1993, FTCS-23 The Twenty-Third International Symposium on Fault-Tolerant Computing.

[8]  R. Ramaswami,et al.  Book Review: Design and Analysis of Fault-Tolerant Digital Systems , 1990 .

[9]  Stuart Bennett,et al.  Complete Disagreement in Redundant Real-Time Control Applications , 1998 .

[10]  Ron J. Patton,et al.  Fault-Tolerant Control: The 1997 Situation , 1997 .

[11]  David Clarke,et al.  Control using self-validating sensors , 1996 .

[12]  Hirokazu Ihara,et al.  Dependable onboard computer systems with a new method-stepwise negotiating voting , 1989, [1989] The Nineteenth International Symposium on Fault-Tolerant Computing. Digest of Papers.

[13]  Hermann Kopetz,et al.  Fault tolerance, principles and practice , 1990 .