论文信息 - Estimators for fault tolerance coverage evaluation

Estimators for fault tolerance coverage evaluation

The problem of estimating the coverage of a fault tolerance mechanism through statistical processing of observations collected in fault injection experiments is addressed. A formal definition of coverage is given in terms of the fault and activation sets that characterize the input space. Two categories of sampling techniques are considered for coverage estimation: sampling in the whole space and sampling in a space partitioned into classes. The estimators for each technique are compared by means of hypothetical examples. Techniques for early estimations of coverage are then studied. These techniques allow unbiased estimations of coverage to be made before all classes of the sampling space have been tested. Finally, the "no-reply" problem that hampers most practical fault-injection experiments is discussed and an a posteriori stratification technique is proposed that allows the scope of incomplete tests to be widened by accounting for available structural information about the target system.

[1] Thomas F. Arnold,et al. The Concept of Coverage and Its Effect on the Reliability Model of a Repairable System , 1973, IEEE Transactions on Computers.

[2] G. M. Jenkins,et al. Airborne Advanced Reconfigurable Computer System (ARCS) , 1976 .

[3] Jean Arlat,et al. Fault Injection and Dependability Evaluation of Fault-Tolerant Systems , 1993, IEEE Trans. Computers.

[4] Robert S. Swarz,et al. Reliable Computer Systems: Design and Evaluation , 1992 .

[5] Jean Arlat,et al. Experimental evaluation of the fault tolerance of an atomic multicast system , 1990 .

[6] N. L. Johnson,et al. Distributions in Statistics: Discrete Distributions. , 1970 .

[7] Daniel P. Siewiorek,et al. Development of a benchmark to measure system robustness , 1993, FTCS-23 The Twenty-Third International Symposium on Fault-Tolerant Computing.

[8] Siegfried Gabler,et al. Theory and Practice of Sample Surveys , 1994 .

[9] G. W. Snedecor. Statistical Methods , 1964 .

[10] D. A. Rennels,et al. Fault-tolerance experiments with the JPL STAR computer. , 1972 .

[11] Jean Arlat,et al. Fault Injection for Dependability Validation: A Methodology and Some Applications , 1990, IEEE Trans. Software Eng..

[12] Kishor S. Trivedi,et al. Coverage Modeling for Dependability Analysis of Fault-Tolerant Systems , 1989, IEEE Trans. Computers.

[13] W. C. Carter,et al. Reliability modeling techniques for self-repairing computer systems , 1969, ACM '69.