Fast Simulation of Highly Dependable Systems with General Failure and Repair Processes

An approach for simulating models of highly dependable systems with general failure and repair time distribution is described. The approach combines importance sampling with event rescheduling in order to obtain variance reductions in such rare event simulations. The approach is general in nature and allows a variety of features commonly arising in dependability modeling to be simulated effectively. It is shown how the technique can be applied to systems with redundant components and/or periodic maintenance. For different failure time distributions, the effect of the maintenance period on the steady-state availability is explored. The amount of component redundancy needed to achieve a certain reliability level is determined. >

[1]  P. Glynn A GSMP formalism for discrete event systems , 1989, Proc. IEEE.

[2]  Philip Heidelberger,et al.  Measure specific dynamic importance sampling for availability simulations , 1987, WSC '87.

[3]  Philip Heidelberger,et al.  Fast simulation of dependability models with general failure, repair and maintenance processes , 1990, [1990] Digest of Papers. Fault-Tolerant Computing: 20th International Symposium.

[4]  Kishor S. Trivedi,et al.  The hybrid automated reliability predictor , 1986 .

[5]  Kishor S. Trivedi,et al.  Ultrahigh Reliability Prediction for Fault-Tolerant Computer Systems , 1983, IEEE Transactions on Computers.

[6]  P. Haas,et al.  Regenerative generalized semi-markov processes , 1987 .

[7]  Stephen S. Lavenberg,et al.  Modeling and Analysis of Computer System Availability , 1987, Computer Performance and Reliability.

[8]  Jacob A. Abraham,et al.  A Numerical Technique for the Hierarchical Evaluation of Large, Closed Fault-Tolerant Systems , 1992 .

[9]  Erhan Çinlar,et al.  Introduction to stochastic processes , 1974 .

[10]  Jean-Claude Laprie,et al.  The KAT (Knowledge-Action-Transformation) Approach to the Modeling and Evaluation of Reliability and Availability Growth , 1991, IEEE Trans. Software Eng..

[11]  L. Devroye Non-Uniform Random Variate Generation , 1986 .

[12]  Richard R. Muntz,et al.  Bounding Availability of Repairable Computer Systems , 1989, IEEE Trans. Computers.

[13]  Herb Schwetman,et al.  Using CSIM to model complex systems , 1988, 1988 Winter Simulation Conference Proceedings.

[14]  Philip Heidelberger,et al.  A Unified Framework for Simulating Markovian Models of Highly Dependable Systems , 1992, IEEE Trans. Computers.

[15]  Donald L. Iglehart,et al.  Importance sampling for stochastic simulations , 1989 .

[16]  Jean Walrand,et al.  A quick simulation method for excessive backlogs in networks of queues , 1989 .

[17]  Robert Geist,et al.  Ultrahigh reliability estimates through simulation , 1989, Proceedings., Annual Reliability and Maintainability Symposium.

[18]  Kishor S. Trivedi,et al.  A unified performance reliability analysis of a system with a cumulative down time constraint , 1992 .

[19]  Boudewijn R. Haverkort,et al.  Fault Injection Simulation: A Variance Reduction Technique for Systems with Rare Events , 1992 .

[20]  Elmer E Lewis,et al.  Monte Carlo simulation of Markov unreliability models , 1984 .

[21]  P. Glynn,et al.  Varaince reduction in mean time to failure simulations , 1988, 1988 Winter Simulation Conference Proceedings.

[22]  Peter W. Glynn,et al.  Simulation and analysis of highly reliable systems , 1990 .

[23]  M. Nakayama Asymptotics of likelihood ratio derivative estimators in simulations of highly reliable Markovian systems , 1995 .