Recovery Block Reliability Analysis with Failure Clustering

New discrete- and continuous-time models of the recovery block are presented using Markov chain techniques. The time to failure distributions are fully described by their probability generating functions and Laplace transforms respectively. A discrete-time model is developed accounting for the commonly observed clustering of failure points in the input space. The Markov framework is a useful tool to solve this model even though the system’s behaviour is non-Markovian: additional states are artificially introduced to model the degraded state of individual modules.

[1]  Jean-Claude Laprie,et al.  Dependability Evaluation of Software Systems in Operation , 1984, IEEE Transactions on Software Engineering.

[2]  M. S. Bartlett,et al.  An introduction to stochastic processes, with special reference to methods and applications , 1955 .

[3]  William Feller,et al.  An Introduction to Probability Theory and Its Applications , 1967 .

[4]  Sung D. Cha A Recovery Block Model and Its Analysis , 1986 .

[5]  John G. Kemeny,et al.  Finite Markov chains , 1960 .

[6]  Bev Littlewood,et al.  A Reliability Model for Systems with Markov Structure , 1975 .

[7]  Kishor S. Trivedi,et al.  Ultrahigh Reliability Prediction for Fault-Tolerant Computer Systems , 1983, IEEE Transactions on Computers.

[8]  Brian Randell System structure for software fault tolerance , 1975 .

[9]  Hermann Kopetz,et al.  Fault tolerance, principles and practice , 1990 .

[10]  B. Bhat,et al.  SOME PROPERTIES OF REGULAR MARKOV CHAINS , 1961 .

[11]  C. L. Chiang,et al.  Introduction to stochastic processes in biostatistics. , 1968 .

[12]  Kyle Siegrist,et al.  Reliability of Systems with Markov Transfer of Control , 1988, IEEE Trans. Software Eng..

[13]  Leonard Kleinrock,et al.  Queueing Systems: Volume I-Theory , 1975 .

[14]  Kishor S. Trivedi Probability and Statistics with Reliability, Queuing, and Computer Science Applications , 1984 .

[15]  William Feller,et al.  An Introduction to Probability Theory and Its Applications , 1951 .

[16]  Paul Ammann,et al.  Data Diversity: An Approach to Software Fault Tolerance , 1988, IEEE Trans. Computers.