Reliability models for computer systems: An overview including dataflow graphs

The reliability of a system is the probability that the system will perform its intended mission under given conditions. This paper provides an overview of the approaches to reliability modelling and identifies their strengths and weaknesses. The models discussed include structure models, simple stochastic models and decomposable stochastic models. Ignoring time-dependence, structure models give reliability as a function of the topological structure of the system. Simple stochastic models make direct use of the properties of underlying stochastic processes, while decomposable models consider more complex systems and analyse them through subsystems. Petri nets and dataflow graphs facilitate the analysis of complex systems by providing a convenient framework for reliability analysis.

[1]  William Feller,et al.  An Introduction to Probability Theory and Its Applications , 1967 .

[2]  Avinash Agrawal,et al.  A Survey of Network Reliability and Domination Theory , 1984, Oper. Res..

[3]  R. Karp,et al.  Properties of a model for parallel computations: determinacy , 1966 .

[4]  U. Montanari,et al.  A Boolean algebra method for computing the terminal reliability in a communication network , 1973 .

[5]  Andrew W. Shogan Sequential Bounding of the Reliability of a Stochastic Network , 1976, Oper. Res..

[6]  A. Satyanarayana,et al.  New Topological Formula and Rapid Algorithm for Reliability Analysis of Complex Networks , 1978 .

[7]  J. Abraham An Improved Algorithm for Network Reliability , 1979, IEEE Transactions on Reliability.

[8]  Krishna M. Kavi,et al.  Reliability Analysis of Computer Systems Using Dataflow Graph Models , 1986, IEEE Transactions on Reliability.

[9]  Jan Magott,et al.  Performance Evaluation of Concurrent Systems Using Petri Nets , 1984, Inf. Process. Lett..

[10]  K.B. Misra,et al.  A Fast Algorithm for Reliability Evaluation , 1975, IEEE Transactions on Reliability.

[11]  K. Mani Chandy,et al.  Approximate Analysis of General Queuing Networks , 1975, IBM J. Res. Dev..

[12]  N. L. Lawrie,et al.  Mathematical Computer Performance and Reliability , 1984 .

[13]  Way Kuo,et al.  System Effectiveness Models: An Annotated Bibliography , 1980, IEEE Transactions on Reliability.

[14]  Joseph Sifakis Performance Evaluation of Systems Using Nets , 1979, Advanced Course: Net Theory and Applications.

[15]  Krishna M. Kavi,et al.  A Formal Definition of Data Flow Graph Models , 1986, IEEE Transactions on Computers.

[16]  Peter Radford,et al.  Petri Net Theory and the Modeling of Systems , 1982 .

[17]  S.H. Lee,et al.  Reliability Evaluation of a Flow Network , 1980, IEEE Transactions on Reliability.

[18]  Mark K. Chang,et al.  Network reliability and the factoring theorem , 1983, Networks.

[19]  Raymond E. Miller,et al.  A Comparison of Some Theoretical Models of Parallel Computation , 1973, IEEE Transactions on Computers.

[20]  A. Rosenthal Computing the Reliability of Complex Networks , 1977 .

[21]  Ashok Kumar,et al.  A Review of Standby Redundant Systems , 1980, IEEE Transactions on Reliability.

[22]  C. V. Ramamoorthy,et al.  Performance Evaluation of Asynchronous Concurrent Systems Using Petri Nets , 1980, IEEE Transactions on Software Engineering.

[23]  Terry Williams,et al.  Probability and Statistics with Reliability, Queueing and Computer Science Applications , 1983 .

[24]  Nick Roussopoulos,et al.  Timing Requirements for Time-Driven Systems Using Augmented Petri Nets , 1983, IEEE Transactions on Software Engineering.

[25]  Marco Ajmone Marsan,et al.  A class of generalized stochastic Petri nets for the performance evaluation of multiprocessor systems , 1984, TOCS.

[26]  Kishor S. Trivedi,et al.  Ultrahigh Reliability Prediction for Fault-Tolerant Computer Systems , 1983, IEEE Transactions on Computers.

[27]  K. Mani Chandy,et al.  Approximate Methods for Analyzing Queueing Network Models of Computing Systems , 1978, CSUR.

[28]  Michael K. Molloy Performance Analysis Using Stochastic Petri Nets , 1982, IEEE Transactions on Computers.

[29]  A. Satyanarayana,et al.  A New Algorithm for the Reliability Analysis of Multi-Terminal Networks , 1981, IEEE Transactions on Reliability.

[30]  Ching-Lai Hwang,et al.  Availability of Maintained Systems: A State-of-the-Art Survey , 1977 .

[31]  A. Satyanarayana,et al.  A Unified Formula for Analysis of Some Network Reliability Problems , 1982, IEEE Transactions on Reliability.

[32]  Toshio Nakagawa,et al.  Bibliography for Reliability and Availability of Stochastic Systems , 1976 .

[33]  Kishor S. Trivedi Probability and Statistics with Reliability, Queuing, and Computer Science Applications , 1984 .

[34]  William Feller,et al.  An Introduction to Probability Theory and Its Applications , 1951 .

[35]  Michael K. Molloy Discrete Time Stochastic Petri Nets , 1985, IEEE Transactions on Software Engineering.

[36]  Richard E. Barlow,et al.  Statistical Theory of Reliability and Life Testing: Probability Models , 1976 .

[37]  Arnold Kaufmann,et al.  Mathematical Models for the Study of the Reliability of Systems , 1977 .

[38]  Kishor S. Trivedi,et al.  Decomposition in Reliability Analysis of Fault-Tolerant Systems , 1983, IEEE Transactions on Reliability.

[39]  Bernard M. Levin,et al.  Statistical theory of reliability , 1963 .

[40]  Michael K. Molloy,et al.  On the integration of delay and throughput measures in distributed processing models , 1981 .

[41]  Jean-Loup Baer,et al.  Computer systems architecture , 1980 .