Software Fault Tolerance

This chapter presents a non-homogeneous Poisson progress reliability model for N-version programming systems. We separate all faults within NVP systems into independent faults and common faults, and model each type of failure as NHPP. We further develop a reliability model for common failures in NVP systems and also present a model for concurrent independent failures in NVP systems. By combining the CF model and the CIF model together, we establish an NHPP reliability model for NVP systems. We also give an example to illustrate how to estimate all unknown parameters by using the maximum likelihood estimation method, and how to compute the variances for all parameter estimates in order to obtain the confidence intervals of NVP system reliability prediction.

[1]  Liming Chen,et al.  N-VERSION PROGRAMMINC: A FAULT-TOLERANCE APPROACH TO RELlABlLlTY OF SOFTWARE OPERATlON , 1995, Twenty-Fifth International Symposium on Fault-Tolerant Computing, 1995, ' Highlights from Twenty-Five Years'..

[2]  Richard E. Fairley,et al.  Software engineering concepts , 1985, McGraw-Hill series in software engineering and technology.

[3]  Xuemei Zhang,et al.  An NHPP Software Reliability Model and Its Comparison , 1997 .

[4]  Michael R. Lyu,et al.  System reliability analysis of an N-version programming application , 1994 .

[5]  Brian Randell System structure for software fault tolerance , 1975 .

[6]  Hoang Pham,et al.  A general imperfect-software-debugging model with S-shaped fault-detection rate , 1999 .

[7]  David F. McAllister,et al.  An Experimental Evaluation of Software Redundancy as a Strategy For Improving Reliability , 1991, IEEE Trans. Software Eng..

[8]  Jeffrey M. Voas,et al.  Reducing uncertainty about common-mode failures , 1997, Proceedings The Eighth International Symposium on Software Reliability Engineering.

[9]  Hoang Pham,et al.  Software reliability models for critical applications , 1991 .

[10]  David F. McAllister,et al.  Fault-Tolerant SoFtware Reliability Modeling , 1987, IEEE Transactions on Software Engineering.

[11]  Lui Sha,et al.  Using Simplicity to Control Complexity , 2001, IEEE Softw..

[12]  Jeffrey M. Voas Fault Tolerance , 2001, IEEE Softw..

[13]  Victor F. Nicola,et al.  Modeling of Correlated Failures and Community Error Recovery in Multiversion Software , 1990, IEEE Trans. Software Eng..

[14]  Hsin-Hui Lin,et al.  Nonhomogeneous Poisson process software-debugging models with linear dependence , 1993 .

[15]  Jean Arlat,et al.  Definition and analysis of hardware- and software-fault-tolerant architectures , 1990, Computer.

[16]  Bev Littlewood,et al.  Conceptual Modeling of Coincident Failures in Multiversion Software , 1989, IEEE Trans. Software Eng..

[17]  Dave E. Eckhardt,et al.  A Theoretical Basis for the Analysis of Multiversion Software Subject to Coincident Errors , 1985, IEEE Transactions on Software Engineering.

[18]  Piotr Jędrzejowicz,et al.  Fault-tolerant programs and their reliability , 1990 .

[19]  Nancy G. Leveson,et al.  An experimental evaluation of the assumption of independence in multiversion programming , 1986, IEEE Transactions on Software Engineering.

[20]  Yiu-Wing Leung,et al.  Maximum likelihood voting for fault-tolerant software with finite output-space , 1995 .

[21]  Hoang Pham Software Reliability , 1999 .

[22]  Jean Arlat,et al.  Reliability growth of fault-tolerant software , 1993 .

[23]  Katerina Goseva-Popstojanova,et al.  Performability modeling of N version programming technique , 1995, Proceedings of Sixth International Symposium on Software Reliability Engineering. ISSRE'95.

[24]  David F. McAllister,et al.  Reliability of voting in fault-tolerant software systems for small output-spaces , 1990 .

[25]  F. Guess,et al.  Exact maximum likelihood estimation using masked system data , 1993 .

[26]  Ann T. Tai,et al.  Performability enhancement of fault-tolerant software , 1993 .

[27]  Xiaolin Teng,et al.  A software-reliability growth model for N-version programming systems , 2002, IEEE Trans. Reliab..

[28]  K Okumoto,et al.  TIME-DEPENDENT ERROR-DETECTION RATE MODEL FOR SOFTWARE AND OTHER PERFORMANCE MEASURES , 1979 .

[29]  Michael R. Lyu,et al.  Improving the N-version programming process through the evolution of a design paradigm , 1993 .