Better Reliability Assessment and Prediction through Data Clustering

This paper presents a new approach to software reliability modeling by grouping data into clusters of homogeneous failure intensities. This series of data clusters associated with different time segments can be directly used as a piecewise linear model for reliability assessment and problem identification, which can produce meaningful results early in the testing process. The dual model fits traditional software reliability growth models (SRGMs) to these grouped data to provide long-term reliability assessments and predictions. These models were evaluated in the testing of two large software systems from IBM. Compared with existing SRGMs fitted to raw data, our models are generally more stable over time and produce more consistent and accurate reliability assessments and predictions.

[1]  Jeff Tian,et al.  Test workload measurement and reliability analysis for large commercial software systems , 1997, Ann. Softw. Eng..

[2]  Peng Lu,et al.  Test-Execution-Based Reliability Measurement and Modeling for Large Commercial Software , 1995, IEEE Trans. Software Eng..

[3]  Eldred Nelson,et al.  Estimating software reliability from test data , 1978 .

[4]  Calvin L. Williams,et al.  Modern Applied Statistics with S-Plus , 1997 .

[5]  Jean-Claude Laprie,et al.  Trend Analysis , 1996, The SAGE Encyclopedia of Research Design.

[6]  Michael R. Lyu,et al.  Handbook of software reliability engineering , 1996 .

[7]  Kai-Yuan Cai,et al.  Censored software-reliability models , 1997 .

[8]  Z. Jelinski,et al.  Software reliability Research , 1972, Statistical Computer Performance Evaluation.

[9]  Daryl Pregibon,et al.  Tree-based models , 1992 .

[10]  Thomas A. Mazzuchi,et al.  Investigating a specific class of software reliability growth models , 2002, Annual Reliability and Maintainability Symposium. 2002 Proceedings (Cat. No.02CH37318).

[11]  Norman F. Schneidewind,et al.  Analysis of error processes in computer software , 1975, Reliable Software.

[12]  John D. Musa,et al.  Software reliability - measurement, prediction, application , 1987, McGraw-Hill series in software engineering and technology.

[13]  Norman F. Schneidewind,et al.  Software Reliability Model with Optimal Selection of Failure Data , 1993, IEEE Trans. Software Eng..

[14]  Amrit L. Goel,et al.  Software Reliability Models: Assumptions, Limitations, and Applicability , 1985, IEEE Transactions on Software Engineering.

[15]  John D. Musa,et al.  Software reliability measurement , 1984, J. Syst. Softw..

[16]  K Okumoto,et al.  TIME-DEPENDENT ERROR-DETECTION RATE MODEL FOR SOFTWARE AND OTHER PERFORMANCE MEASURES , 1979 .

[17]  Amrit L. Goel,et al.  Time-Dependent Error-Detection Rate Model for Software Reliability and Other Performance Measures , 1979, IEEE Transactions on Reliability.

[18]  Katerina Goseva-Popstojanova,et al.  Failure correlation in software reliability models , 2000, IEEE Trans. Reliab..

[19]  Bev Littlewood,et al.  A Bayesian Reliability Growth Model for Computer Software , 1973 .

[20]  J. Davenport Editor , 1960 .

[21]  Nozer D. Singpurwalla Software reliability modeling by concatenating failure rates , 1998, Proceedings Ninth International Symposium on Software Reliability Engineering (Cat. No.98TB100257).

[22]  M. Lipow,et al.  Testing for software reliability , 1975, Reliable Software.

[23]  Shigeru Yamada,et al.  S-Shaped Reliability Growth Modeling for Software Error Detection , 1983, IEEE Transactions on Reliability.

[24]  Jeff Tian,et al.  Integrating Time Domain and Input Domain Analyses of Software Reliability Using Tree-Based Models , 1995, IEEE Trans. Software Eng..

[25]  N. L. Johnson,et al.  Multivariate Analysis , 1958, Nature.