Estimation of the Total Number of Software Failures from Test Data and Code Coverage: A Bayesian Approach

Total number of failures of a software system can help practitioners to have a better understanding of the software quality. In this paper, we propose a model to predict the total number of software failures in a software system by analyzing the failure data from testing using models based on Zipf's law together with the information on code coverage. Failure data and code coverage are combined in a Bayesian way. The methodology is applied to real world failure data to validate its predictability. The predictive accuracy of our model is also evaluated with different methods. The results of our experiment show that our proposed model can provide a very good estimation of the total number of failures. The estimation is stable from a very early point on.

[1]  Harald A. Stieber,et al.  A Family of Software Reliability Growth Models , 2007, 31st Annual International Computer Software and Applications Conference (COMPSAC 2007).

[2]  Harald A. Stieber Statistical quality control: How to detect unreliable software components , 1999 .

[3]  Haifeng Li,et al.  Software reliability modeling based on test coverage , 2011, The Proceedings of 2011 9th International Conference on Reliability, Maintainability and Safety.

[4]  James M. Bieman,et al.  Software reliability growth with test coverage , 2002, IEEE Trans. Reliab..

[5]  J. Bert Keats,et al.  Statistical Methods for Reliability Data , 1999 .

[6]  John D. Musa,et al.  Software reliability - measurement, prediction, application , 1987, McGraw-Hill series in software engineering and technology.

[7]  Harald A. Stieber,et al.  Estimating the Total Number of Software Faults Reliability Models and Mutation Testing a Bayesian Approach , 2015, 2015 IEEE 39th Annual Computer Software and Applications Conference.

[8]  W. Nelson Statistical Methods for Reliability Data , 1998 .