Evaluation of capture-recapture models for estimating the abundance of naturally-occurring defects

Project managers can use capture-recapture models to manage the inspection process by estimating the number of defects present in an artifact and determining whether a reinspection is necessary. Researchers have previously evaluated capture-recapture models on artifacts with a known number of defects. Before applying capture-recapture models in real development, an evaluation of those models on naturally-occurring defects is imperative. The data in this study is drawn from two inspections of real requirements documents (that later guided implementation) created as part of a capstone course (i.e. with naturally occurring defects). The major results show that: a) estimators improve from being negatively biased after one inspection to being positively biased after two inspections, b) the results contradict the earlier result that a model that includes two sources of variation is a significant improvement over models with one source of variation, and c) estimates are useful in determining the need for artifact reinspection.

[1]  Anne Chao,et al.  Estimating Animal Abundance with Capture Frequency Data , 1988 .

[2]  Claes Wohlin,et al.  Capture-recapture in software inspections after 10 years research--theory, evaluation and application , 2004, J. Syst. Softw..

[3]  Jeffrey C. Carver,et al.  Requirement error abstraction and classification: an empirical study , 2006, ISESE '06.

[4]  Paul S. F. Yip,et al.  A Martingale Estimating Equation for a Capture-Recapture Experiment in Discrete Time , 1991 .

[5]  Lionel C. Briand,et al.  A Comprehensive Evaluation of Capture-Recapture Models for Estimating Software Defect Content , 2000, IEEE Trans. Software Eng..

[6]  Khaled El Emam,et al.  The application of subjective estimates of effectiveness to controlling software inspections , 2000, J. Syst. Softw..

[7]  Per Runeson,et al.  Confidence intervals for capture-recapture estimations in software inspections , 2002, Inf. Softw. Technol..

[8]  A Chao,et al.  Estimating population size via sample coverage for closed capture-recapture models. , 1994, Biometrics.

[9]  Claes Wohlin,et al.  An experimental evaluation of capture‐recapture in software inspections , 1995, Softw. Test. Verification Reliab..

[10]  David R. Anderson,et al.  Capture-Recapture and Removal Methods for Sampling Closed Populations , 1983 .

[11]  David R. Anderson,et al.  Statistical inference from capture data on closed animal populations , 1980 .

[12]  Khaled El Emam,et al.  Evaluating Capture-Recapture Models with Two Inspectors , 2001, IEEE Trans. Software Eng..

[13]  A. Frank Ackerman,et al.  Software inspections: an effective verification process , 1989, IEEE Software.

[14]  Claes Wohlin,et al.  An Experimental Evaluation of an Experience-Based Capture-Recapture Method in Software Code Inspections , 1998, Empirical Software Engineering.

[15]  Jeffrey C. Carver,et al.  The effect of the number of inspectors on the defect estimates produced by capture-recapture models , 2008, 2008 ACM/IEEE 30th International Conference on Software Engineering.

[16]  A. Chao Estimating the population size for capture-recapture data with unequal catchability. , 1987, Biometrics.

[17]  Claes Wohlin,et al.  Defect content estimations from review data , 1998, Proceedings of the 20th International Conference on Software Engineering.

[18]  K. Burnham,et al.  Estimation of the size of a closed population when capture probabilities vary among animals , 1978 .

[19]  J. Darroch THE MULTIPLE-RECAPTURE CENSUS I. ESTIMATION OF A CLOSED POPULATION , 1958 .

[20]  Stephen G. Eick,et al.  Estimating software fault content before coding , 1992, International Conference on Software Engineering.

[21]  Lawrence G. Votta,et al.  Assessing Software Designs Using Capture-Recapture Methods , 1993, IEEE Trans. Software Eng..

[22]  David R. Anderson,et al.  Capture-Recapture and Removal Methods for Sampling Closed Populations , 1983 .

[23]  David R. Anderson,et al.  Capture-Recapture and Removal Methods for Sampling Closed Populations. , 1984 .

[24]  John D. Musa,et al.  Software reliability - measurement, prediction, application , 1987, McGraw-Hill series in software engineering and technology.

[25]  James Miller,et al.  Estimating the number of remaining defects after inspection , 1999, Softw. Test. Verification Reliab..