Paper information - Iterative identification of fault-prone binaries using in-process metrics

Code churn, the amount of code change taking place within a software unit over time, has been correlated with fault-proneness in software systems. We investigate the use of code churn and static metrics, collected at regular intervals during the development cycle, to predict faults in an iterative, in-process manner. We collected 159 churn and structure metrics from six four-month snapshots of a 1-million-LOC Microsoft product. The number of software faults fixed during each period is recorded per binary module. Using stepwise logistic regression, we create a prediction model that identifies fault-prone binaries using three parameters: code churn (the number of new and changed blocks), class Fan In, and class Fan Out (normalized by lines of code). The iteratively built model is 80.0% accurate at predicting fault-prone and non-fault-prone binaries. These fault-prediction models have the advantage of letting engineers observe how their fault-prediction profile evolves over time.
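The abstract gives neither the fitted coefficients nor the details of the stepwise selection, so the following is only a minimal sketch of the general idea: refit a logistic regression on all snapshots seen so far and score it on the next one. Everything in it is an assumption made for illustration: the synthetic data generator `make_snapshot`, the scaling of the three predictors to [0, 1], the 0.5 decision threshold, and the use of plain gradient descent instead of the authors' stepwise procedure. The numbers it produces say nothing about the paper's 80.0% result.

```python
import math
import random

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def fit_logistic(rows, labels, lr=0.5, epochs=300):
    """Fit weights (bias first) by batch gradient descent on the log-loss."""
    w = [0.0] * (len(rows[0]) + 1)
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for x, y in zip(rows, labels):
            err = sigmoid(w[0] + sum(wi * xi for wi, xi in zip(w[1:], x))) - y
            grad[0] += err
            for j, xj in enumerate(x):
                grad[j + 1] += err * xj
        w = [wi - lr * g / len(rows) for wi, g in zip(w, grad)]
    return w

def predict(w, x, threshold=0.5):
    """Return True if the binary is classified as fault-prone."""
    return sigmoid(w[0] + sum(wi * xi for wi, xi in zip(w[1:], x))) >= threshold

def make_snapshot(rng, n_binaries=200):
    """Hypothetical data: (churn, fan_in / LOC, fan_out / LOC), each in [0, 1]."""
    rows, labels = [], []
    for _ in range(n_binaries):
        x = [rng.random(), rng.random(), rng.random()]
        # Assumed ground truth: more churn and coupling raise the fault odds.
        p = sigmoid(4.0 * x[0] + 2.0 * x[1] + 2.0 * x[2] - 4.0)
        rows.append(x)
        labels.append(1 if rng.random() < p else 0)
    return rows, labels

def iterative_accuracy(n_snapshots=6, seed=0):
    """Train on snapshots 1..i, test on snapshot i+1, for each i."""
    rng = random.Random(seed)
    snapshots = [make_snapshot(rng) for _ in range(n_snapshots)]
    seen_rows, seen_labels, accuracies = [], [], []
    for i in range(n_snapshots - 1):
        seen_rows += snapshots[i][0]
        seen_labels += snapshots[i][1]
        w = fit_logistic(seen_rows, seen_labels)
        next_rows, next_labels = snapshots[i + 1]
        hits = sum(predict(w, x) == bool(y)
                   for x, y in zip(next_rows, next_labels))
        accuracies.append(hits / len(next_rows))
    return accuracies

if __name__ == "__main__":
    for i, acc in enumerate(iterative_accuracy(), start=2):
        print(f"accuracy on snapshot {i}: {acc:.3f}")
```

With six snapshots the loop yields five accuracy values, one per held-out period, which is the kind of evolving profile the abstract describes engineers observing over time.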

[1] Ping Zhang, et al. Predictors of customer perceived software quality, 2005, Proceedings of the 27th International Conference on Software Engineering (ICSE 2005).

[2] Khaled El Emam, et al. The Confounding Effect of Class Size on the Validity of Object-Oriented Metrics, 2001, IEEE Trans. Software Eng.

[3] Alfred V. Aho, et al. Compilers: Principles, Techniques, and Tools, 1986, Addison-Wesley series in computer science / World student series edition.

[4] Audris Mockus, et al. Drivers for Customer Perceived Software Quality, 2005, ICSE 2005.

[5] A. von Mayrhauser, et al. Code decay analysis of legacy software through successive releases, 1999, 1999 IEEE Aerospace Conference Proceedings.

[6] Mladen A. Vouk, et al. Some issues in multi-phase software reliability modeling, 1993, CASCON.

[7] Chris F. Kemerer, et al. A Metrics Suite for Object Oriented Design, 1994, IEEE Trans. Software Eng.

[8] Brendan Murphy, et al. Using Historical In-Process and Product Metrics for Early Estimation of Software Failures, 2006, 17th International Symposium on Software Reliability Engineering.

[9] Victor R. Basili, et al. A Validation of Object-Oriented Design Metrics as Quality Indicators, 1996, IEEE Trans. Software Eng.

[10] Elaine J. Weyuker, et al. Where the bugs are, 2004, ISSTA '04.

[11] Ramanath Subramanyam, et al. Empirical Analysis of CK Metrics for Object-Oriented Design Complexity: Implications for Software Defects, 2003, IEEE Trans. Software Eng.

[12] Mei-Hwa Chen, et al. An empirical study on object-oriented metrics, 1999, Proceedings of the Sixth International Software Metrics Symposium.

[13] N. Nagappan, et al. Use of relative code churn measures to predict system defect density, 2005, Proceedings of the 27th International Conference on Software Engineering (ICSE 2005).

[14] Lionel C. Briand, et al. Investigating quality factors in object-oriented designs: an industrial case study, 1999, Proceedings of the 1999 International Conference on Software Engineering.

[15] J. Edward Jackson. A User's Guide to Principal Components, 1991.

[16] Harvey P. Siy, et al. Predicting Fault Incidence Using Software Change History, 2000, IEEE Trans. Software Eng.