论文信息 - An investigation of the effect of module size on defect prediction using static measures

An investigation of the effect of module size on defect prediction using static measures

We used several machine learning algorithms to predict the d ef ctive modules in five NASA products, namely, CM1, JM1, KC1, KC2, and PC1. A set of static measures were used as predictor variables. While doing so, we observed that a large porti on of the modules were small, as measured by lines of code (LOC). When we experimented on the data subsets created by partitio ning according to module size, we obtained higher prediction per formance for the subsets that include larger modules. We also pe rformed defect prediction using class-level data for KC1 rat her han method-level data. In this case, the use of class-level data resulted in improved prediction performance compared to using metho dlevel data. These findings suggest that quality assurance ac tivities can be guided even better if defect predictions are made by us ing data that belong to larger modules.

Hongfang Liu | Akif Günes Koru

[1] Tim Menzies,et al. Mining Repositories to Assist in Project Planning and Resource Allocation , 2004, MSR.

[2] John C. Munson,et al. The effects of fault counting methods on fault model quality , 2004, Proceedings of the 28th Annual International Computer Software and Applications Conference, 2004. COMPSAC 2004..

[3] Sallie M. Henry,et al. Software Structure Metrics Based on Information Flow , 1981, IEEE Transactions on Software Engineering.

[4] Khaled El Emam,et al. The Confounding Effect of Class Size on the Validity of Object-Oriented Metrics , 2001, IEEE Trans. Software Eng..

[5] Taghi M. Khoshgoftaar,et al. Early Quality Prediction: A Case Study in Telecommunications , 1996, IEEE Softw..

[6] Leo Breiman,et al. Classification and Regression Trees , 1984 .

[7] Martin Shepperd,et al. Derivation and Validation of Software Metrics , 1993 .

[8] Abhijit S. Pandya,et al. Application of neural networks for predicting program faults , 1995, Ann. Softw. Eng..

[9] Jeff Tian,et al. Experience with identifying and characterizing problem-prone modules in telecommunication software systems , 2001, J. Syst. Softw..