Empirical analysis of network measures for effort-aware fault-proneness prediction

Context: Recently, network measures have been proposed to predict fault-prone modules. By leveraging the dependency relationships between software entities, network measures describe the structural features of a software system. However, there is no consensus about their effectiveness for fault-proneness prediction. In particular, the predictive ability of network measures in an effort-aware context has not been addressed.

Objective: We aim to provide a comprehensive evaluation of the predictive effectiveness of network measures, taking into account the effort needed to inspect the code.

Method: We first constructed software source code networks for 11 open-source projects by extracting the data and call dependencies between modules. We then employed univariate logistic regression to investigate how each individual network measure correlates with fault-proneness. Finally, we built multivariate prediction models to examine the usefulness of network measures under three prediction settings: cross-validation, across-release, and inter-project prediction. In particular, we used effort-aware performance indicators to compare the predictive ability of network measures against commonly used code metrics in both ranking and classification scenarios.

Results: Across the 11 open-source systems, our results show that: (1) most network measures are significantly positively related to fault-proneness; (2) the performance of network measures varies across prediction settings; and (3) network measures perform inconsistently across projects.

Conclusion: Network measures are of practical value for effort-aware fault-proneness prediction, but researchers and practitioners should be careful in choosing whether and when to use them.
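To make the Method concrete, here is a minimal, self-contained sketch of the measurement pipeline it describes: build a call/data-dependency network, compute one network measure per module, and fit a univariate logistic regression against fault labels. The module names, edges, and fault labels below are hypothetical, and networkx/scikit-learn merely stand in for whatever tooling the authors actually used.

```python
import networkx as nx
import numpy as np
from sklearn.linear_model import LogisticRegression

# Hypothetical dependency network: an edge u -> v means module u calls
# (or reads data defined in) module v. In the study, such networks are
# extracted from the source code of the 11 projects.
G = nx.DiGraph()
G.add_edges_from([
    ("ui", "core"), ("ui", "util"),
    ("core", "util"), ("core", "io"),
    ("io", "util"), ("net", "io"), ("net", "core"),
])

# Hypothetical fault labels (1 = at least one post-release fault).
faults = {"ui": 0, "core": 1, "util": 1, "io": 0, "net": 0}

# One example network measure: betweenness centrality of each module.
betweenness = nx.betweenness_centrality(G)

modules = sorted(G.nodes())
X = np.array([[betweenness[m]] for m in modules])  # single predictor
y = np.array([faults[m] for m in modules])

# Univariate logistic regression: how is this one measure, on its own,
# associated with fault-proneness?
model = LogisticRegression().fit(X, y)
print(dict(zip(modules, model.predict_proba(X)[:, 1].round(3))))
```

The multivariate models in the study would simply widen `X` to several network measures (degree, closeness, and so on), with the usual care for collinearity among them.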

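The effort-aware evaluation differs from a plain ranking in that modules are prioritized by predicted risk per unit of inspection effort, with effort typically approximated by lines of code. Below is a minimal sketch of one widely used indicator of this kind: the fraction of actual faults covered when inspecting the top 20% of total SLOC in risk-density order. All numbers are made up for illustration, and this is an assumed stand-in for the specific indicators the paper uses.

```python
def faults_found_at_effort(modules, effort_cutoff=0.2):
    """Rank modules by predicted fault probability per line of code and
    report the fraction of faults covered within the effort cutoff.

    `modules` is a list of (predicted_probability, sloc, fault_count) tuples.
    """
    total_sloc = sum(sloc for _, sloc, _ in modules)
    total_faults = sum(f for _, _, f in modules)
    # Effort-aware ranking: highest predicted risk per unit of effort first.
    ranked = sorted(modules, key=lambda m: m[0] / m[1], reverse=True)

    inspected_sloc = 0
    found = 0
    for prob, sloc, fault_count in ranked:
        if inspected_sloc + sloc > effort_cutoff * total_sloc:
            break
        inspected_sloc += sloc
        found += fault_count
    return found / total_faults if total_faults else 0.0

# Hypothetical predictions: (predicted probability, SLOC, actual faults).
preds = [(0.9, 1200, 3), (0.7, 200, 2), (0.4, 5000, 1), (0.1, 300, 0)]
print(faults_found_at_effort(preds))  # fraction of faults in top 20% SLOC
```

Note how the large 5000-SLOC module drops to the bottom of the queue despite a moderate predicted probability; this is exactly the effect that distinguishes effort-aware indicators from effort-ignorant ones such as plain recall or AUC.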