Automatically identifying code features for software defect prediction: Using AST N-grams

Context: Identifying defects in code early is important. A wide range of static code metrics has been evaluated as potential defect indicators, but most offer only high-level insights and focus on particular pre-selected features of the code. No currently used metric clearly performs best in defect prediction.

Objective: We use Abstract Syntax Tree (AST) n-grams to identify features of defective Java code that improve defect prediction performance.

Method: Our approach is bottom-up and does not rely on pre-selecting any specific features of code. We use non-parametric testing to determine relationships between AST n-grams and faults in both open-source and commercial systems, and we build defect prediction models using three machine learning techniques.

Results: AST n-grams are very significantly related to faults in some systems, with very large effect sizes. The occurrence of some frequently occurring AST n-grams in a method can mean that the method is up to three times more likely to contain a fault. AST n-grams can have a large effect on the performance of defect prediction models.

Conclusions: We suggest that AST n-grams offer developers a promising approach to identifying potentially defective code.
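To make the core idea concrete, the following is a minimal sketch of extracting AST n-grams as features: parse source code, walk the tree in a fixed (here, depth-first preorder) order, record node-type names, and slide a window of length n over that sequence. This is only an illustration of the general technique, not the paper's implementation — it uses Python's standard ast module on Python source, whereas the paper's approach would require a Java parser; the function name ast_ngrams and the choice of n are illustrative.

```python
import ast
from collections import Counter

def ast_ngrams(source: str, n: int = 2) -> Counter:
    """Count n-grams of AST node-type names taken from a
    depth-first preorder walk of the parsed source."""
    tree = ast.parse(source)

    def preorder(node):
        # Visit the node itself, then each child subtree in order.
        yield type(node).__name__
        for child in ast.iter_child_nodes(node):
            yield from preorder(child)

    names = list(preorder(tree))
    # Slide a window of length n over the node-name sequence.
    return Counter(tuple(names[i:i + n]) for i in range(len(names) - n + 1))

code = "def f(x):\n    if x > 0:\n        return x\n    return -x\n"
counts = ast_ngrams(code, n=2)
```

Each method's n-gram counts (or their presence/absence) can then serve as the feature vector fed to a classifier, which is the role they play in the defect prediction models described above.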
