论文信息 - Institutional Knowledge at Singapore Management University Institutional Knowledge at Singapore Management University When would this bug get reported? When would this bug get reported?

Institutional Knowledge at Singapore Management University Institutional Knowledge at Singapore Management University When would this bug get reported? When would this bug get reported?

—Not all bugs in software would be experienced and reported by end users right away: Some bugs manifest themselves quickly and may be reported by users a few days after they get into the code base; others manifest many months or even years later, and may only be experienced and reported by a small number of users. We refer to the period of time between the time when a bug is introduced into code and the time when it is reported by a user as bug reporting latency . Knowledge of bug reporting latencies has an implication on prioritization of bug ﬁxing activities—bugs with low reporting latencies may be ﬁxed earlier than those with high latencies to shift debugging resources towards bugs highly concerning users. To investigate bug reporting latencies, we analyze bugs from three Java software systems: AspectJ, Rhino, and Lucene. We extract bug reporting data from their version control repositories and bug tracking systems, identify bug locations based on bug ﬁxes, and back-trace bug introducing time based on change histories of the buggy code. Also, we remove non-essential changes, and most importantly, recover root causes of bugs from their treatments/ﬁxes. We then calculate the bug reporting latencies, and ﬁnd that bugs have diverse reporting latencies. Based on the calculated reporting latencies and features we extract from bugs, we build classiﬁcation models that can predict whether a bug would be reported early (within 30 days) or later, which may be helpful for prioritizing bug ﬁxing activities. Our evaluation on the three software systems shows that our bug reporting latency prediction models could achieve an AUC (Area Under the Receiving Operating Characteristics Curve) of 70.869%.

[1] David Lo,et al. Improved Duplicate Bug Report Identification , 2012, 2012 16th European Conference on Software Maintenance and Reengineering.

[2] Serge Demeyer,et al. Filtering Bug Reports for Fix-Time Analysis , 2012, 2012 16th European Conference on Software Maintenance and Reengineering.

[3] Michael W. Godfrey,et al. A Market-Based Bug Allocation Mechanism Using Predictive Bug Lifetimes , 2012, 2012 16th European Conference on Software Maintenance and Reengineering.

[4] LiGuo Huang,et al. AutoODC: Automated generation of orthogonal defect classifications , 2011, 2011 26th IEEE/ACM International Conference on Automated Software Engineering (ASE 2011).

[5] Siau-Cheng Khoo,et al. Towards more accurate retrieval of duplicate bug reports , 2011, 2011 26th IEEE/ACM International Conference on Automated Software Engineering (ASE 2011).

[6] Daniele Romano,et al. Using source code metrics to predict change-prone Java interfaces , 2011, 2011 27th IEEE International Conference on Software Maintenance (ICSM).

[7] Martin P. Robillard,et al. Non-essential changes in version histories , 2011, 2011 33rd International Conference on Software Engineering (ICSE).

[8] Sooyong Park,et al. Which Crashes Should I Fix First?: Predicting Top Crashes at an Early Stage to Prioritize Debugging Efforts , 2011, IEEE Transactions on Software Engineering.

[9] Christophe Calvès,et al. Faults in linux: ten years later , 2011, ASPLOS XVI.

[10] Serge Demeyer,et al. Comparing Mining Algorithms for Predicting the Severity of a Reported Bug , 2011, 2011 15th European Conference on Software Maintenance and Reengineering.

[11] Sangkyum Kim,et al. NDPMine: Efficiently Mining Discriminative Numerical Features for Pattern-Based Classification , 2010, ECML/PKDD.

[12] William Pugh,et al. The Google FindBugs fixit , 2010, ISSTA '10.

[13] Bart Goethals,et al. Predicting the severity of a reported bug , 2010, 2010 7th IEEE Working Conference on Mining Software Repositories (MSR 2010).

[14] Siau-Cheng Khoo,et al. A discriminative model approach for accurate duplicate bug report retrieval , 2010, 2010 ACM/IEEE 32nd International Conference on Software Engineering.

[15] Mangala Gowri Nanda,et al. Making defect-finding tools work for you , 2010, 2010 ACM/IEEE 32nd International Conference on Software Engineering.

[16] Premkumar T. Devanbu,et al. Fair and balanced?: bias in bug-fix datasets , 2009, ESEC/FSE '09.

[17] Jiawei Han,et al. Classification of software behaviors for failure detection: a discriminative pattern mining approach , 2009, KDD.

[18] KhooSiau-Cheng,et al. Non-redundant sequential rules-Theory and algorithm , 2009 .

[19] Siau-Cheng Khoo,et al. Non-redundant sequential rules - Theory and algorithm , 2009, Inf. Syst..

[20] Sunghun Kim,et al. Toward an understanding of bug fix patterns , 2009, Empirical Software Engineering.

[21] Westley Weimer,et al. The road not taken: Estimating path execution frequency statically , 2009, 2009 IEEE 31st International Conference on Software Engineering.

[22] Tim Menzies,et al. Automated severity assessment of software defect reports , 2008, 2008 IEEE International Conference on Software Maintenance.

[23] Bart Baesens,et al. Benchmarking Classification Models for Software Defect Prediction: A Proposed Framework and Novel Findings , 2008, IEEE Transactions on Software Engineering.

[24] Nicholas Jalbert,et al. Automated duplicate detection for bug tracking systems , 2008, 2008 IEEE International Conference on Dependable Systems and Networks With FTCS and DCC (DSN).

[25] Tao Xie,et al. An approach to detecting duplicate bug reports using natural language and execution information , 2008, 2008 ACM/IEEE 30th International Conference on Software Engineering.

[26] Daniel M. Germán,et al. Towards a simplification of the bug report form in eclipse , 2008, MSR '08.

[27] Chao Liu,et al. Efficient Mining of Recurrent Rules from a Sequence Database , 2008, DASFAA.

[28] Michael D. Ernst,et al. Which warnings should I fix first? , 2007, ESEC-FSE '07.

[29] Per Runeson,et al. Detection of Duplicate Defect Reports Using Natural Language Processing , 2007, 29th International Conference on Software Engineering (ICSE'07).

[30] Andreas Zeller,et al. How Long Will It Take to Fix This Bug? , 2007, Fourth International Workshop on Mining Software Repositories (MSR'07:ICSE Workshops 2007).

[31] Jiawei Han,et al. Discriminative Frequent Pattern Analysis for Effective Classification , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[32] Thomas Zimmermann,et al. Automatic Identification of Bug-Introducing Changes , 2006, 21st IEEE/ACM International Conference on Automated Software Engineering (ASE'06).

[33] Thorsten Joachims,et al. Training linear SVMs in linear time , 2006, KDD '06.

[34] Tom Fawcett,et al. An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[35] Sunghun Kim,et al. How long did it take to fix bugs? , 2006, MSR '06.

[36] David Leon,et al. Tree-based methods for classifying software failures , 2004, 15th International Symposium on Software Reliability Engineering.

[37] David R. Karger,et al. Tackling the Poor Assumptions of Naive Bayes Text Classifiers , 2003, ICML.

[38] Charles X. Ling,et al. AUC: A Better Measure than Accuracy in Comparing Learning Algorithms , 2003, Canadian Conference on AI.

[39] Bin Wang,et al. Automated support for classifying software failure reports , 2003, 25th International Conference on Software Engineering, 2003. Proceedings..

[40] YangJunfeng,et al. An empirical study of operating systems errors , 2001 .

[41] Yoav Freund,et al. The Alternating Decision Tree Learning Algorithm , 1999, ICML.

[42] Yoav Freund,et al. Large Margin Classification Using the Perceptron Algorithm , 1998, COLT' 98.

[43] Sergey Brin,et al. The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[44] William W. Cohen. Fast Effective Rule Induction , 1995, ICML.

[45] P. Lachenbruch. Statistical Power Analysis for the Behavioral Sciences (2nd ed.) , 1989 .

[46] Leonard M. Freeman,et al. A set of measures of centrality based upon betweenness , 1977 .

[47] Jacob Cohen. Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[48] Sebastian G. Elbaum,et al. Predicting accurate and actionable static analysis warnings , 2008, 2008 ACM/IEEE 30th International Conference on Software Engineering.

[49] อนิรุธ สืบสิงห์,et al. Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[50] Jinyan Li,et al. Mining and Ranking Generators of Sequential Patterns , 2008, SDM.