High-Dimensional Hybrid Data Reduction for Effective Bug Triage

Owing to the ever-expanding scale of software, solving the problem of bug triage efficiently and reasonably has become one of the most important issues in software project maintenance. However, there are two challenges in bug triage: low quality of bug reports and engagement of developers. Most of the existing bug triage solutions are based on the text information and have no consideration of developer engagement, which leads to the loss of bug triage accuracy. To overcome these two challenges, we propose a high-dimensional hybrid data reduction method that combines feature selection with instance selection to build a small-scale and high-quality dataset of bug reports by removing redundant or noninformative bug reports and words. In addition, we also study the recent engagement of developers, which can effectively distinguish similar bug reports and provide a more suitable list of the recommended developers. Finally, we experiment with four bug repositories: GCC, OpenOffice, Mozilla, and NetBeans. We experimentally verify that our method can effectively improve the efficiency of bug triage.

[1]  Rong Chen,et al.  Improved SMOTE Algorithm to Deal with Imbalanced Activity Classes in Smart Homes , 2018, Neural Processing Letters.

[2]  Rong Chen,et al.  Ensemble Data Reduction Techniques and Multi-RSMOTE via Fuzzy Integral for Bug Report Classification , 2018, IEEE Access.

[3]  Xiaochen Li,et al.  Toward Better Summarizing Bug Reports With Crowdsourcing Elicited Attributes , 2019, IEEE Transactions on Reliability.

[4]  He Jiang,et al.  Towards Effective Bug Triage with Software Data Reduction Techniques , 2017, IEEE Transactions on Knowledge and Data Engineering.

[5]  Jing Liu,et al.  Analyzing the structure of Java software systems by weighted K-core decomposition , 2017, Future Gener. Comput. Syst..

[6]  Damien Challet,et al.  Bug propagation and debugging in asymmetric software structures , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[7]  Wu Deng,et al.  A novel collaborative optimization algorithm in solving complex optimization problems , 2016, Soft Computing.

[8]  David Lo,et al.  Improving Automated Bug Triaging with Specialized Topic Model , 2017, IEEE Transactions on Software Engineering.

[9]  Gail C. Murphy,et al.  Reducing the effort of bug report triage: Recommenders for development-oriented decisions , 2011, TSEM.

[10]  Thomas Zimmermann,et al.  What Makes a Good Bug Report? , 2010, IEEE Trans. Software Eng..

[11]  Rong Chen,et al.  Identify Severity Bug Report with Distribution Imbalance by CR-SMOTE and ELM , 2019, Int. J. Softw. Eng. Knowl. Eng..

[12]  Meng Sun,et al.  A New Feature Extraction Method Based on EEMD and Multi-Scale Fuzzy Entropy for Motor Bearing , 2016, Entropy.

[13]  Wu Deng,et al.  An Improved Ant Colony Optimization Algorithm Based on Hybrid Strategies for Scheduling Problem , 2019, IEEE Access.

[14]  Weifeng Pan,et al.  Structure-aware Mashup service Clustering for cloud-based Internet of Things using genetic algorithm based clustering algorithm , 2018, Future Gener. Comput. Syst..

[15]  Rong Chen,et al.  The Influence Ranking for Testers in Bug Tracking Systems , 2019, Int. J. Softw. Eng. Knowl. Eng..

[16]  Bo Li,et al.  Study on an improved adaptive PSO algorithm for solving multi-objective gate assignment , 2017, Applied Soft Computing.

[17]  Ling Xu,et al.  Study on a Novel Fault Damage Degree Identification Method Using High-Order Differential Mathematical Morphology Gradient Spectrum Entropy , 2018, Entropy.

[18]  Rong Chen,et al.  Fusion of Multi-RSMOTE With Fuzzy Integral to Classify Bug Reports With an Imbalanced Distribution , 2019, IEEE Transactions on Fuzzy Systems.

[19]  Wu Deng,et al.  A Novel Fault Diagnosis Method Based on Integrating Empirical Wavelet Transform and Fuzzy Entropy for Motor Bearing , 2018, IEEE Access.

[20]  Kangshun Li,et al.  Identifying key classes in object-oriented software using generalized k-core decomposition , 2018, Future Gener. Comput. Syst..

[21]  Jing Wang,et al.  Topology and Topic-Aware Service Clustering , 2018, Int. J. Web Serv. Res..

[22]  Yutao Ma,et al.  Measuring Structural Quality of Object-Oriented Softwares via Bug Propagation Analysis on Weighted Software Networks , 2010, Journal of Computer Science and Technology.

[23]  Tao Zhang,et al.  ROSF: Leveraging Information Retrieval and Supervised Learning for Recommending Code Snippets , 2017, IEEE Transactions on Services Computing.

[24]  Haibo Jiang,et al.  Measuring Software Modularity Based on Software Networks , 2019, Entropy.