Predicting Blocking Bugs with Machine Learning Techniques: A Systematic Review

— The application of machine learning (ML) techniques to predict blocking bugs have emerged for the early detection of Blocking Bugs (BBs) in software components to mitigate the adverse effect of BBs on software release and project cost. This study presents a systematic literature review of the trends in the application of ML techniques in BB prediction, existing research gaps, and possible research directions to serve as a reference for future research and an application insight for software engineers. We constructed search phrases from relevant terms and used them to extract peer-reviewed studies from the databases of five famous academic publishers, namely Scopus, SpringerLink, IEEE Xplore, ACM digital library, and ScienceDirect. We included primary studies published between January 2012 and February 2022 that applied ML techniques to building Blocking Bug Prediction models (BBPMs). Our result reveals a paucity of literature on BBPMs. Also, previous researchers employed ML techniques such as Decision Trees, Random Forest, Bayes Network, XGBoost, and DNN in building existing BB prediction models. However, the publicly available datasets for building BBPMs are significantly imbalanced. De-spite the poor performance of the Accuracy metric where imbalanced datasets are concerned, some primary studies still utilized the Accuracy metric to assess the performance of their proposed BBPM. Further research is required to validate existing and new BBPM on datasets of commercial software projects. Also, future researchers should mitigate the effect of class imbalance on the proposed BB prediction model before training a BBPM.

[1]  Adebayo Felix Adekoya,et al.  DeepLaBB: A Deep Learning Framework for Blocking Bugs , 2021, 2021 International Conference on Cyber Security and Internet of Things (ICSIoT).

[2]  Sajjad Mahmood,et al.  A maturity model for secure requirements engineering , 2020, Comput. Secur..

[3]  Tao Zhang,et al.  Blocking Bug Prediction Based on XGBoost with Enhanced Features , 2020, 2020 IEEE 44th Annual Computers, Software, and Applications Conference (COMPSAC).

[4]  Siffat Ullah Khan,et al.  Empirical Validation of Software Integration Practices in Global Software Development , 2020, SN Computer Science.

[5]  Ricardo da Silva Torres,et al.  Bug report severity level prediction in open source software: A survey and research opportunities , 2019, Inf. Softw. Technol..

[6]  Vahid Garousi,et al.  Aligning software engineering education with industrial needs: A meta-analysis , 2019, J. Syst. Softw..

[7]  Lucas Venezian Povoa,et al.  Software requirements testing approaches: a systematic literature review , 2019, Requirements Engineering.

[8]  Atika Qazi,et al.  Towards Sustainable Energy: A Systematic Review of Renewable Energy Sources, Technologies, and Public Opinions , 2019, IEEE Access.

[9]  Yuming Zhou,et al.  Predicting the Breakability of Blocking Bug Pairs , 2018, 2018 IEEE 42nd Annual Computer Software and Applications Conference (COMPSAC).

[10]  Silvana V. Aciar,et al.  Methodology for systematic literature review applied to engineering and education , 2018, 2018 IEEE Global Engineering Education Conference (EDUCON).

[11]  John Grundy,et al.  Systematic literature reviews in agile software development: A tertiary study , 2017, Inf. Softw. Technol..

[12]  M. Goulão,et al.  Quality in model-driven engineering: a tertiary study , 2016, Software Quality Journal.

[13]  David Lo,et al.  ELBlocker: Predicting blocking bugs with ensemble imbalance learning , 2015, Inf. Softw. Technol..

[14]  Ali Selamat,et al.  A systematic literature review of software requirements prioritization research , 2014, Inf. Softw. Technol..

[15]  Emad Shihab,et al.  Characterizing and predicting blocking bugs in open source projects , 2014, MSR 2014.

[16]  Danny Weyns,et al.  Variability in Software Systems—A Systematic Literature Review , 2014, IEEE Transactions on Software Engineering.

[17]  Premkumar T. Devanbu,et al.  How, and why, process metrics are better , 2013, 2013 35th International Conference on Software Engineering (ICSE).

[18]  Daniela Cruzes,et al.  Recommended Steps for Thematic Synthesis in Software Engineering , 2011, 2011 International Symposium on Empirical Software Engineering and Measurement.

[19]  Pearl Brereton,et al.  Systematic literature reviews in software engineering - A tertiary study , 2010, Inf. Softw. Technol..

[20]  Pearl Brereton,et al.  Performing systematic literature reviews in software engineering , 2006, ICSE.

[21]  Barry W. Boehm,et al.  Software Engineering Economics , 1993, IEEE Transactions on Software Engineering.

[22]  Seetha Hari,et al.  Learning From Imbalanced Data , 2019, Advances in Computer and Electrical Engineering.

[23]  Tim Menzies,et al.  Assessing Predictors of Software Defects , 2004 .