On the Performance of Hybrid Search Strategies for Systematic Literature Reviews in Software Engineering

Abstract Context When conducting a Systematic Literature Review (SLR), researchers usually face the challenge of designing a search strategy that appropriately balances result quality and review effort. Using digital library (or database) searches or snowballing alone may not be enough to achieve high-quality results. On the other hand, using both digital library searches and snowballing together may increase the overall review effort. Objective The goal of this research is to propose and evaluate hybrid search strategies that selectively combine database searches with snowballing. Method We propose four hybrid search strategies combining database searches in digital libraries with iterative, parallel, or sequential backward and forward snowballing. We simulated the strategies over three existing SLRs in SE that adopted both database searches and snowballing. We compared the outcome of digital library searches, snowballing, and hybrid strategies using precision, recall, and F-measure to investigate the performance of each strategy. Results Our results show that, for the analyzed SLRs, combining database searches from the Scopus digital library with parallel or sequential snowballing achieved the most appropriate balance of precision and recall. Conclusion We put forward that, depending on the goals of the SLR and the available resources, using a hybrid search strategy involving a representative digital library and parallel or sequential snowballing tends to represent an appropriate alternative to be used when searching for evidence in SLRs.

[1]  Mirko Perkusich,et al.  A systematic review on the use of Definition of Done on agile software development projects , 2017, EASE.

[2]  Natalia Juristo Juzgado,et al.  Systematic review and aggregation of empirical studies on elicitation techniques , 2011, IEEE Transactions on Software Engineering.

[3]  Claes Wohlin,et al.  Experimentation in Software Engineering , 2000, The Kluwer International Series in Software Engineering.

[4]  Claes Wohlin,et al.  Experiences from using snowballing and database searches in systematic literature studies , 2015, EASE.

[5]  Claes Wohlin,et al.  Second-generation systematic literature studies using snowballing , 2016, EASE.

[6]  Emilia Mendes,et al.  Cross- vs. within-company cost estimation studies revisited: an extended systematic review , 2014, EASE '14.

[7]  Elisa Yumi Nakagawa,et al.  Evaluating strategies for forward snowballing application to support secondary studies updates: emergent results , 2018, SBES.

[8]  Jacques Klein,et al.  An extensive systematic review on the Model-Driven Development of secure systems , 2015, Inf. Softw. Technol..

[9]  Alejandro Calderón,et al.  A systematic literature review on serious games evaluation: An application to software project management , 2015, Comput. Educ..

[10]  Pearl Brereton,et al.  Refining the systematic literature review process—two participant-observer case studies , 2010, Empirical Software Engineering.

[11]  Geraldo Barbosa Landre,et al.  Approaches to strategic alignment of software process improvement: A systematic literature review , 2017, J. Syst. Softw..

[12]  Pearl Brereton,et al.  Systematic literature reviews in software engineering - A systematic literature review , 2009, Inf. Softw. Technol..

[13]  Marco Aurélio Gerosa,et al.  A systematic literature review on the barriers faced by newcomers to open source software projects , 2015, Inf. Softw. Technol..

[14]  Claes Wohlin,et al.  Investigating the Use of a Hybrid Search Strategy for Systematic Reviews , 2017, 2017 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM).

[15]  Pearl Brereton,et al.  Systematic literature reviews in software engineering - A tertiary study , 2010, Inf. Softw. Technol..

[16]  Per Runeson,et al.  Reference-based search strategies in systematic reviews , 2009, EASE.

[17]  Emilia Mendes,et al.  How Reliable Are Systematic Reviews in Empirical Software Engineering? , 2010, IEEE Transactions on Software Engineering.

[18]  Pearl Brereton,et al.  Performing systematic literature reviews in software engineering , 2006, ICSE.

[19]  Kai Petersen,et al.  Considering rigor and relevance when evaluating test driven development: A systematic review , 2014, Inf. Softw. Technol..

[20]  Paramvir Singh,et al.  Exploring Automatic Search in Digital Libraries: A Caution Guide for Systematic Reviewers , 2017, EASE.

[21]  Claes Wohlin,et al.  Systematic literature studies: Database searches vs. backward snowballing , 2012, Proceedings of the 2012 ACM-IEEE International Symposium on Empirical Software Engineering and Measurement.

[22]  O. Dieste,et al.  Developing Search Strategies for Detecting Relevant Experiments for Systematic Reviews , 2007, First International Symposium on Empirical Software Engineering and Measurement (ESEM 2007).

[23]  Muhammad Ali Babar,et al.  On Searching Relevant Studies in Software Engineering , 2010, EASE.

[24]  Claes Wohlin,et al.  Search Strategy to Update Systematic Literature Reviews in Software Engineering , 2019, 2019 45th Euromicro Conference on Software Engineering and Advanced Applications (SEAA).

[25]  Emilia Mendes,et al.  Using Forward Snowballing to update Systematic Reviews in Software Engineering , 2016, ESEM.

[26]  Claes Wohlin,et al.  Guidelines for snowballing in systematic literature studies and a replication in software engineering , 2014, EASE '14.

[27]  Ayça Tarhan,et al.  On the Use of Ontologies in Software Process Assessment: A Systematic Literature Review , 2017, EASE.