Quality Assessment in Systematic Literature Reviews: A Software Engineering Perspective

Abstract Context: Quality Assessment (QA) of reviewed literature is paramount to a Systematic Literature Review (SLR) as the quality of conclusions completely depends on the quality of selected literature. A number of researchers in Software Engineering (SE) have developed a variety of QA instruments and also reported their challenges. We previously conducted a tertiary study on SLRs with QA from 2004 to 2013, and reported the findings in 2015. Objective: With the widespread use of SLRs in SE and the increasing adoption of QA in these SLRs in recent years, it is necessary to empirically investigate whether the previous conclusions are still valid and whether there are new insights to the subject in question using a larger and a more up-to-date SLR set. More importantly, we aim to depict a clear picture of QA used in SLRs in SE by aggregating and distilling good practices, including the commonly used QA instruments as well as the major roles and aspects of QA in research. Method: An extended tertiary study was conducted with the newly collected SLRs from 2014 to 2018 and the original SLRs from 2004 to 2013 to systematically review the QA used by SLRs in SE during the 15-year period from 2004 to 2018. In addition, this extended study also compared and contrasted the findings of the previous study conducted in 2015. Results: A total of 241 SLRs between 2004 and 2018 were included, from which we identified a number of QA instruments. These instruments are generally designed to focus on the rationality of study design, the rigor of study execution and analysis, and the credibility and contribution of study findings and conclusions, with the emphasis largely placed on its rigor. The quality data is mainly used for literature selection or as evidence to support conclusions. Conclusions: QA has received much attention in SE in more recent years and the improvement is evident since the last study in 2015. New findings show that the aims are more concise, the instruments are more diverse and rigorous, and the criteria are more thoughtful.

[1]  Naveed Ikram,et al.  Empirical Evidence in Software Architecture: A Systematic Literature Review Protocol , 2011, ICSEA 2011.

[2]  Kai Petersen,et al.  CERSE - Catalog for empirical research in software engineering: A Systematic mapping study , 2019, Inf. Softw. Technol..

[3]  Rory V. O'Connor,et al.  A multivocal literature review on serious games for software process standards education , 2018, Comput. Stand. Interfaces.

[4]  Per Runeson,et al.  Checklists for Software Engineering Case Study Research , 2007, First International Symposium on Empirical Software Engineering and Measurement (ESEM 2007).

[5]  Mehdi Mirakhorli,et al.  Datasets from Fifteen Years of Automated Requirements Traceability Research: Current State, Characteristics, and Quality , 2017, 2017 IEEE 25th International Requirements Engineering Conference (RE).

[6]  Xin Huang,et al.  Quality assessment of systematic reviews in software engineering: a tertiary study , 2015, EASE.

[7]  Xin Huang,et al.  Synthesizing Qualitative Research in Software Engineering: A Critical Review , 2018, 2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE).

[8]  Dietmar Pfahl,et al.  Reporting guidelines for controlled experiments in software engineering , 2005, 2005 International Symposium on Empirical Software Engineering, 2005..

[9]  Mark Staples,et al.  Experiences using systematic review guidelines , 2006, J. Syst. Softw..

[10]  Ghulam Rasool,et al.  Multilingual Source Code Analysis: A Systematic Literature Review , 2017, IEEE Access.

[11]  Slinger Jansen,et al.  Defining multi-tenancy: A systematic mapping study on the academic and the industrial perspective , 2015, J. Syst. Softw..

[12]  Kai Petersen,et al.  An Empirically Evaluated Checklist for Surveys in Software Engineering , 2019, Inf. Softw. Technol..

[13]  Muhammad Shahbaz,et al.  A systematic literature review on electricity management systems , 2015 .

[14]  Roel Wieringa Towards a unified checklist for empirical research in software engineering: first proposal , 2012, EASE.

[15]  Per Runeson,et al.  Can we evaluate the quality of software engineering experiments? , 2010, ESEM '10.

[16]  V. Grover,et al.  An assessment of survey research in POM: from constructs to theory , 1998 .

[17]  Tony Gorschek,et al.  A method for evaluating rigor and industrial relevance of technology evaluations , 2011, Empirical Software Engineering.

[18]  Roel Wieringa,et al.  Requirements engineering paper classification and evaluation criteria: a proposal and a discussion , 2005, Requirements Engineering.

[19]  Shari Lawrence Pfleeger,et al.  Preliminary Guidelines for Empirical Research in Software Engineering , 2002, IEEE Trans. Software Eng..

[20]  Mateen Ahmed Abbasi,et al.  Assessment of Requirement Elicitation Tools and Techniques by Various Parameters , 2015 .

[21]  Jordi Cabot,et al.  Formal verification of static software models in MDE: A systematic review , 2014, Inf. Softw. Technol..

[22]  Pearl Brereton,et al.  A systematic review of systematic review process research in software engineering , 2013, Inf. Softw. Technol..

[23]  Marcelo Fantinato,et al.  Improvements to the Function Point Analysis Method: A Systematic Literature Review , 2015, IEEE Transactions on Engineering Management.

[24]  Paul W. P. J. Grefen,et al.  A systematic literature review on the architecture of business process management systems , 2017, Inf. Syst..

[25]  GasparicMarko,et al.  What recommendation systems for software engineering recommend , 2016 .

[26]  Pearl Brereton,et al.  Systematic literature reviews in software engineering - A tertiary study , 2010, Inf. Softw. Technol..

[27]  Vahid Garousi,et al.  Software test maturity assessment and test process improvement: A multivocal literature review , 2017, Inf. Softw. Technol..

[28]  Pearl Brereton,et al.  Reporting systematic reviews: Some lessons from a tertiary study , 2017, Inf. Softw. Technol..

[29]  Pearl Brereton,et al.  Evidence-Based Software Engineering and Systematic Reviews , 2015 .

[30]  T. Dybå,et al.  Applying Systematic Reviews to Diverse Study Types: An Experience Report , 2007, First International Symposium on Empirical Software Engineering and Measurement (ESEM 2007).

[31]  Teodor Sommestad,et al.  Variables influencing information security policy compliance: A systematic review of quantitative studies , 2014, Inf. Manag. Comput. Secur..

[32]  Emilia Mendes,et al.  A systematic review of Web engineering research , 2005, 2005 International Symposium on Empirical Software Engineering, 2005..

[33]  Natalia Juristo Juzgado,et al.  Effectiveness of Requirements Elicitation Techniques: Empirical Results Derived from a Systematic Review , 2006, 14th IEEE International Requirements Engineering Conference (RE'06).

[34]  Pearl Brereton,et al.  Systematic literature reviews in software engineering - A systematic literature review , 2009, Inf. Softw. Technol..

[35]  Vahid Garousi,et al.  Web application testing: A systematic literature review , 2014, J. Syst. Softw..

[36]  Cheng Zhang,et al.  Preliminary Reporting Guidelines for Experience Papers , 2009, EASE.

[37]  Barbara Kitchenham,et al.  Procedures for Performing Systematic Reviews , 2004 .

[38]  Tony Gorschek,et al.  A systematic review of domain analysis solutions for product lines , 2009, J. Syst. Softw..

[39]  Tore Dybå,et al.  Empirical studies of agile software development: A systematic review , 2008, Inf. Softw. Technol..

[40]  Tore Dybå,et al.  Strength of evidence in systematic reviews in software engineering , 2008, ESEM '08.

[41]  Austen Rainer,et al.  Case Study Research in Software Engineering - Guidelines and Examples , 2012 .

[42]  Mohammad Ghafari,et al.  A systematic literature review of software visualization evaluation , 2018, J. Syst. Softw..

[43]  Muhammad Ali Babar,et al.  Identifying relevant studies in software engineering , 2011, Inf. Softw. Technol..

[44]  M. Petticrew,et al.  Systematic Reviews in the Social Sciences: A Practical Guide , 2005 .

[45]  G. Guyatt,et al.  Grading quality of evidence and strength of recommendations , 2004, BMJ : British Medical Journal.

[46]  Daniel Lucrédio,et al.  Software Engineering for the Cloud: A Research Roadmap , 2012, 2012 26th Brazilian Symposium on Software Engineering.

[47]  Káthia Marçal de Oliveira,et al.  Rethinking Interoperability in Contemporary Software Systems , 2017, 2017 IEEE/ACM Joint 5th International Workshop on Software Engineering for Systems-of-Systems and 11th Workshop on Distributed Software Development, Software Ecosystems and Systems-of-Systems (JSOS).

[48]  Hamzeh Mohammad Alabool,et al.  Cloud service evaluation method-based Multi-Criteria Decision-Making: A systematic literature review , 2018, J. Syst. Softw..

[49]  Uwe Zdun,et al.  Systematic Review of Software Behavioral Model Consistency Checking , 2017, ACM Comput. Surv..

[50]  Paolo Tell,et al.  How Does Software Process Improvement Address Global Software Engineering? , 2016, 2016 IEEE 11th International Conference on Global Software Engineering (ICGSE).

[51]  Claes Wohlin,et al.  Experimentation in Software Engineering , 2012, Springer Berlin Heidelberg.

[52]  E. Murphy,et al.  A response to 'Quality in qualitative evaluation: a framework for assessing research evidence' , 2004 .

[53]  Tore Dybå,et al.  Evidence-based software engineering , 2004, Proceedings. 26th International Conference on Software Engineering.

[54]  Andrea Janes,et al.  What recommendation systems for software engineering recommend: A systematic literature review , 2016, J. Syst. Softw..

[55]  Pearl Brereton,et al.  An Evaluation of Quality Checklist Proposals - A participant-observer case study , 2009, EASE.