The Use of Visual Text Mining to Support the Study Selection Activity in Systematic Literature Reviews: A Replication Study

Background: Systematic literature reviews (SLRs) are an important means of identifying and aggregating research evidence from different empirical studies. One of the activities in the SLR process is the selection of primary studies, which can be arduous, particularly when the researcher faces a large volume of candidate studies. Aim: A previous experiment, conducted as a pilot test, compared the performance and effectiveness of graduate students selecting primary studies manually and with visual text mining (VTM) techniques. This paper describes a replication of that experiment. Method: The replication used the same experimental design and materials as the previous experiment. Result: The previous experiment revealed that VTM techniques can speed up the selection of primary studies and increase the number of studies correctly included or excluded (effectiveness). The replication confirmed that studies are selected more rapidly using VTM. We also observed that the level of research experience is directly related to effectiveness. Conclusion: VTM techniques have proven valuable in the selection of primary studies.
