A Systematic Mapping on the use of Visual Data Mining to Support the Conduct of Systematic Literature Reviews

A systematic literature review (SLR) is a methodology used to find and aggregate all relevant existing evidence about a specific research question of interest. Important decisions need to be made at several points in the review process, relating to search of the literature, selection of relevant primary studies and use of methods of synthesis. Visualization can support tasks that involve large collections of data, such as the studies collected, evaluated and summarized in an SLR. The objective of this paper is to present the results of a systematic mapping study (SM) conducted to collect and evaluate evidence on the use of a specific visualization technique, visual data mining (VDM), to support the SLR process. We reviewed 20 papers and our results indicate a scarcity of research on the use of VDM to help with conducting SLRs in the software engineering domain. However, most of the studies (16 of the 20 studies included in our mapping) have been conducted in the field of medicine and they revealed that the activities of data extraction and data synthesis, related to conducting the review phase of an SLR process, have more VDM support than other activities. In contrast, according to our SM, previous studies using VDM techniques with SLRs have not employed such techniques during the SLR’s planning and reporting phases.

[1]  Per Runeson,et al.  A systematic review on regression test selection techniques , 2010, Inf. Softw. Technol..

[2]  Sophia Ananiadou,et al.  Supporting Systematic Reviews Using Text Mining , 2009 .

[3]  Jianhui Luo,et al.  Experiments on Supervised Learning Algorithms for Text Categorization , 2005, 2005 IEEE Aerospace Conference.

[4]  Haim Levkowitz,et al.  From Visual Data Exploration to Visual Data Mining: A Survey , 2003, IEEE Trans. Vis. Comput. Graph..

[5]  Manoel G. Mendonça,et al.  A Visual Text Mining approach for Systematic Reviews , 2007, First International Symposium on Empirical Software Engineering and Measurement (ESEM 2007).

[6]  Rosane Minghim,et al.  Visual text mining using association rules , 2007, Comput. Graph..

[7]  Barbara Kitchenham,et al.  Procedures for Performing Systematic Reviews , 2004 .

[8]  Stan Matwin,et al.  Parameterized Contrast in Second Order Soft Co-occurrences: A Novel Text Representation Technique in Text Mining and Knowledge Extraction , 2009, 2009 IEEE International Conference on Data Mining Workshops.

[9]  Bart De Moor,et al.  Co-clustering approaches to integrate lexical and bibliographical information , 2005 .

[10]  Pearl Brereton,et al.  Systematic literature reviews in software engineering - A tertiary study , 2010, Inf. Softw. Technol..

[11]  Mehwish Riaz,et al.  Experiences Conducting Systematic Reviews from Novices' Perspective , 2010, EASE.

[12]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[13]  Shari Lawrence Pfleeger,et al.  Preliminary Guidelines for Empirical Research in Software Engineering , 2002, IEEE Trans. Software Eng..

[14]  Aaron M. Cohen,et al.  SYRIAC: The SYstematic Review Information Automated Collection System A Data Warehouse for Facilitating Automated Biomedical Text Classification , 2008, AMIA.

[15]  Frank Harary,et al.  Graph Theory , 2016 .

[16]  Maurice H. T. Ling,et al.  Reconstruction of Protein-Protein Interaction Pathways by Mining Subject-Verb-Objects Intermediates , 2007, PRIB.

[17]  Pearl Brereton,et al.  Performing systematic literature reviews in software engineering , 2006, ICSE.

[18]  Tanja Bekhuis Conceptual biology, hypothesis discovery, and text mining: Swanson's legacy , 2006, Biomedical digital libraries.

[19]  Tore Dybå,et al.  Applying Systematic Reviews to Diverse Study Types: An Experience Report , 2007, First International Symposium on Empirical Software Engineering and Measurement (ESEM 2007).

[20]  Pearl Brereton,et al.  Systematic literature reviews in software engineering - A systematic literature review , 2009, Inf. Softw. Technol..

[21]  E A Ponomarenko,et al.  [Identification of differentially expressed proteins using automatic meta-analysis of proteomics-related articles]. , 2009, Biomeditsinskaia khimiia.

[22]  Pankaj Chopra Data mining techniques to enable large-scale exploratory analysis of heterogeneous scientific data , 2009 .

[23]  James P. Sluka Extracting Knowledge from Genomic Experiments by Incorporating the Biomedical Literature , 2002 .

[24]  Thomas Werner,et al.  The next generation of literature analysis: Integration of genomic analysis into text mining , 2005, Briefings Bioinform..

[25]  Pearl Brereton,et al.  Lessons from applying the systematic literature review process within the software engineering domain , 2007, J. Syst. Softw..

[26]  Francisco Tirado,et al.  SENT: semantic features in text , 2009, Nucleic Acids Res..

[27]  Bernard Dousset,et al.  Combining mining and visualization tools to discover the geographic structure of a domain , 2006, Comput. Environ. Urban Syst..

[28]  J. Burnham Scopus database: a review , 2006, Biomedical digital libraries.

[29]  Daniel A. Keim,et al.  Information Visualization and Visual Data Mining , 2002, IEEE Trans. Vis. Comput. Graph..

[30]  Dah-Jye Lee,et al.  Finding relevant PDF medical journal articles by the content of their figures , 2007, SPIE Medical Imaging.

[31]  Rosane Minghim,et al.  An Approach Based on Visual Text Mining to Support Categorization and Classification in the Systematic Mapping , 2010, EASE.

[32]  Dursun Delen,et al.  Seeding the survey and analysis of research literature with text mining , 2008, Expert Syst. Appl..

[33]  Vladimir Brusic,et al.  Data mining of cancer vaccine trials: a bird's-eye view , 2008, Immunome research.

[34]  Tore Dybå,et al.  Strength of evidence in systematic reviews in software engineering , 2008, ESEM '08.

[35]  Elena A. Ponomarenko,et al.  Identification of differentially expressed proteins using automated meta-analysis of proteomic articles , 2009 .

[36]  Arno Lukas,et al.  Analysis and prediction of protective continuous B-cell epitopes on pathogen proteins , 2008, Immunome research.

[37]  Choochart Haruechaiyasak,et al.  Enhancing the Literature Review Using Author-Topic Profiling , 2008, ICADL.

[38]  John F. Hurdle,et al.  Extracting Information from Textual Documents in the Electronic Health Record: A Review of Recent Research , 2008, Yearbook of Medical Informatics.

[39]  Rosane Minghim,et al.  HiPP: A Novel Hierarchical Point Placement Strategy and its Application to the Exploration of Document Collections , 2008, IEEE Transactions on Visualization and Computer Graphics.

[40]  Sándor Dominich,et al.  Formal Foundation of Information Retrieval , 2000 .

[41]  M. Petticrew,et al.  Systematic Reviews in the Social Sciences: A Practical Guide , 2005 .

[42]  Gurpreet Singh Lehal,et al.  A Survey of Text Mining Techniques and Applications , 2009 .

[43]  Antonio Jimeno-Yepes,et al.  Exploitation of ontological resources for scientific literature analysis: Searching genes and related diseases , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[44]  Alan L. Porter,et al.  Research profiling: Improving the literature review , 2002, Scientometrics.