Which academic search systems are suitable for systematic reviews or meta‐analyses? Evaluating retrieval qualities of Google Scholar, PubMed, and 26 other resources

Rigorous evidence identification is essential for systematic reviews and meta‐analyses (evidence syntheses) because the sample selection of relevant studies determines a review's outcome, validity, and explanatory power. Yet, the search systems allowing access to this evidence provide varying levels of precision, recall, and reproducibility and also demand different levels of effort. To date, it remains unclear which search systems are most appropriate for evidence synthesis and why. Advice on which search engines and bibliographic databases to choose for systematic searches is limited and lacking systematic, empirical performance assessments. This study investigates and compares the systematic search qualities of 28 widely used academic search systems, including Google Scholar, PubMed, and Web of Science. A novel, query‐based method tests how well users are able to interact and retrieve records with each system. The study is the first to show the extent to which search systems can effectively and efficiently perform (Boolean) searches with regards to precision, recall, and reproducibility. We found substantial differences in the performance of search systems, meaning that their usability in systematic searches varies. Indeed, only half of the search systems analyzed and only a few Open Access databases can be recommended for evidence syntheses without adding substantial caveats. Particularly, our findings demonstrate why Google Scholar is inappropriate as principal search system. We call for database owners to recognize the requirements of evidence synthesis and for academic journals to reassess quality requirements for systematic reviews. Our findings aim to support researchers in conducting better searches for better evidence synthesis.

[1]  Nila A Sathe,et al.  Searching for studies: a guide to information retrieval for Campbell systematic reviews , 2017 .

[2]  Hannah Moshontz,et al.  Reporting standards for literature searches and report inclusion criteria: making research syntheses more transparent and easy to replicate , 2015, Research synthesis methods.

[3]  D. T. Richnell,et al.  Libraries in the People's Republic of China: A Report of a Visit, June I976 , 1977 .

[4]  Rachel Kettle,et al.  Identifying evidence for public health guidance: a comparison of citation searching with Web of Science and Google Scholar , 2016, Research synthesis methods.

[5]  David Baur,et al.  Effect of antibiotic stewardship on the incidence of infection and colonisation with antibiotic-resistant bacteria and Clostridium difficile infection: a systematic review and meta-analysis. , 2017, The Lancet. Infectious diseases.

[6]  A. Neelameghan,et al.  An Online Multi-Lingual, Multi-Faith Thesaurus: A Progress Report on F-THES , 2005, Webology.

[7]  G. Andersson,et al.  Internet-based vs. face-to-face cognitive behavior therapy for psychiatric and somatic disorders: an updated systematic review and meta-analysis , 2018, Cognitive behaviour therapy.

[8]  Jyoti Prakash,et al.  Precision and Relative Recall of Search Engines: A Comparative Study of Google and Yahoo , 2009 .

[9]  Remigiusz Sapa,et al.  Information seeking behaviour of mathematicians: scientists and students , 2014, Inf. Res..

[10]  Sarantos Kapidakis,et al.  Information Seeking and Searching Habits of Greek Physicists and Astronomers: A Case Study of Undergraduate Students , 2013 .

[11]  D. Moher,et al.  Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. , 2010, International journal of surgery.

[12]  A. Carvalho,et al.  Repetitive Transcranial Magnetic Stimulation for the Acute Treatment of Major Depressive Episodes: A Systematic Review With Network Meta-analysis , 2017, JAMA psychiatry.

[13]  Enrique Orduña-Malea,et al.  Methods for estimating the size of Google Scholar , 2014, Scientometrics.

[14]  Thomas W. Conkling,et al.  Google Scholar’s Coverage of the Engineering Literature: An Empirical Study , 2008 .

[15]  David Bawden,et al.  Is Google enough? Comparison of an internet search engine with academic library resources , 2005, Aslib Proc..

[16]  E. Riboli,et al.  Fruit and vegetable intake and the risk of cardiovascular disease, total cancer and all-cause mortality—a systematic review and dose-response meta-analysis of prospective studies , 2017, International journal of epidemiology.

[17]  Mike Thelwall,et al.  Google Scholar, Web of Science, and Scopus: a systematic comparison of citations in 252 subject categories , 2018, J. Informetrics.

[18]  P. Kirschner,et al.  The myths of the digital native and the multitasker , 2017 .

[19]  Andrew S. Pullin,et al.  ROSES RepOrting standards for Systematic Evidence Syntheses: pro forma, flow-diagram and descriptive summary of the plan and conduct of environmental systematic reviews and systematic maps , 2018, Environmental Evidence.

[20]  J. Ioannidis,et al.  Comparative efficacy and acceptability of 21 antidepressant drugs for the acute treatment of adults with major depressive disorder: a systematic review and network meta-analysis , 2018, The Lancet.

[21]  Adrian F Hernandez,et al.  Cardiovascular outcomes with glucagon-like peptide-1 receptor agonists in patients with type 2 diabetes: a meta-analysis. , 2017, The lancet. Diabetes & endocrinology.

[22]  N. Foster,et al.  Veterinary students' usage and perception of video teaching resources , 2011, BMC medical education.

[23]  Biddy Casselden,et al.  An exploration into the information-seeking behaviours of engineers and scientists , 2019, J. Libr. Inf. Sci..

[24]  Shannon Robalino,et al.  Systematic searching for environmental evidence using multiple tools and sources , 2017, Environmental Evidence.

[25]  D. Grant Campbell,et al.  "I still like Google": University student perceptions of searching OPACs and the web , 2005, ASIST.

[26]  E. Cerin,et al.  Built environmental correlates of older adults’ total physical activity and walking: a systematic review and meta-analysis , 2017, International Journal of Behavioral Nutrition and Physical Activity.

[27]  Gwan-Su Yi,et al.  Finding type 2 diabetes causal single nucleotide polymorphism combinations and functional modules from genome-wide association data , 2013, BMC Medical Informatics and Decision Making.

[28]  Ronald N. Kostoff,et al.  CAB: Citation-Assisted Background , 2005, Scientometrics.

[29]  Henk F. Moed,et al.  Suitability of Google Scholar as a source of scientific information and as a source of data for scientific evaluation - Review of the Literature , 2017, J. Informetrics.

[30]  D. Eden Replication, Meta-Analysis, Scientific Progress, and AMJ's Publication Policy , 2002 .

[31]  Nima Zamiri,et al.  Mortality and morbidity in acutely ill adults treated with liberal versus conservative oxygen therapy (IOTA): a systematic review and meta-analysis , 2018, The Lancet.

[32]  Stéfan Jacques Darmoni,et al.  Is the coverage of google scholar enough to be used alone for systematic reviews , 2013, BMC Medical Informatics and Decision Making.

[33]  Els Kuiper,et al.  Students' use of Web literacy skills and strategies: searching, reading and evaluating Web information , 2008, Inf. Res..

[34]  Neal R Haddaway,et al.  The Role of Google Scholar in Evidence Reviews and Its Applicability to Grey Literature Searching , 2015, PloS one.

[35]  Sandy Oliver,et al.  Techniques for identifying cross‐disciplinary and ‘hard‐to‐detect’ evidence for systematic review , 2014, Research synthesis methods.

[36]  Gretchen A. Stevens,et al.  Magnitude, temporal trends, and projections of the global prevalence of blindness and distance and near vision impairment: a systematic review and meta-analysis. , 2017, The Lancet. Global health.

[37]  J. Burnham Scopus database: a review , 2006, Biomedical digital libraries.

[38]  Ali Sunyaev,et al.  If you want your research done right, do you have to do it all yourself? Developing design principles for systematic literature search systems , 2017 .

[39]  Lei Wang,et al.  Three options for citation tracking: Google Scholar, Scopus and Web of Science , 2006, Biomedical digital libraries.

[40]  Dean Giustini,et al.  Comparing the coverage, recall, and precision of searches for 120 systematic reviews in Embase, MEDLINE, and Google Scholar: a prospective study , 2016, Systematic Reviews.

[41]  Abdullah Abrizah,et al.  Where and how early career researchers find scholarly information , 2017, Learn. Publ..

[42]  Bradley M. Hemminger,et al.  A study of factors that affect the information-seeking behavior of academic scientists , 2012, J. Assoc. Inf. Sci. Technol..

[43]  Dorota Glowacka,et al.  Is exploratory search different? A comparison of information search behavior for exploratory and lookup tasks , 2016, J. Assoc. Inf. Sci. Technol..

[44]  Mike Thelwall,et al.  Search engine coverage bias: evidence and possible causes , 2004, Inf. Process. Manag..

[45]  Matthew E Falagas,et al.  Comparison of PubMed, Scopus, Web of Science, and Google Scholar: strengths and weaknesses , 2007, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[46]  Ryen W. White,et al.  Exploratory Search: Beyond the Query-Response Paradigm , 2009, Exploratory Search: Beyond the Query-Response Paradigm.

[47]  Wichor Matthijs Bramer,et al.  Variation in number of hits for complex searches in Google Scholar. , 2016, Journal of the Medical Library Association : JMLA.

[48]  Peder Olesen Larsen,et al.  The rate of growth in scientific publication and the decline in coverage provided by Science Citation Index , 2010, Scientometrics.

[49]  Dawn Stacey,et al.  Decision aids for people facing health treatment or screening decisions. , 2009, The Cochrane database of systematic reviews.

[50]  Martin Offringa,et al.  Systematic reviews and meta-analysis. , 2015, Seminars in fetal & neonatal medicine.

[51]  Claire Stansfield,et al.  Methods for documenting systematic review searches: a discussion of common issues , 2014, Research synthesis methods.

[52]  Melissa L Rethlefsen,et al.  Librarian co-authors correlated with higher quality reported search strategies in general internal medicine systematic reviews. , 2015, Journal of clinical epidemiology.

[53]  Péter Jacsó,et al.  Google Scholar revisited , 2008, Online Inf. Rev..

[54]  Dean Giustini,et al.  Google Scholar is not enough to be used alone for systematic reviews , 2013, Online journal of public health informatics.

[55]  Gisela Taschner Goldenstein Megatrends 2000: ten new directions for the 1990's , 1990 .

[56]  HjørlandBirger Classical databases and knowledge organization , 2015 .

[57]  Martin Boeker,et al.  Google Scholar as replacement for systematic literature searches: good relative recall and precision are not enough , 2013, BMC Medical Research Methodology.

[58]  Lokman I. Meho,et al.  Impact of data sources on citation counts and rankings of LIS faculty: Web of science versus scopus and google scholar , 2007, J. Assoc. Inf. Sci. Technol..

[59]  Amanda Spink,et al.  How are we searching the World Wide Web? A comparison of nine search engine transaction logs , 2006, Inf. Process. Manag..

[60]  Julia H. Littell,et al.  Conceptual and practical classification of research reviews and other evidence synthesis products , 2018, Campbell systematic reviews.

[61]  S. M. Shafi,et al.  Precision and Recall of Five Search Engines for Retrieval of Scholarly Information in the Field of Biotechnology , 2005, Webology.

[62]  Deborah Meert,et al.  Impact of librarians on reporting of the literature searching component of pediatric systematic reviews. , 2017, Journal of the Medical Library Association : JMLA.

[63]  Helen Georgas,et al.  Google vs. the Library (Part II): Student Search Patterns and Behaviors when Using Google and a Federated Search Tool , 2014 .

[64]  Barrie Gunter,et al.  The Google generation: the information behaviour of the researcher of the future , 2008, Aslib Proc..

[65]  Martin P. Brändle,et al.  The coverage of Microsoft Academic: analyzing the publication output of a university , 2017, Scientometrics.

[66]  Silvia Salini,et al.  Assessing the reliability and validity of Google Scholar indicators : The case of social sciences in Italy , 2018 .

[67]  Anne-Wil Harzing,et al.  A longitudinal study of Google Scholar coverage between 2012 and 2013 , 2013, Scientometrics.

[68]  Graham K. Rand,et al.  Quantitative Applications in the Social Sciences , 1983 .

[69]  Péter Jacsó,et al.  Academic Search Engines: A Quantitative Outlook , 2015, Online Inf. Rev..

[70]  Abbe Mowshowitz,et al.  Measuring search engine bias , 2005, Inf. Process. Manag..

[71]  Michael Gusenbauer,et al.  Google Scholar to overshadow them all? Comparing the sizes of 12 academic search engines and bibliographic databases , 2018, Scientometrics.

[72]  J. Higgins,et al.  Cochrane Handbook for Systematic Reviews of Interventions , 2010, International Coaching Psychology Review.

[73]  José Luis Ortega Other academic search engines , 2014 .

[74]  Daniel F. McAuley,et al.  High-volume haemofiltration for sepsis (Protocol) , 2009 .

[75]  C. S. Green,et al.  Meta-Analysis of Action Video Game Impact on Perceptual, Attentional, and Cognitive Skills , 2018, Psychological bulletin.

[76]  Dwayne Van Eerd,et al.  Searching for grey literature for systematic reviews: challenges and benefits , 2014, Research synthesis methods.

[77]  Jonathan B. Koffel Use of Recommended Search Strategies in Systematic Reviews and the Impact of Librarian Involvement: A Cross-Sectional Survey of Recent Authors , 2015, PloS one.

[78]  Judit Bar-Ilan,et al.  On the overlap, the precision and estimated recall of search engines. A case study of the query “Erdos” , 1998, Scientometrics.

[79]  Marcia J. Bates,et al.  Encyclopedia of Library and Information Sciences , 2009 .

[80]  Hao-hua Chu,et al.  Search En-gines for the World Wide Web: A Compara-tive Study and Evaluation Methodology , 1996 .

[81]  Bradley M. Hemminger,et al.  Information seeking behavior of academic scientists , 2007, J. Assoc. Inf. Sci. Technol..

[82]  Morwenna Rogers,et al.  A checklist to assess database-hosting platforms for designing and running searches for systematic reviews. , 2014, Health information and libraries journal.

[83]  Eva Stowers,et al.  Why not just Google it? An assessment of information literacy skills in a biomedical science curriculum , 2011, BMC medical education.

[84]  Birger Hjørland,et al.  Classical databases and knowledge organization: A case for boolean retrieval and human decision‐making during searches , 2015, J. Assoc. Inf. Sci. Technol..

[85]  Elena Gorbacheva,et al.  Achieving Rigor in Literature Reviews: Insights from Qualitative Data Analysis and Tool-Support , 2015, Commun. Assoc. Inf. Syst..

[86]  Peter Ingwersen,et al.  Cognitive Perspectives of Information Retrieval Interaction: Elements of a Cognitive IR Theory , 1996, J. Documentation.

[87]  Margaret Sampson,et al.  Inquisitio validus Index Medicus: A simple method of validating MEDLINE systematic review searches , 2011, Research synthesis methods.

[88]  Madian Khabsa,et al.  Digital commons , 2020, Internet Policy Rev..

[89]  Alison Bethel,et al.  Locating qualitative studies in dementia on MEDLINE, EMBASE, CINAHL, and PsycINFO: A comparison of search strategies , 2018, Research synthesis methods.

[90]  Oscar Pastor,et al.  An activity-theory-based model to analyse Web application requirements , 2008, Inf. Res..

[91]  Eve E. Hoggan,et al.  Information-seeking behaviors of computer scientists: Challenges for electronic literature search tools , 2013, ASIST.

[92]  Dean Giustini,et al.  The comparative recall of Google Scholar versus PubMed in identical searches for biomedical systematic reviews: a review of searches used in systematic reviews , 2013, Systematic Reviews.