Comparing test searches in PubMed and Google Scholar.

Google Scholar has been met with both enthusiasm and criticism since its introduction in 2004. This search engine provides a simple way to access “peer-reviewed papers, theses, books, abstracts, and articles from academic publishers' sites, professional societies, preprint repositories, universities and other scholarly organizations” [1]. An obvious strength of Google Scholar is its intuitive interface, as the main search engine interface consists of a simple query box. In contrast, databases, such as PubMed, utilize search interfaces that offer a greater variety of advanced features. These additional features, while powerful, often lead to a complexity that may require a substantial investment of time to master. It has been observed that Google Scholar may allow searchers to “find some resources they can use rather than be frustrated by a database's search screen” [2]. Some even feel that “Google Scholar's simplicity may eventually consume PubMed” [3]. Along with ease of use, Google Scholar carries the familiar “Google” brand name. As Kennedy and Price so aptly stated, “College students AND professors might not know that library databases exist, but they sure know Google” [4]. The familiarity of Google may allow librarians and educators to ease students into the scholarly searching process by starting with Google Scholar and eventually moving to more complex systems. Felter noted that “as researchers work with Google Scholar and reach limitations of searching capabilities and options, they may become more receptive to other products” [5]. Google Scholar is also thought to provide increased access to gray literature [2], as it retrieves more than journal articles and includes preprint archives, conference proceedings, and institutional repositories [6]. Google Scholar also includes links to the online collections of some academic libraries. Including these access points in Google Scholar retrieval sets may ultimately help more users reach more of their own institution's subscriptions [7]. While its advantages are substantial, Google Scholar is not without flaws. The shortcomings of the system and its search interface have been well documented in the literature and include lack of reliable advanced search functions, lack of controlled vocabulary, and issues regarding scope of coverage and currency. Table 1 summarizes some of the reported criticisms of Google Scholar. Table 1 Criticisms of Google Scholar Vine found that while Google Scholar pulls in data from PubMed, many PubMed records are missing [20], and that Google Scholar also lacks features available in MEDLINE [12]. Others have noted that Google Scholar should not be the first or sole choice when searching for patient care information, clinical trials, or literature reviews [23,24]. Thorough review and testing of Google Scholar, being an approach similar to that used to evaluate licensed resources, is necessary to better understand its strengths and limitations. As Jacso states, “professional searchers must do sample test searches and correctly interpret the results to corroborate claims and get factual information about databases” [18]. This paper compares and contrasts a variety of test searches in PubMed and Google Scholar to gain a better understanding of Google Scholar's searching capabilities.

[1]  Anjo Anjewierden,et al.  Detecting knowledge flows in weblogs , 2005 .

[2]  S. Herring,et al.  Women and Children Last: The Discursive Construction of Weblogs , 2004 .

[3]  G. W. Milligan,et al.  An examination of the effect of six types of error perturbation on fifteen clustering algorithms , 1980 .

[4]  Barbara M Koehler,et al.  Scholarly communications program: force for change , 2006, Biomedical digital libraries.

[5]  A. Rubin,et al.  Predictors of Internet Use , 2000 .

[6]  Paul E. Green,et al.  Numerical Taxonomy in Marketing Analysis: A Review Article , 1968 .

[7]  Steve Cayzer,et al.  Semantic blogging and decentralized knowledge management , 2004, CACM.

[8]  Erinn E Aspinall,et al.  The National Institutes of Health (NIH) Policy on Enhancing Public Access: tracking institutional contribution rates. , 2006, Journal of the Medical Library Association : JMLA.

[9]  Joann M. Wleklinski,et al.  Studying google scholar: Wall to wall coverage? , 2005 .

[10]  J. D. Johnson Cancer-Related Information Seeking , 1997 .

[11]  Julie Hillan Physician use of patient-centered weblogs and online journals. , 2003, Clinical medicine & research.

[12]  Eugene Barsky,et al.  A look at Google Scholar, PubMed, and Scirus: comparisons and recommendations , 2005 .

[13]  G. A. Mishne,et al.  Expiriments with mood classification in blog posts , 2005, SIGIR 2005.

[14]  Frederick J. Friend,et al.  Google Scholar: potentially good for users of academic information , 2006 .

[15]  R. Steinbrook Searching for the right search--reaching the medical literature. , 2006, The New England journal of medicine.

[16]  Susan Gardner,et al.  Gaga over Google? Scholar in the Social Sciences , 2005 .

[17]  Jim Giles,et al.  Science in the web age: Start your engines , 2005, Nature.

[18]  R. Greg,et al.  Notess. Scholarly Web Searching : Google Scholar and Scirus , 2005 .

[19]  Michael D. Cooper,et al.  Using clustering techniques to detect usage patterns in a Web-based information system , 2001, J. Assoc. Inf. Sci. Technol..

[20]  Douglas Kellner,et al.  New Media and Internet Activism: From the ‘Battle of Seattle’ to Blogging , 2004, New Media Soc..

[21]  Jim Henderson,et al.  Google Scholar: A source for clinicians? , 2005, Canadian Medical Association Journal.

[22]  Kamran Abbasi,et al.  Simplicity and complexity in health care: what medicine can learn from Google and iPod. , 2005, Journal of the Royal Society of Medicine.

[23]  Dean Giustini,et al.  How Google is changing medicine , 2005, BMJ : British Medical Journal.

[24]  P. Jacsó As we may search : Comparison of major features of the Web of Science, Scopus, and Google Scholar citation-based and citation-enhanced databases , 2005 .

[25]  Laura M. Felter,et al.  Google scholar, scirus, and the scholarly search revolution , 2005 .

[26]  Sherrie S. Bergman The scholarly communication movement: highlights and recent developments , 2006 .

[27]  Jeffrey M Burns,et al.  Exploring new ways of publishing: a library-faculty partnership. , 2003, Journal of the Medical Library Association : JMLA.

[28]  John McKechnie,et al.  Modelling information seeking behaviour of AEC professionals on online technical information resources , 2003, J. Inf. Technol. Constr..

[29]  Jana Bradley,et al.  Modeling public health interventions for improved access to the gray literature. , 2005, Journal of the Medical Library Association : JMLA.

[30]  Michelle Gumbrecht,et al.  Blogs as “Protected Space” , 2004 .

[31]  Marilyn Domas White,et al.  Questioning Behavior on a Consumer Health Electronic List , 2000, The Library Quarterly.

[32]  G. Mishne Experiments with Mood Classification in , 2005 .

[33]  R. Mojena,et al.  Hierarchical Grouping Methods and Stopping Rules: An Evaluation , 1977, Comput. J..

[34]  Sarah Barbara Watstein,et al.  Google Scholar™ and libraries: point/counterpoint , 2005 .

[35]  Kristin Yiotis,et al.  The Open Access Initiative: A New Paradigm for Scholarly Communications , 2005 .

[36]  Richard K Johnson Will Research Sharing Keep Pace with the Internet? , 2006, The Journal of Neuroscience.