Bibliometrics to webometrics

Bibliometrics has changed out of all recognition since 1958: it has become established as a field, is taught widely in library and information science schools, and is at the core of a number of science evaluation research groups around the world. This was all made possible by the work of Eugene Garfield and his Science Citation Index. This article reviews the distance that bibliometrics has travelled since 1958 by comparing early bibliometrics with current practice, and by giving an overview of a range of recent developments, such as patent analysis, national research evaluation exercises, visualization techniques, new applications, online citation indexes, and the creation of digital libraries. Webometrics, a modern, fast-growing offshoot of bibliometrics, is reviewed in detail. Finally, future prospects are discussed for both bibliometrics and webometrics.
