Toward a basic framework for webometrics

In this article, we define webometrics within the framework of informetric studies and bibliometrics, as belonging to library and information science, and as associated with cybermetrics as a generic subfield. We develop a consistent and detailed link typology and terminology, and we make explicit the distinction among different Web node levels when applying the proposed conceptual framework. Building on this, we propose a novel diagram notation for representing and investigating link structures between Web nodes in webometric analyses. We warn against taking the analogy between citation analyses and link analyses too far.
