Methodological Issues of Webometric Studies

The contribution defines webometrics within the framework of informetric studies, bibliometrics, and scientometrics as belonging to library and information science, and associated with cybermetrics as a generic sub-field. It outlines a consistent and detailed link typology and terminology and makes explicit the distinction between the web node levels when using the proposed terminological structures. Secondly, the contribution presents the meaning, methodology and problematic issues of the central webometric analysis types, i.e., Web engine and crawler coverage, quality and sampling issues. It discusses briefly Web Impact Factor and other link analyses. The contribution finally looks into log studies of human Web interaction.

[1]  Owen Thomas,et al.  Webometric analysis of departments of librarianship and information science , 2000, J. Inf. Sci..

[2]  Marc Najork,et al.  On near-uniform URL sampling , 2000, Comput. Networks.

[3]  R. Rousseau Sitations: an exploratory study , 1997 .

[4]  Chaomei Chen,et al.  How did university departments interweave the Web: A study of connectivity and underlying factors , 1998, Interact. Comput..

[5]  Loren H. Rieseberg,et al.  How reliable is science information on the web? , 1999, Nature.

[6]  Lei Cui,et al.  Rating Health Web sites using the principles of Citation Analysis: A Bibliometric Approach , 1999, Journal of medical Internet research.

[7]  Judit Bar-Ilan Search engine results over time-a case study on search engine stability , 1998 .

[8]  Mike Thelwall,et al.  Hyperlink Analyses of the World Wide Web: A Review , 2006, J. Comput. Mediat. Commun..

[9]  Mike Thelwall,et al.  Motivations for academic web site interlinking: evidence for the Web as a novel source of information on informal scholarly communication , 2003, J. Inf. Sci..

[10]  Jon M. Kleinberg,et al.  The Web as a Graph: Measurements, Models, and Methods , 1999, COCOON.

[11]  B. C. Brookes Biblio-, sciento-, infor-metrics?? what are we talking about ? , 1990 .

[12]  Ronald Rousseau,et al.  Daily time series of common single word searches in AltaVista and NorthernLight , 1998 .

[13]  Amanda Spink,et al.  Real life, real users, and real needs: a study and analysis of user queries on the web , 2000, Inf. Process. Manag..

[14]  Jean Tague-Sutcliffe,et al.  An Introduction to Informetrics , 1992, Inf. Process. Manag..

[15]  Peter Ingwersen,et al.  The Turn - Integration of Information Seeking and Retrieval in Context , 2005, The Kluwer International Series on Information Retrieval.

[16]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[17]  Susan C. Herring,et al.  Computer-mediated communication on the internet , 2005, Annu. Rev. Inf. Sci. Technol..

[18]  Mike Thelwall The Responsiveness of Search Engine Indexes , 2001 .

[19]  James E. Pitkow,et al.  Characterizing Browsing Strategies in the World-Wide Web , 1995, Comput. Networks ISDN Syst..

[20]  Amanda Spink,et al.  Introduction to the special issue on Web research , 2002, J. Assoc. Inf. Sci. Technol..

[21]  K. C. Claffy,et al.  Measuring the Internet , 2004, The Practical Handbook of Internet Computing.

[22]  Judit Bar-Ilan Methods for measuring search engine performance over time , 2002, J. Assoc. Inf. Sci. Technol..

[23]  Peter Ingwersen,et al.  Informetric analyses on the world wide web: methodological approaches to 'webometrics' , 1997, J. Documentation.

[24]  Blaise Cronin,et al.  Science and Scholarship on the World Wide Web: a North American Perspective , 1996, J. Documentation.

[25]  Mike Thelwall,et al.  Scholarly Use of the Web: What Are the Key Inducers of Links to Journal Web Sites , 2003, J. Assoc. Inf. Sci. Technol..

[26]  Amanda Spink,et al.  Searching the Web: the public and their queries , 2001 .

[27]  Andrei Z. Broder,et al.  Graph structure in the Web , 2000, Comput. Networks.

[28]  Mike Thelwall,et al.  Disciplinary and linguistic considerations for academic Web linking: An exploratory hyperlink mediated study with Mainland China and Taiwan , 2003, Scientometrics.

[29]  Judit Bar-Ilan,et al.  The “mad cow disease”, Usenet Newsgroups and bibliometric laws , 1997, Scientometrics.

[30]  Blaise Cronin,et al.  Bibliometrics and beyond: some thoughts on web-based citation analysis , 2001, J. Inf. Sci..

[31]  Gabriel Pinski,et al.  Citation influence for journal aggregates of scientific publications: Theory, with application to the literature of physics , 1976, Inf. Process. Manag..

[32]  Mike Thelwall,et al.  Extracting macroscopic information from Web links , 2001, J. Assoc. Inf. Sci. Technol..

[33]  Giles,et al.  Searching the world wide Web , 1998, Science.

[34]  Judit Bar-Ilan,et al.  The Web as an information source on informetrics?: a content analysis , 2000 .

[35]  Peter Willett,et al.  Estimating the recall performance of Web search engines , 1997 .

[36]  M. P. Courtois,et al.  Results-ranking in Web search engines : Search Engine Section , 1999 .

[37]  Howard Rosenbaum,et al.  Can search engines be used as tools for web-link analysis? A critical view , 1999, J. Documentation.

[38]  Peter Ingwersen,et al.  Characteristics of scientific Web publications: Preliminary data gathering and analysis , 2004, J. Assoc. Inf. Sci. Technol..

[39]  Josep-Manuel Rodríguez-Gairín Valoración del impacto de la información en Internet: Altavista, el “Citation Index” de la red , 1997 .

[40]  Jon Kleinberg,et al.  Authoritative sources in a hyperlinked environment , 1999, SODA '98.

[41]  C. Lee Giles,et al.  Accessibility of information on the web , 1999, Nature.

[42]  Monika Henzinger,et al.  Analysis of a very large web search engine query log , 1999, SIGF.

[43]  Charles Oppenheim,et al.  The evaluation of WWW search engines , 2000, J. Documentation.

[44]  Mike Thelwall,et al.  The relationship between the WIFs or inlinks of Computer Science Departments in UK and their RAE ratings or research productivities in 2001 , 2003, Scientometrics.

[45]  Rob Kling,et al.  Not Just a Matter of Time: Field Differences and the Shaping of Electronic Media , 1999 .

[46]  Lennart Björneborn Small-world linkage and co-linkage , 2001, HYPERTEXT '01.

[47]  Judit Bar-Ilan,et al.  Data collection methods on the Web for infometric purposes — A review and analysis , 2004, Scientometrics.

[48]  Judit Bar-Ilan,et al.  The life span of a specific topic on the web , 1999, Scientometrics.

[49]  Mike Thelwall,et al.  Search engine coverage bias: evidence and possible causes , 2004, Inf. Process. Manag..

[50]  Alastair Smith,et al.  A Tale of Two Web Spaces: Comparing Sites Using Web Impact Factors. , 1999 .

[51]  Peter Ingwersen,et al.  Perspective of webometrics , 2004, Scientometrics.

[52]  Bernard J. Jansen,et al.  A review of web searching studies and a framework for future research , 2001 .

[53]  Mike Thelwall Web impact factors and search engine coverage , 2000, J. Documentation.

[54]  Mike Thelwall,et al.  A web crawler design for data mining , 2001, J. Inf. Sci..

[55]  Leo Egghe,et al.  Introduction to Informetrics: Quantitative Methods in Library, Documentation and Information Science , 1990 .

[56]  Ramana Rao,et al.  Silk from a sow's ear: extracting usable structures from the Web , 1996, CHI.

[57]  Peter Ingwersen,et al.  Toward a basic framework for webometrics , 2004, J. Assoc. Inf. Sci. Technol..

[58]  Ronald Rousseau,et al.  Social network analysis: a powerful strategy, also for the information sciences , 2002, J. Inf. Sci..

[59]  Peter Ingwersen,et al.  The calculation of web impact factors , 1998, J. Documentation.

[60]  Ray R. Larson,et al.  Bibliometrics of the World Wide Web: An Exploratory Analysis of the Intellectual Structure of Cyberspace , 1996 .

[61]  Yanchun Zhang,et al.  Effectively Finding Relevant Web Pages from Linkage Information , 2003, IEEE Trans. Knowl. Data Eng..