An Investigation of Web Resource Distribution in the Field of Information Science

This study introduces a new methodology to explore Web information distribution. Subject terms extracted from a key journal in the field of information science were employed to conduct Web searches on Google to identify a corpus of Internet domains and associated Web pages to represent the discipline of information science. A Bradford analysis was then applied to the corpus to determine if the scatter of Web pages conformed to a Bradford distribution. The modeling of the collected data to the power law function in LOTKA program indicates a good fit at even 10% significance level. With a binning procedure and least squares fitting, an R square value of 0.987 was obtained. A division of the data according to top level domain category shows different number of domains and domain productivities in different types of domains. Governmental and commercial domains have higher productivity than the educational and organizational domains. However, the difference between governmental and commercial domains and between educational and organizational domains are not significant.

[1]  M. van der Westhuizen The invisible Web , 2001 .

[2]  Edie M. Rasmussen,et al.  Indexing and retrieval for the Web , 2005, Annu. Rev. Inf. Sci. Technol..

[3]  Leo Egghe,et al.  The duality of informetric systems with applications to the empirical laws , 1990, J. Inf. Sci..

[4]  Liwen Vaughan,et al.  Webometrics , 2005, Annu. Rev. Inf. Sci. Technol..

[5]  Mike Thelwall,et al.  Methodologies for crawler based Web surveys , 2002, Internet Res..

[6]  Ronald Rousseau,et al.  Bradford Curves , 1994, Inf. Process. Manag..

[7]  Cristina Faba Pérez,et al.  "Sitation" distributions and Bradford's law in a closed Web space , 2003, J. Documentation.

[8]  Alfred J. Lotka,et al.  The frequency distribution of scientific productivity , 1926 .

[9]  S. Bradford "Sources of information on specific subjects" by S.C. Bradford , 1985 .

[10]  Vincent Larivière,et al.  Self-Selected or Mandated, Open Access Increases Citation Impact for Higher Quality Research , 2010, PloS one.

[11]  Judit Bar-Ilan,et al.  The use of web search engines in information science research , 2005, Annu. Rev. Inf. Sci. Technol..

[12]  Jon Postel,et al.  Domain requirements , 1984, Request for Comments.

[13]  Mary W. Lockett The Bradford Distribution: A Review of the Literature, 1934-1987. , 1989 .

[14]  Birger Hjørland,et al.  Practical potentials of Bradford's law: a critical examination of the received view , 2007, J. Documentation.

[15]  Paul Pedley The invisible Web , 2001 .

[16]  Judit Bar-Ilan,et al.  Data collection methods on the Web for infometric purposes — A review and analysis , 2004, Scientometrics.

[17]  Staša Milojević Power law distributions in information science: Making the case for logarithmic binning , 2010 .

[18]  Andrei Z. Broder,et al.  A Technique for Measuring the Relative Size and Overlap of Public Web Search Engines , 1998, Comput. Networks.

[19]  Albert-László Barabási,et al.  Internet: Diameter of the World-Wide Web , 1999, Nature.

[20]  Ferdinand F. Leimkuhler,et al.  A relationship between Lotka's Law, Bradford's Law, and Zipf's Law , 1986, J. Am. Soc. Inf. Sci..

[21]  Judit Bar-Ilan,et al.  The “mad cow disease”, Usenet Newsgroups and bibliometric laws , 1997, Scientometrics.

[22]  Judit Bar-Ilan,et al.  The Web as an information source on informetrics? A content analysis , 2000, J. Am. Soc. Inf. Sci..

[23]  Cristina Faba-Pérez,et al.  Sitation distributions and Bradford's law in a closed Web space , 2003 .

[24]  Mike Thelwall,et al.  Extracting accurate and complete results from search engines: Case study windows live , 2008, J. Assoc. Inf. Sci. Technol..

[25]  R. Rousseau Sitations: an exploratory study , 1997 .

[26]  K. C. Claffy,et al.  Measuring the Internet , 2004, The Practical Handbook of Internet Computing.

[27]  W. Lehr Measuring the Internet , 2012 .

[28]  Mike Thelwall,et al.  Scholarly Use of the Web: What Are the Key Inducers of Links to Journal Web Sites , 2003, J. Assoc. Inf. Sci. Technol..

[29]  Amanda Spink,et al.  Searching the Web: the public and their queries , 2001 .

[30]  Ferdinand F. Leimkuhler,et al.  A Relationship between Lotka's Law, Bradford's Law, and Zipf's Law. , 1986 .

[31]  Debora Shaw,et al.  Bibliographic and Web citations: What is the difference? , 2003, J. Assoc. Inf. Sci. Technol..

[32]  Documentation , 2006 .

[33]  R. Rousseau,et al.  LOTKA: A program to fit a power law distribution to observed frequency data. , 2000 .

[34]  José Luis Ortega,et al.  Scientific research activity and communication measured with cybermetrics indicators , 2006, J. Assoc. Inf. Sci. Technol..