A statistical analysis of the web presences of European life sciences research teams

Web links have been used for around ten years to explore the online impact of academic information and information producers. Nevertheless, few studies have attempted to relate link counts to relevant offline attributes of the owners of the targeted Web sites, with the exception of research productivity. This article reports the results of a study to relate site inlink counts to relevant owner characteristics for over 400 European life-science research group Web sites. The analysis confirmed that research-group size and Web-presence size were important for attracting Web links, although research productivity was not. Little evidence was found for significant influence of any of an array of factors, including research-group leader gender and industry connections. In addition, the choice of search engine for link data created a surprising international difference in the results, with Google perhaps giving unreliable results. Overall, the data collection, statistical analysis and results interpretation were all complex and it seems that we still need to know more about search engines, hyperlinks, and their function in science before we can draw conclusions on their usefulness and role in the canon of science and technology indicators. © 2008 Wiley Periodicals, Inc.

[1]  Mike Thelwall,et al.  Methodologies for crawler based Web surveys , 2002, Internet Res..

[2]  Yu Xie,et al.  Explaining Sex Differences in Publication Productivity among Postsecondary Faculty , 2003 .

[3]  Ronald Rousseau,et al.  Daily time series of common single word searches in AltaVista and NorthernLight , 1998 .

[4]  Jenny Fry,et al.  Studying the Scholarly web: How disciplinary culture shapes online representations , 2006 .

[5]  S. Lawrence Free online availability substantially increases a paper's impact , 2001, Nature.

[6]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[7]  Robert E. Kraut,et al.  Patterns of contact and communication in scientific research collaboration , 1990, CSCW '88.

[8]  Mike Thelwall,et al.  The connection between the research of a university and counts of links to its web pages: An investigation based upon a classification of the relationships of pages to the research of the host university , 2003, J. Assoc. Inf. Sci. Technol..

[9]  Jonathon N. Cummings,et al.  Collaborative Research Across Disciplinary and Organizational Boundaries , 2005 .

[10]  Mike Thelwall,et al.  National and international university departmental Web site interlinking , 2005, Scientometrics.

[11]  Judit Bar-Ilan,et al.  The use of web search engines in information science research , 2005, Annu. Rev. Inf. Sci. Technol..

[12]  Mike Thelwall,et al.  Three target document range metrics for university web sites , 2003, J. Assoc. Inf. Sci. Technol..

[13]  Paul Nieuwenhuysen,et al.  Internet search engines - fluctuations in document accessibility , 2001, J. Documentation.

[14]  Mike Thelwall,et al.  Do the Web sites of higher rated scholars have significantly more online impact? , 2004, J. Assoc. Inf. Sci. Technol..

[15]  Sara Kiesler,et al.  KDI INITIATIVE: MULTIDISCIPLINARY SCIENTIFIC COLLABORATIONS , 2003 .

[16]  Mike Thelwall,et al.  Conceptualizing documentation on the Web: An evaluation of different heuristic-based models for counting links between university Web sites , 2002, J. Assoc. Inf. Sci. Technol..

[17]  Andrei Z. Broder,et al.  Graph structure in the Web , 2000, Comput. Networks.

[18]  Howard Rosenbaum,et al.  Can search engines be used as tools for web-link analysis? A critical view , 1999, J. Documentation.

[19]  Franz Barjak,et al.  Research productivity in the internet era , 2006, Scientometrics.

[20]  M. Thelwall,et al.  U.S. academic departmental Web-site interlinking in the United States Disciplinary differences , 2003 .

[21]  Dag W. Aksnes,et al.  Scientific Productivity and Group Size: A Bibliometric Analysis of Norwegian Microbiological Research , 2004, Scientometrics.

[22]  Mike Thelwall,et al.  Hyperlinks as a data source for science mapping , 2004, J. Inf. Sci..

[23]  Mike Thelwall,et al.  Why Do Web Sites from Different Academic Subjects Interlink? , 2003, J. Inf. Sci..

[24]  Mike Thelwall,et al.  Web Impact Factors for Australasian universities , 2002, Scientometrics.

[25]  Peter Ingwersen,et al.  The calculation of web impact factors , 1998, J. Documentation.

[26]  George A. Barnett,et al.  Hyperlink-affiliation network structure of top web sites: Examining affiliates with hyperlink in Korea , 2002, J. Assoc. Inf. Sci. Technol..

[27]  Chaomei Chen,et al.  How did university departments interweave the Web: A study of connectivity and underlying factors , 1998, Interact. Comput..

[28]  C. Lee Giles,et al.  Accessibility of information on the web , 1999, Nature.

[29]  Brian Uzzi,et al.  Athena unbound: Barriers to women in academic science and engineering , 1992 .

[30]  Pravin K. Trivedi,et al.  Regression Analysis of Count Data , 1998 .

[31]  Judit Bar-Ilan,et al.  A microscopic link analysis of academic institutions within a country — the case of Israel , 2004, Scientometrics.

[32]  Mike Thelwall,et al.  Motivations for academic web site interlinking: evidence for the Web as a novel source of information on informal scholarly communication , 2003, J. Inf. Sci..

[33]  Mike Thelwall,et al.  Evidence for the existence of geographic trends in university Web site interlinking , 2002, J. Documentation.

[34]  Paula E. Stephan,et al.  Scientific Teams and Institution Collaborations: Evidence from U.S. Universities, 1981-1999 , 2004 .

[35]  R. Rousseau Why am I not cited or, Why are multi-authored papers more cited than others? , 1992 .

[36]  Mike Thelwall,et al.  What is this link doing here? Beginning a fine-grained process of identifying reasons for academic hyperlink creation , 2003, Inf. Res..

[37]  Mike Thelwall,et al.  Which academic subjects have most online impact? A pilot study and a new classification process , 2003, Online Inf. Rev..

[38]  Judit Bar-Ilan,et al.  Search engine results over time-a case study on search engine stability , 1998 .

[39]  Judit Bar-Ilan,et al.  What do we know about links and linking? A framework for studying links in academic environments , 2005, Inf. Process. Manag..

[40]  Katarina Prpic,et al.  Gender and productivity differentials in science , 2004, Scientometrics.

[41]  Mike Thelwall,et al.  Extracting macroscopic information from Web links , 2001, J. Assoc. Inf. Sci. Technol..

[42]  B. J. Fogg,et al.  What makes Web sites credible?: a report on a large quantitative study , 2001, CHI.

[43]  Ziming Liu,et al.  Perceptions of credibility of scholarly information on the web , 2004, Inf. Process. Manag..

[44]  Mike Thelwall,et al.  Patterns of national and international Web inlinks to US academic departments: An analysis of disciplinary variations , 2004, Scientometrics.

[45]  Lada A. Adamic,et al.  Power-Law Distribution of the World Wide Web , 2000, Science.

[46]  Mike Thelwall,et al.  The relationship between the WIFs or inlinks of Computer Science Departments in UK and their RAE ratings or research productivities in 2001 , 2003, Scientometrics.

[47]  David M. Pennock,et al.  Winners don't take all: Characterizing the competition for links on the web , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[48]  Gaston Heimeriks,et al.  Mapping communication and collaboration in heterogeneous research networks , 2003, Scientometrics.

[49]  Mike Thelwall,et al.  Which factors explain the Web impact of scientists' personal homepages? , 2007 .

[50]  Mike Thelwall,et al.  Search engine coverage bias: evidence and possible causes , 2004, Inf. Process. Manag..

[51]  Mike Thelwall,et al.  Link Analysis: An Information Science Approach , 2004 .

[52]  Mike Thelwall,et al.  An initial exploration of the link relationship between UK university Web sites , 2002, Aslib Proc..

[53]  Anne Beaulieu,et al.  Textured Connectivity: an ethnographic approach to understanding the timescape of hyperlinks , 2006 .

[54]  Gaston Heimeriks,et al.  Analyzing hyperlinks networks: The meaning of hyperlink based indicators of knowledge production , 2006 .

[55]  Mike Thelwall Interpreting social science link analysis research: A theoretical framework , 2006 .

[56]  Jon Kleinberg,et al.  Authoritative sources in a hyperlinked environment , 1999, SODA '98.

[57]  Thomas J. Allen,et al.  Managing the flow of technology: technology transfer and the dissemination of technological informat , 1977 .

[58]  Philipp Mayr,et al.  Google Web APIs - an Instrument for Webometric Analyses? , 2006, ArXiv.

[59]  Mike Thelwall,et al.  National and international university departmental Web site interlinking , 2005, Scientometrics.