The geography of scientific productivity: scaling in US computer science

Here we extract the geographical addresses of authors in the Citeseer database of computer science papers. We show that the productivity of research centres in the United States follows a power-law regime, apart from the most productive centres for which we do not have enough data to reach definite conclusions. To investigate the spatial distribution of computer science research centres in the United States, we compute the two-point correlation function of the spatial point process and show that the observed power laws do not disappear even when we change the physical representation from geographical space to cartogram space. Our work suggests that the effect of physical location poses a challenge to ongoing efforts to develop realistic models of scientific productivity. We propose that the introduction of a fine scale geography may lead to more sophisticated indicators of scientific output.

[1]  John Weiner,et al.  Letter to the Editor , 1992, SIGIR Forum.

[2]  Michael T. Gastner,et al.  Optimal design of spatial distribution networks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  V. Latora,et al.  Complex networks: Structure and dynamics , 2006 .

[4]  Weimao Ke,et al.  Mapping the diffusion of scholarly knowledge among major U.S. research institutions , 2006, Scientometrics.

[5]  Matthew Zook,et al.  The geography of the Internet industry : venture capital, dot-coms, and local knowledge , 2005 .

[6]  B. Ripley Modelling Spatial Patterns , 1977 .

[7]  Weimao Ke,et al.  Mapping the Diffusion of Information Among Major U.S. Research Institutions , 2005 .

[8]  Klaus Mecke,et al.  Statistical Physics and Spatial Statistics , 2000 .

[9]  M. Dodge Understanding cyberspace cartographies. , 2008 .

[10]  M. Newman Erratum: Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality (Physical Review e (2001) 64 (016132)) , 2006 .

[11]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Sidney I. Resnick,et al.  How to make a Hill Plot , 2000 .

[13]  P H Abelson President reagan, science, and engineering. , 1981, Science.

[14]  A. Barabasi,et al.  Evolution of the social network of scientific collaborations , 2001, cond-mat/0104162.

[15]  D. King The scientific impact of nations , 2004, Nature.

[16]  M. Batty The Geography of Scientific Citation , 2003 .

[17]  M E Newman,et al.  Scientific collaboration networks. I. Network construction and fundamental results. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[18]  V. Plerou,et al.  Similarities between the growth dynamics of university research and of competitive economic activities , 1999, Nature.

[19]  Dietrich Stoyan,et al.  Basic Ideas ofSpatial Statistics , 2000 .

[20]  C. Lee Giles,et al.  Scholarly publishing in the Internet age: a citation analysis of computer science literature , 2001, Inf. Process. Manag..

[21]  C. Lee Giles,et al.  Who gets acknowledged: Measuring scientific contributions through automatic acknowledgment indexing , 2004, Proc. Natl. Acad. Sci. USA.

[22]  Harry Eugene Stanley,et al.  Scaling phenomena in the growth dynamics of scientific output , 2005, J. Assoc. Inf. Sci. Technol..

[23]  Martin Kerscher,et al.  Statistical Analysis of Large-Scale Structure in the Universe , 1999 .

[24]  A. Robinson Elements of Cartography , 1953 .

[25]  Szalay,et al.  A Comparison of Estimators for the Two-Point Correlation Function. , 1999, The Astrophysical journal.

[26]  Henk F. Moed,et al.  Science policy: The business of research , 1999, Nature.

[27]  Harry Eugene Stanley,et al.  Application of statistical physics methods and conceptsto the study of science & technology systems , 2001, Scientometrics.

[28]  Leo Egghe,et al.  Introduction to Informetrics: Quantitative Methods in Library, Documentation and Information Science , 1990 .

[29]  B. Ripley The Second-Order Analysis of Stationary Point Processes , 1976 .

[30]  M. Newman,et al.  From The Cover: Diffusion-based method for producing density-equalizing maps. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Hawoong Jeong,et al.  Modeling the Internet's large-scale topology , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[32]  M. Newman,et al.  Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[33]  L. Glass,et al.  General: Uniform Distribution of Objects in a Homogeneous Field: Cities on a Plain , 1971, Nature.

[34]  Liu Lin-qing Mapping knowledge domains of research with document co-citation , 2005 .