The prevalence of power laws in the citations to scientific papers

This paper presents some preliminary evidence about the existence of power laws representing the upper tail of the citation distributions among 221 scientific sub-fields or Web of Science categories distinguished by Thomson Scientific within the natural and the social sciences. The main finding is that, in a sample consisting of 767,828 articles published in 1998 with a 5-year citation window, in 181 out of 221 sub-fields (representing approximately 77% of the sample of articles) the existence of a power law cannot be rejected. In most sub-fields, the upper tail that can be represented by a power law is small but captures a considerable proportion of the total citations received. The value of the scale parameter is between 2 and 3 for only 21 sub-fields or 6% of the total, greater than 3 for 114 sub-fields or 67%, and greater than 4 for the remaining 46 sub-fields that represent 27% of the total. The estimation of the parameters of the power laws has been done with a novel procedure for citation distributions that is shown to perform better than the standard maximum likelihood methods in the presence of extreme observations. • The authors acknowledge financial support from the Spanish MEC, Grants SEJ2007-67436, SEJ2007-63098 and SEJ200605710. The database of Thomson Scientific (formerly Thomson-ISI; Institute for Scientific Information) has been acquired with funds from Santander Universities Global Division of Banco Santander. This paper is part of the SCIFI-GLOW Collaborative Project supported by the European Commission’s Seventh Research Framework Programme, Contract no. SSH7-CT-2008217436.

[1]  Wolfgang Glänzel,et al.  A new classification scheme of science fields and subfields designed for scientometric evaluation purposes , 2004, Scientometrics.

[2]  G. M. Laslett,et al.  GENERALIZATIONS OF POWER-LAW DISTRIBUTIONS APPLICABLE TO SAMPLED FAULT-TRACE LENGTHS : MODEL CHOICE, PARAMETER ESTIMATION AND CAVEATS , 1999 .

[3]  D J PRICE,et al.  NETWORKS OF SCIENTIFIC PAPERS. , 1965, Science.

[4]  Wolfgang Glänzel,et al.  On the h-index - A mathematical approach to a new measure of publication activity and citation impact , 2006, Scientometrics.

[5]  Loet Leydesdorff Can scientific journals be classified in terms of aggregated journal-journal citation relations using the Journal Citation Reports? , 2006 .

[6]  Michel L. Goldstein,et al.  Problems with fitting to the power-law distribution , 2004, cond-mat/0402322.

[7]  Michael Mitzenmacher,et al.  A Brief History of Generative Models for Power Law and Lognormal Distributions , 2004, Internet Math..

[8]  Henry Small Visualizing science by citation mapping , 1999 .

[9]  Loet Leydesdorff,et al.  Top-down decomposition of the Journal Citation Reportof the Social Science Citation Index: Graph- and factor-analytical approaches , 2004, Scientometrics.

[10]  S. Redner Citation statistics from 110 years of physical review , 2005, physics/0506056.

[11]  Jan Beirlant,et al.  Asymptotics for the Hirsch Index , 2007 .

[12]  Per O. Seglen,et al.  The Skewness of Science , 1992, J. Am. Soc. Inf. Sci..

[13]  B. Enquist,et al.  On estimating the exponent of power-law frequency distributions. , 2008, Ecology.

[14]  H. Bauke Parameter estimation for power-law distributions by maximum likelihood methods , 2007, 0704.1867.

[15]  Mark E. J. Newman,et al.  Power-Law Distributions in Empirical Data , 2007, SIAM Rev..

[16]  D. Sanderson,et al.  Sampling power-law distributions , 1995 .

[17]  S. Redner How popular is your paper? An empirical study of the citation distribution , 1998, cond-mat/9804163.

[18]  D. Sornette,et al.  Stretched exponential distributions in nature and economy: “fat tails” with characteristic scales , 1998, cond-mat/9801293.

[19]  Leo Egghe,et al.  An informetric model for the Hirsch-index , 2006, Scientometrics.

[20]  J. E. Hirsch,et al.  An index to quantify an individual's scientific research output , 2005, Proc. Natl. Acad. Sci. USA.

[21]  Ismael Rafols,et al.  A global map of science based on the ISI subject categories , 2009, J. Assoc. Inf. Sci. Technol..

[22]  Ismael Rafols,et al.  A global map of science based on the ISI subject categories , 2009 .

[23]  Kevin W. Boyack,et al.  Mapping the backbone of science , 2004, Scientometrics.

[24]  M. E. J. Newman,et al.  Power laws, Pareto distributions and Zipf's law , 2005 .