Applications of the generalized law of Benford to informetric data

In a previous work (Egghe, 2011), the first author showed that Benford's law (describing the logarithmic distribution of the numbers 1, 2, … , 9 as first digits of data in decimal form) is related to the classical law of Zipf with exponent 1. The work of Campanario and Coslado (2011), however, shows that Benford's law does not always fit practical data in a statistical sense. In this article, we use a generalization of Benford's law related to the general law of Zipf with exponent β > 0. Using data from Campanario and Coslado, we apply nonlinear least squares to determine the optimal β and show that this generalized law of Benford fits the data better than the classical law of Benford. © 2012 Wiley Periodicals, Inc.

[1]  Juan Miguel Campanario,et al.  Benford’s law and citations, articles and impact factors of scientific journals , 2011, Scientometrics.

[2]  Yun Q. Shi,et al.  First Digit Law and Its Application to Digital Forensics , 2008, IWDW.

[3]  Steven J. Miller,et al.  Benford’s Law Applied to Hydrology Data—Results and Relevance to Other Geophysical Data , 2007 .

[4]  Bertram C. Brookes Ranking techniques and the empirical log law , 1984, Inf. Process. Manag..

[5]  L. Egghe Power Laws in the Information Production Process: Lotkaian Informetrics , 2005 .

[6]  Leo Egghe A new short proof of Naranan's theorem, explaining Lotka's law and Zipf's law , 2010 .

[7]  Bertram C. Brookes,et al.  Frequency-rank distributions , 1978, J. Am. Soc. Inf. Sci..

[8]  B. C. Brookes The foundations of information science , 1980 .

[9]  Leo Egghe Benford's law is a simple consequence of Zipf's law , 2011 .

[10]  Wei Su,et al.  A generalized Benford's law for JPEG coefficients and its applications in image forensics , 2007, Electronic Imaging.

[11]  Simon Newcomb,et al.  Note on the Frequency of Use of the Different Digits in Natural Numbers , 1881 .

[12]  Alessandro Vespignani,et al.  Explaining the uneven distribution of numbers in nature: the laws of Benford and Zipf , 2001 .

[13]  Lucas Lacasa,et al.  The first-digit frequencies of prime numbers and Riemann zeta zeros , 2009, Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences.