Best Finite Approximations of Benford’s Law

For arbitrary Borel probability measures with compact support on the real line, the best finitely supported approximations are characterized relative to three familiar probability metrics (Lévy, Kantorovich, and Kolmogorov), for any given number of atoms, and allowing for additional constraints on the weights or positions of atoms. As an application, best (constrained or unconstrained) approximations are identified for Benford’s Law (the logarithmic distribution of significands) and other familiar distributions. The results complement and extend known facts in the literature; they also provide new rigorous benchmarks against which to evaluate empirical observations regarding Benford’s Law.
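To make the notion of a best finitely supported approximation concrete, the following minimal sketch treats only the Kolmogorov metric and only the unconstrained case. It relies on a standard fact for continuous distribution functions F, not on any construction quoted from the paper: n equally weighted atoms placed at the quantiles F^{-1}((2i-1)/(2n)) attain Kolmogorov distance 1/(2n), and no n-atom measure can do better. For Benford’s Law the significand has F(t) = log10(t) on [1, 10), so the quantile function is 10^u. The function names are hypothetical and chosen for illustration.

```python
import math

def benford_cdf(t):
    """Distribution function of the significand under Benford's Law."""
    if t < 1:
        return 0.0
    if t >= 10:
        return 1.0
    return math.log10(t)

def equal_weight_atoms(n):
    """Atoms at the Benford quantiles (2i-1)/(2n), i = 1..n (illustrative choice)."""
    return [10 ** ((2 * i - 1) / (2 * n)) for i in range(1, n + 1)]

def kolmogorov_distance(atoms, weights):
    """sup_t |F(t) - G(t)| for Benford F and the discrete measure G.

    Assumes atoms are sorted increasingly and weights sum to 1.
    F is continuous and monotone and G is a step function, so the
    supremum is reached at an atom, from the left or from the right.
    """
    dist = 0.0
    g = 0.0  # value of G just to the left of the current atom
    for x, w in zip(atoms, weights):
        f = benford_cdf(x)
        dist = max(dist, abs(f - g))  # left limit at the atom
        g += w
        dist = max(dist, abs(f - g))  # value at the atom
    return dist

n = 9
atoms = equal_weight_atoms(n)
d = kolmogorov_distance(atoms, [1.0 / n] * n)
print(d, 1 / (2 * n))  # the two values should agree up to rounding
```

The Lévy and Kantorovich metrics, and the variants with prescribed weights or positions, call for different placements and are not covered by this sketch.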
