Theory and applications of proper scoring rules

A scoring rule $$S(x; q)$$S(x;q) provides a way of judging the quality of a quoted probability density $$q$$q for a random variable $$X$$X in the light of its outcome $$x$$x. It is called proper if honesty is your best policy, i.e., when you believe $$X$$X has density $$p$$p, your expected score is optimised by the choice $$q = p$$q=p. The most celebrated proper scoring rule is the logarithmic score, $$S(x; q) = -\log {q(x)}$$S(x;q)=-logq(x): this is the only proper scoring rule that is local, in the sense of depending on the density function $$q$$q only through its value at the observed value $$x$$x. It is closely connected with likelihood inference, with communication theory, and with minimum description length model selection. However, every statistical decision problem induces a proper scoring rule, so there is a very wide variety of these. Many of them have additional interesting structure and properties. At a theoretical level, any proper scoring rule can be used as a foundational basis for the theory of subjective probability. At an applied level a proper scoring can be used to compare and improve probability forecasts, and, in a parametric setting, as an alternative tool for inference. In this article we give an overview of some uses of proper scoring rules in statistical inference, including frequentist estimation theory and Bayesian model selection with improper priors.

[1]  G. Brier VERIFICATION OF FORECASTS EXPRESSED IN TERMS OF PROBABILITY , 1950 .

[2]  D. Lindley,et al.  Bayes Estimates for the Linear Model , 1972 .

[3]  J. Besag Statistical Analysis of Non-Lattice Data , 1975 .

[4]  C. Tsallis Possible generalization of Boltzmann-Gibbs statistics , 1988 .

[5]  B. Gidas,et al.  A Variational Method for Estimating the Parameters of MRF from Complete or Incomplete Data , 1993 .

[6]  D. Cox,et al.  Inference and Asymptotics , 1994 .

[7]  A. O'Hagan,et al.  Fractional Bayes factors for model comparison , 1995 .

[8]  J. Berger,et al.  The Intrinsic Bayes Factor for Model Selection and Prediction , 1996 .

[9]  M. C. Jones,et al.  Robust and efficient estimation by minimising a density power divergence , 1998 .

[10]  Paola Sebastiani,et al.  Coherent dispersion criteria for optimal experimental design , 1999 .

[11]  V. Vovk Competitive On‐line Statistics , 2001 .

[12]  B. Ripley,et al.  Robust Statistics , 2018, Wiley Series in Probability and Statistics.

[13]  Aapo Hyvärinen,et al.  Estimation of Non-Normalized Statistical Models by Score Matching , 2005, J. Mach. Learn. Res..

[14]  Steffen L. Lauritzen,et al.  The Geometry of Decision Theory , 2006 .

[15]  A. Dawid The geometry of proper scoring rules , 2007 .

[16]  Aapo Hyvärinen,et al.  Some extensions of score matching , 2007, Comput. Stat. Data Anal..

[17]  S. Lauritzen,et al.  Proper local scoring rules , 2011, 1101.5011.

[18]  Steffen Lauritzen,et al.  PROPER LOCAL SCORING RULES ON DISCRETE SAMPLE SPACES , 2011, 1104.2224.

[19]  Monica Musio,et al.  Estimation of spatial processes using local scoring rules , 2013 .

[20]  M. Parry Multidimensional local scoring rules , 2013 .

[21]  A. Dawid,et al.  Local Scoring Rules: A Versatile Tool for Inference , 2013 .

[22]  A. Dawid,et al.  Minimum Scoring Rule Inference , 2014, 1403.3920.

[23]  Monica Musio,et al.  Bayesian Model Selection Based on Proper Scoring Rules , 2014, 1409.5291.