Measurement of statistical evidence on an absolute scale following thermodynamic principles

Statistical analysis is used throughout biomedical research and elsewhere to assess strength of evidence. We have previously argued that typical outcome statistics (including p values and maximum likelihood ratios) have poor measure-theoretic properties: they can erroneously indicate decreasing evidence as data supporting an hypothesis accumulate; and they are not amenable to calibration, necessary for meaningful comparison of evidence across different study designs, data types, and levels of analysis. We have also previously proposed that thermodynamic theory, which allowed for the first time derivation of an absolute measurement scale for temperature (T), could be used to derive an absolute scale for evidence (E). Here we present a novel thermodynamically based framework in which measurement of E on an absolute scale, for which “one degree” always means the same thing, becomes possible for the first time. The new framework invites us to think about statistical analyses in terms of the flow of (evidential) information, placing this work in the context of a growing literature on connections among physics, information theory, and statistics.

[1]  G. Faè,et al.  The physical review , 1895 .

[2]  Solomon Kullback,et al.  Information Theory and Statistics , 1970, The Mathematical Gazette.

[3]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[4]  E. Jaynes Information Theory and Statistical Mechanics , 1957 .

[5]  D. Whiffen Thermodynamics , 1973, Nature.

[6]  Evgueni A. Haroutunian,et al.  Information Theory and Statistics , 2011, International Encyclopedia of Statistical Science.

[7]  H. Callen Thermodynamics and an Introduction to Thermostatistics , 1988 .

[8]  Veronica J. Vieland,et al.  Measurement of Evidence and Evidence of Measurement , 2011 .

[9]  E. Soofi Principal Information Theoretic Approaches , 2000 .

[10]  X. R. Wang,et al.  Relating Fisher information to order parameters. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Veronica J Vieland,et al.  Thermometers: Something for Statistical Geneticists to Think about , 2006, Human Heredity.

[12]  Peter Beresford,et al.  Where's the evidence? , 2004, Mental health today.

[13]  W. K. BURTON Understanding Thermodynamics , 1966, Nature.

[14]  A. Caticha Relative Entropy and Inductive Inference , 2003, physics/0311093.

[15]  Daniel Zwillinger,et al.  CRC standard mathematical tables and formulae; 30th edition , 1995 .

[16]  Hasok Chang,et al.  Inventing Temperature: Measurement and Scientific Progress , 2004 .

[17]  Jaap Van Brakel,et al.  Foundations of measurement , 1983 .

[18]  Expected Monotonicity – A Desirable Property for Evidence Measures? , 2010, Human Heredity.

[19]  Claude E. Shannon,et al.  The mathematical theory of communication , 1950 .

[20]  W. Beyer CRC Standard Mathematical Tables and Formulae , 1991 .

[21]  Yungui Huang,et al.  Association statistics under the PPL framework , 2010, Genetic epidemiology.

[22]  V. Vieland,et al.  Statistical Evidence: A Likelihood Paradigm , 1998 .

[23]  R. T. Cox The Algebra of Probable Inference , 1962 .

[24]  Jonathan Barzilai,et al.  On the foundations of measurement , 2001, 2001 IEEE International Conference on Systems, Man and Cybernetics. e-Systems and e-Man for Cybernetics in Cyberspace (Cat.No.01CH37236).

[25]  Ehsan S. Soo Principal Information Theoretic Approaches , 2000 .

[26]  S. Frank Natural selection maximizes Fisher information , 2009, Journal of evolutionary biology.

[27]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[28]  V. Vieland,et al.  Accumulating quantitative trait linkage evidence across multiple datasets using the posterior probability of linkage , 2007, Genetic epidemiology.

[29]  R. Callen,et al.  Thermodynamics and an Introduction to Thermostatistics, 2nd Edition , 1985 .

[30]  Veronica J. Vieland 1st Two Laws Sept 2013 , 2013 .

[31]  G. A. Barnard,et al.  THE LOGIC OF STATISTICAL INFERENCE1 , 1972, The British Journal for the Philosophy of Science.

[32]  Ümit V. Çatalyürek,et al.  KELVIN: A Software Package for Rigorous Measurement of Statistical Evidence in Human Genetics , 2011, Human Heredity.

[33]  Jian Huang,et al.  Comparison of ‘Model-Free’ and ‘Model-Based’ Linkage Statistics in the Presence of Locus Heterogeneity: Single Data Set and Multiple Data Set Applications , 2001, Human Heredity.