Computing Entropies With Nested Sampling

The Shannon entropy, and related quantities such as mutual information, can be used to quantify uncertainty and relevance. However, in practice, it can be difficult to compute these quantities for arbitrary probability distributions, particularly if the probability mass functions or densities cannot be evaluated. This paper introduces a computational approach, based on Nested Sampling, to evaluate entropies of probability distributions that can only be sampled. I demonstrate the method on three examples: a simple Gaussian example where the key quantities are available analytically; (ii) an experimental design example about scheduling observations in order to measure the period of an oscillating signal; and (iii) predicting the future from the past in a heavy-tailed scenario.

[1]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[2]  M. Payne,et al.  Determining pressure-temperature phase diagrams of materials , 2015, 1503.03404.

[3]  G. L. Bretthorst Nonuniform sampling: Bandwidth and aliasing , 2001 .

[4]  Clément Walter,et al.  Point process-based Monte Carlo estimation , 2014, Stat. Comput..

[5]  Michael Habeck,et al.  Bayesian evidence and model selection , 2014, Digit. Signal Process..

[6]  Brendon J. Brewer,et al.  Fast Bayesian inference for exoplanet discovery in radial velocity data , 2015 .

[7]  Brendon J. Brewer,et al.  Diffusive nested sampling , 2009, Stat. Comput..

[8]  T. Neumann Probability Theory The Logic Of Science , 2016 .

[9]  Gábor Csányi,et al.  Efficient sampling of atomic configurational spaces. , 2009, The journal of physical chemistry. B.

[10]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[11]  Kevin Knuth,et al.  Toward Question-Asking Machines: The Logic of Questions and the Inquiry Calculus , 2005, AISTATS.

[12]  P. Gregory A Bayesian Analysis of Extrasolar Planet Data for HD 73526 , 2005 .

[13]  A. Lasenby,et al.  polychord: next-generation nested sampling , 2015, 1506.00171.

[14]  Lei Cao,et al.  Combined-chain nested sampling for efficient Bayesian model comparison , 2017, Digit. Signal Process..

[15]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[16]  J. Bernardo Reference Analysis , 2005 .

[17]  Richard J. Morris,et al.  Bayesian Model Comparison and Parameter Inference in Systems Biology Using Nested Sampling , 2014, PloS one.

[18]  Kevin H. Knuth,et al.  Foundations of Inference , 2010, Axioms.

[19]  J. Skilling Nested sampling for general Bayesian computation , 2006 .

[20]  Nassim Nicholas Taleb,et al.  The Black Swan: The Impact of the Highly Improbable , 2007 .

[21]  Zoltán Szabó,et al.  Information theoretical estimators toolbox , 2014, J. Mach. Learn. Res..

[22]  Ariel Caticha,et al.  Lectures on Probability, Entropy, and Statistical Physics , 2008, ArXiv.

[23]  D. Frenkel,et al.  Superposition Enhanced Nested Sampling , 2014, Physical Review X.

[24]  F. Feroz,et al.  MultiNest: an efficient and robust Bayesian inference tool for cosmology and particle physics , 2008, 0809.3437.

[25]  M. Tribus,et al.  Probability theory: the logic of science , 2003 .

[26]  Yuhong Yang,et al.  Information Theory, Inference, and Learning Algorithms , 2005 .

[27]  Marshall F Chalverus,et al.  The Black Swan: The Impact of the Highly Improbable , 2007 .

[28]  Daniel Foreman-Mackey,et al.  DNest4: Diffusive Nested Sampling in C++ and Python , 2016, 1606.03757.