Revealing Relationships among Relevant Climate Variables with Information Theory

A primary objective of the NASA Earth-Sun Exploration Technology Office is to understand the observed Earth climate variability, thus enabling the determination and prediction of the climate's response to both natural and human-induced forcing. We are currently developing a suite of computational tools that will allow researchers to calculate, from data, a variety of information-theoretic quantities such as mutual information, which can be used to identify relationships among climate variables, and transfer entropy, which indicates the possibility of causal interactions. Our tools estimate these quantities along with their associated error bars, the latter of which is critical for describing the degree of uncertainty in the estimates. This work is based upon optimal binning techniques that we have developed for piecewise-constant, histogram-style models of the underlying density functions. Two useful side benefits have already been discovered. The first allows a researcher to determine whether there exist sufficient data to estimate the underlying probability density. The second permits one to determine an acceptable degree of round-off when compressing data for efficient transfer and storage. We also demonstrate how mutual information and transfer entropy can be applied so as to allow researchers not only to identify relations among climate variables, but also to characterize and quantify their possible causal interactions.

[1]  Schreiber,et al.  Measuring information transfer , 2000, Physical review letters.

[2]  John M. Wallace,et al.  Large-scale atmospheric circulation features of warm and cold episodes in the tropical Pacific , 1990 .

[3]  C. Granger Investigating causal relations by econometric models and cross-spectral methods , 1969 .

[4]  W. Rossow,et al.  The International Satellite Cloud Climatology Project (ISCCP): The First Project of the World Climate Research Programme , 1983 .

[5]  W. Rossow,et al.  Advances in understanding clouds from ISCCP , 1999 .

[6]  A. J. Bell THE CO-INFORMATION LATTICE , 2003 .

[7]  K. Knuth Measuring Questions: Relevance and its Relation to Entropy , 2004, physics/0409084.

[8]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[9]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[10]  John M. Wallace,et al.  El Niño events and their relation to the Southern Oscillation: 1925–1986 , 1987 .

[11]  K. Knuth Optimal Data-Based Binning for Histograms , 2006, physics/0605197.

[12]  John M. Wallace,et al.  ENSO seasonality : 1950-78 versus 1979-92 , 1996 .

[13]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[14]  A C C Gibbs,et al.  Data Analysis , 2009, Encyclopedia of Database Systems.

[15]  Kevin H. Knuth,et al.  Lattice duality: The origin of probability and entropy , 2013, Neurocomputing.

[16]  John Skilling,et al.  Data analysis : a Bayesian tutorial , 1996 .

[17]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .