Using Causal Discovery to Track Information Flow in Spatio-Temporal Data - A Testbed and Experimental Results Using Advection-Diffusion Simulations

Causal discovery algorithms based on probabilistic graphical models have emerged in geoscience applications as a tool for identifying and visualizing dynamical processes. The key idea is to learn the structure of a graphical model from observed spatio-temporal data; that structure indicates information flow, and thus pathways of interaction, in the observed physical system. Studying those pathways allows geoscientists to learn subtle details about the underlying dynamical mechanisms governing our planet. Initial studies applying this approach to real-world atmospheric data have shown great potential for scientific discovery. However, no ground truth was available in these initial studies, so the resulting graphs were evaluated only by whether a domain expert judged them physically plausible. This paper seeks to fill this gap. We develop a testbed that emulates two dynamical processes dominant in many geoscience applications, namely advection and diffusion, on a 2D grid. We then apply the causal-discovery-based information-tracking algorithms to the simulation data to study how well the algorithms work in different scenarios and to gain a better understanding of the physical meaning of the resulting graphs, in particular of instantaneous connections. We make all data sets used in this study available to the community as a benchmark.

Keywords: Information flow, graphical model, structure learning, causal discovery, geoscience.
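To make the idea of an advection-diffusion testbed on a 2D grid concrete, the following is a minimal sketch and not the simulation used in the paper. It assumes a constant non-negative velocity, periodic boundaries, a first-order upwind scheme for advection, and a five-point Laplacian for diffusion. The names `step` and `simulate` and all parameter values are hypothetical and chosen for illustration only.

```python
def step(field, u, v, kappa, dt=1.0, dx=1.0):
    """Advance a 2D scalar field by one explicit time step.

    Assumptions (illustrative, not from the paper): constant velocities
    u >= 0 (toward increasing column index) and v >= 0 (toward increasing
    row index), periodic boundaries, upwind advection, explicit diffusion.
    """
    ny, nx = len(field), len(field[0])
    new = [[0.0] * nx for _ in range(ny)]
    for i in range(ny):
        for j in range(nx):
            c = field[i][j]
            west = field[i][(j - 1) % nx]
            east = field[i][(j + 1) % nx]
            north = field[(i - 1) % ny][j]
            south = field[(i + 1) % ny][j]
            # Upwind differences: information arrives from the west and north.
            advection = -u * (c - west) / dx - v * (c - north) / dx
            # Five-point Laplacian spreads the quantity to all four neighbors.
            diffusion = kappa * (west + east + north + south - 4.0 * c) / dx**2
            new[i][j] = c + dt * (advection + diffusion)
    return new


def simulate(ny=8, nx=8, steps=20, u=0.5, v=0.0, kappa=0.1):
    """Return the list of fields produced from a single initial pulse."""
    field = [[0.0] * nx for _ in range(ny)]
    field[ny // 2][1] = 1.0  # point source near the western edge
    history = [field]
    for _ in range(steps):
        field = step(field, u, v, kappa)
        history.append(field)
    return history


if __name__ == "__main__":
    history = simulate()
    total = sum(sum(row) for row in history[-1])
    print(f"total mass after {len(history) - 1} steps: {total:.6f}")
```

With these default values the scheme satisfies the usual explicit stability condition (u·dt/dx + v·dt/dx + 4·kappa·dt/dx² ≤ 1), and the periodic boundaries conserve the total amount of the transported quantity. The time series recorded at each grid point of `history` is the kind of spatio-temporal data a structure-learning algorithm could be run on, with the imposed velocity serving as the known direction of information flow.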
