An exploration of climate data using complex networks

To discover patterns in historical data, climate scientists have applied various clustering methods with the goal of identifying regions that share some common climatological behavior. However, past approaches are limited because they consider either only a single time period (snapshot) of multivariate data, or only a single variable, using its time series as a multi-dimensional feature vector. In both cases, potentially useful information may be lost. Moreover, clusters in a high-dimensional data space can be difficult to interpret, prompting the need for a more effective data representation. We address both of these issues by employing a complex network (graph) to represent climate data, a more intuitive model that can be used for analysis while also having a direct mapping to the physical world for interpretation. A cross-correlation function is used to weight network edges, thus respecting the temporal nature of the data, and a community detection algorithm identifies multivariate clusters. Examining networks for consecutive periods allows us to study structural changes over time. We show that communities have a climatological interpretation and that disturbances in structure can be an indicator of climate events (or lack thereof). Finally, we discuss how this model can be applied to the discovery of more complex concepts such as unknown teleconnections, or to the development of multivariate climate indices and predictive insights.
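The network construction summarized above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the cross-correlation function, the lag window, the edge threshold, or the community detection algorithm, so everything here is an assumption. The sketch weights each edge by the maximum absolute Pearson correlation over a small range of lags and keeps only edges above a threshold. Connected components of the thresholded graph serve as a deliberately crude stand-in for a community detection algorithm. All function names, the synthetic series, and the parameter values (`max_lag`, `threshold`) are hypothetical.

```python
import math
import random


def lagged_correlation(x, y, lag):
    """Pearson correlation between x[t] and y[t + lag] on the overlapping samples."""
    if lag >= 0:
        a, b = x[:len(x) - lag], y[lag:]
    else:
        a, b = x[-lag:], y[:len(y) + lag]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((p - mean_a) * (q - mean_b) for p, q in zip(a, b))
    var_a = sum((p - mean_a) ** 2 for p in a)
    var_b = sum((q - mean_b) ** 2 for q in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a constant series carries no correlation information
    return cov / math.sqrt(var_a * var_b)


def edge_weight(x, y, max_lag):
    """Assumed weighting: strongest absolute correlation within +/- max_lag steps."""
    return max(abs(lagged_correlation(x, y, lag))
               for lag in range(-max_lag, max_lag + 1))


def build_network(series, max_lag=2, threshold=0.5):
    """Return a weighted adjacency dict; edges below the threshold are dropped."""
    nodes = list(series)
    graph = {node: {} for node in nodes}
    for i, u in enumerate(nodes):
        for v in nodes[i + 1:]:
            w = edge_weight(series[u], series[v], max_lag)
            if w >= threshold:
                graph[u][v] = w
                graph[v][u] = w
    return graph


def connected_groups(graph):
    """Connected components, used here only as a stand-in for community detection."""
    seen, groups = set(), []
    for start in graph:
        if start in seen:
            continue
        stack, group = [start], set()
        while stack:
            node = stack.pop()
            if node in group:
                continue
            group.add(node)
            stack.extend(graph[node])
        seen |= group
        groups.append(group)
    return groups


def make_demo_series(n=120, seed=0):
    """Synthetic monthly series: two noisy regimes with different periods."""
    rng = random.Random(seed)

    def wave(period, shift):
        return [math.sin(2 * math.pi * (t - shift) / period) + rng.gauss(0, 0.1)
                for t in range(n)]

    # A2 lags A1 by one step and B2 lags B1 by two steps.
    return {"A1": wave(12, 0), "A2": wave(12, 1),
            "B1": wave(7, 0), "B2": wave(7, 2)}


if __name__ == "__main__":
    demo_graph = build_network(make_demo_series())
    for group in connected_groups(demo_graph):
        print(sorted(group))
```

In this toy setting each node is a single synthetic series; in the multivariate setting the abstract describes, a node would presumably correspond to a variable at a location. Repeating the construction on consecutive time windows and comparing the resulting groups would be one way to study structural change over time, with a proper community detection method substituted for the connected-components step.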
