Testing the significance of spatio-temporal teleconnection patterns

Dipoles represent long distance connections between the pressure anomalies of two distant regions that are negatively correlated with each other. Such dipoles have proven important for understanding and explaining the variability in climate in many regions of the world, e.g., the El Nino climate phenomenon is known to be responsible for precipitation and temperature anomalies over large parts of the world. Systematic approaches for dipole detection generate a large number of candidate dipoles, but there exists no method to evaluate the significance of the candidate teleconnections. In this paper, we present a novel method for testing the statistical significance of the class of spatio-temporal teleconnection patterns called as dipoles. One of the most important challenges in addressing significance testing in a spatio-temporal context is how to address the spatial and temporal dependencies that show up as high autocorrelation. We present a novel approach that uses the wild bootstrap to capture the spatio-temporal dependencies, in the special use case of teleconnections in climate data. Our approach to find the statistical significance takes into account the autocorrelation, the seasonality and the trend in the time series over a period of time. This framework is applicable to other problems in spatio-temporal data mining to assess the significance of the patterns.

[1]  Peter J. Diggle,et al.  Simple Monte Carlo Tests for Spatial Pattern , 1977 .

[2]  John E. Janowiak,et al.  AN investigation of interannual rainfall variability in Africa , 1988 .

[3]  C. Granger Some properties of time series data and their use in econometric model specification , 1981 .

[4]  A. J. Miller,et al.  Factors affecting the detection of trends: Statistical considerations and applications to environmental data , 1998 .

[5]  R. Fisher 014: On the "Probable Error" of a Coefficient of Correlation Deduced from a Small Sample. , 1921 .

[6]  Changbao Wu,et al.  Jackknife, Bootstrap and Other Resampling Methods in Regression Analysis , 1986 .

[7]  Vipin Kumar,et al.  Discovery of climate indices using clustering , 2003, KDD '03.

[8]  Mary C. Brennan,et al.  on the , 1982 .

[9]  J. Wallace,et al.  Teleconnections in the Geopotential Height Field during the Northern Hemisphere Winter , 1981 .

[10]  Gemma C. Garriga,et al.  Randomization Techniques for Graphs , 2009, SDM.

[11]  R. Reynolds,et al.  The NCEP/NCAR 40-Year Reanalysis Project , 1996, Renewable Energy.

[12]  C. Granger,et al.  Co-integration and error correction: representation, estimation and testing , 1987 .

[13]  Nagiza F. Samatova,et al.  Data Guided Discovery of Dynamic Climate Dipoles , 2011, CIDU.

[14]  Aristides Gionis,et al.  Assessing data mining results via swap randomization , 2007, TKDD.

[15]  Vipin Kumar,et al.  Discovering Dynamic Dipoles in Climate Data , 2011, SDM.

[16]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[17]  H. Storch,et al.  Statistical Analysis in Climate Research , 2000 .

[18]  Paulo J. Azevedo,et al.  Time Series Motifs Statistical Significance , 2011, SDM.

[19]  W. Briggs Statistical Methods in the Atmospheric Sciences , 2007 .

[20]  Tim Oates,et al.  PERUSE: An unsupervised algorithm for finding recurring patterns in time series , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[21]  Anthony Waldron,et al.  Null Models of Geographic Range Size Evolution Reaffirm Its Heritability , 2007, The American Naturalist.

[22]  Keith M. Hines,et al.  Artificial surface pressure trends in the NCEP-NCAR reanalysis over the Southern Ocean and Antarctica , 2000 .

[23]  Paulo J. Azevedo,et al.  Mining Approximate Motifs in Time Series , 2006, Discovery Science.

[24]  Joseph A. Veech,et al.  A NULL MODEL FOR DETECTING NONRANDOM PATTERNS OF SPECIES RICHNESS ALONG SPATIAL GRADIENTS , 2000 .

[25]  Werner Ulrich,et al.  Null model analysis of species nestedness patterns. , 2007, Ecology.

[26]  E. Mammen Bootstrap and Wild Bootstrap for High Dimensional Linear Models , 1993 .

[27]  Eugene S. Edgington,et al.  Randomization Tests , 2011, International Encyclopedia of Statistical Science.

[28]  E. Mammen,et al.  Comparing Nonparametric Versus Parametric Regression Fits , 1993 .