Comparison of correlation analysis techniques for irregularly sampled time series

Abstract. Geoscientific measurements often provide time series with irregular time sampling, requiring either data reconstruction (interpolation) or sophisticated methods to handle irregular sampling. We compare the linear interpolation technique and different approaches for analyzing the correlation functions and persistence of irregularly sampled time series, as Lomb-Scargle Fourier transformation and kernel-based methods. In a thorough benchmark test we investigate the performance of these techniques. All methods have comparable root mean square errors (RMSEs) for low skewness of the inter-observation time distribution. For high skewness, very irregular data, interpolation bias and RMSE increase strongly. We find a 40 % lower RMSE for the lag-1 autocorrelation function (ACF) for the Gaussian kernel method vs. the linear interpolation scheme,in the analysis of highly irregular time series. For the cross correlation function (CCF) the RMSE is then lower by 60 %. The application of the Lomb-Scargle technique gave results comparable to the kernel methods for the univariate, but poorer results in the bivariate case. Especially the high-frequency components of the signal, where classical methods show a strong bias in ACF and CCF magnitude, are preserved when using the kernel methods. We illustrate the performances of interpolation vs. Gaussian kernel method by applying both to paleo-data from four locations, reflecting late Holocene Asian monsoon variability as derived from speleothem δ18O measurements. Cross correlation results are similar for both methods, which we attribute to the long time scales of the common variability. The persistence time (memory) is strongly overestimated when using the standard, interpolation-based, approach. Hence, the Gaussian kernel is a reliable and more robust estimator with significant advantages compared to other techniques and suitable for large scale application to paleo-data.

[1]  W. T. Mayo Spectrum Measurements with Laser Velocimeters , 1978 .

[2]  J. Scargle Studies in astronomical time series analysis. I - Modeling random processes in the time domain , 1981 .

[3]  J. Scargle Studies in astronomical time series analysis. II - Statistical aspects of spectral analysis of unevenly spaced data , 1982 .

[4]  R Edelson,et al.  The Discrete Correlation Function: a New Method for Analyzing Unevenly Sampled Variability Data , 1988 .

[5]  J. Scargle Studies in astronomical time series analysis. III - Fourier transforms, autocorrelation functions, and cross-correlation functions of unevenly spaced data , 1989 .

[6]  Nicholas I. Fisher,et al.  On the Nonparametric Estimation of Covariance Functions , 1994 .

[7]  Michael Schulz,et al.  Spectrum: spectral analysis of unevenly spaced paleoclimatic time series , 1997 .

[8]  Xiahong Feng,et al.  Tree-Ring δD as an Indicator of Asian Monsoon Intensity , 1999, Quaternary Research.

[9]  R. Bos,et al.  The Accuracy of Time Series Analysis for Laser-Doppler Velocimetry , 2000 .

[10]  Cameron Tropea,et al.  Estimation of turbulent velocity spectra from laser Doppler data , 2000 .

[11]  M Schimmel,et al.  Emphasizing Difficulties in the Detection of Rhythms with Lomb-Scargle Periodograms , 2001, Biological rhythm research.

[12]  Manfred Mudelsee,et al.  TAUEST: a computer program for estimating persistence in unevenly spaced weather/climate time series , 2002 .

[13]  Piet M. T. Broersen,et al.  Autoregressive spectral estimation by application of the Burg algorithm to irregularly sampled data , 2002, IEEE Trans. Instrum. Meas..

[14]  W. Falck,et al.  Nonparametric spatial covariance functions: Estimation and testing , 2001, Environmental and Ecological Statistics.

[15]  R. Lawrence Edwards,et al.  The Holocene Asian Monsoon: Links to Solar Changes and North Atlantic Climate , 2005, Science.

[16]  Robert F. Mudde,et al.  Estimation of turbulence power spectra for bubbly flows from Laser Doppler Anemometry signals , 2005 .

[17]  M. Langlois,et al.  Society of Photo-Optical Instrumentation Engineers , 2005 .

[18]  Petre Stoica,et al.  Spectral analysis of irregularly-sampled data: Paralleling the regularly-sampled data approaches , 2006, Digit. Signal Process..

[19]  B. A. Maher,et al.  Holocene variability of the East Asian summer monsoon from Chinese cave records: a re-assessment , 2008 .

[20]  Gideon M. Henderson,et al.  Quantification of Holocene Asian monsoon rainfall from spatially separated cave records , 2008 .

[21]  Guofu Yuan,et al.  Stable isotopes of summer monsoonal precipitation in southern China and the moisture sources evidence from δ18O signature , 2008 .

[22]  R. Lawrence Edwards,et al.  A Test of Climate, Sun, and Culture Relationships from an 1810-Year Chinese Cave Record , 2008, Science.

[23]  K. Hocke,et al.  Gap filling and noise reduction of unevenly sampled data by means of the Lomb-Scargle periodogram , 2008 .

[24]  Hao He,et al.  Spectral Analysis of Nonuniformly Sampled Data: A New Approach Versus the Periodogram , 2009, IEEE Transactions on Signal Processing.

[25]  Piet M. T. Broersen Five Separate Bias Contributions in Time Series Models for Equidistantly Resampled Irregular Data , 2009, IEEE Transactions on Instrumentation and Measurement.

[26]  T. Hovatta,et al.  LONG-TERM VARIABILITY OF RADIO-BRIGHT BL LACERTAE OBJECTS , 2009, 0910.2607.

[27]  S. Carpenter,et al.  Early-warning signals for critical transitions , 2009, Nature.

[28]  Hai Cheng,et al.  Persistent multidecadal power of the Indian Summer Monsoon , 2010 .

[29]  Z. Cao,et al.  Multi-band optical variability of BL Lac object OQ 530 , 2010 .

[30]  C. Dermer,et al.  TIMING SIGNATURES OF THE INTERNAL-SHOCK MODEL FOR BLAZARS , 2010, 1001.1606.

[31]  Petre Stoica,et al.  Spectral analysis of nonuniformly sampled data - a review , 2010, Digit. Signal Process..

[32]  S. Clemens,et al.  Orbital‐scale timing and mechanisms driving Late Pleistocene Indo‐Asian summer monsoons: Reinterpreting cave speleothem δ18O , 2010 .

[33]  Zhi-qiang Shen,et al.  Long-term variation time scales in OJ 287 , 2010 .

[34]  Andrew C. Parnell,et al.  Climate Time Series Analysis: Classical Statistical and Bootstrap Methods , 2013 .