Untenable nonstationarity: An assessment of the fi tness for purpose of trend tests in hydrology

The detection and attribution of long-term patterns in hydrological time series have been important research topics for decades. A significant portion of the literature regards such patterns as ‘deterministic components’ or ‘trends’ even though the complexity of hydrological systems does not allow easy deterministic explanations and attributions. Consequently, trend estimation techniques have been developed to make and justify statements about tendencies in the historical data, which are often used to predict future events. Testing trend hypothesis on observed time series is widespread in the hydro-meteorological literature mainly due to the interest in detecting consequences of human activities on the hydrological cycle. This analysis usually relies on the application of some null hypothesis significance tests (NHSTs) for slowly-varying and/or abrupt changes, such as MannKendall, Pettitt, or similar, to summary statistics of hydrological time series (e.g., annual averages, maxima, minima, etc.). However, the reliability of this application has seldom been explored in detail. This paper discusses misuse, misinterpretation, and logical flaws of NHST for trends in the analysis of hydrological data from three different points of view: historic-logical, semantic-epistemological, and practical. Based on a review of NHST rationale, and basic statistical definitions of stationarity, nonstationarity, and ergodicity, we show that even if the empirical estimation of trends in hydrological time series is always feasible from a numerical point of view, it is uninformative and does not allow the inference of nonstationarity without assuming a priori additional information on the underlying stochastic process, according to deductive reasoning. This prevents the use of trend NHST outcomes to support nonstationary frequency analysis and modeling. We also show that the correlation structures characterizing hydrological time series might easily be underestimated, further compromising the attempt to draw conclusions about trends spanning the period of records. Moreover, even though adjusting procedures accounting for correlation have been developed, some of them are insufficient or are applied only to some tests, while some others are theoretically flawed but still widely applied. In particular, using 250 unimpacted stream flow time series across the conterminous United States (CONUS), we show that the test results can dramatically change if the sequences of annual values are reproduced starting from daily stream flow records, whose larger sizes enable a more reliable assessment of the correlation structures. Faced with a sample of unknown origin, many applied statisticians working in economics, meteorology and the like, hasten to decompose it into a trend and an oscillation (and added periodic terms). They assume implicitly that the addends are attributable to distinct generating mechanisms, and are statistically independent. This last implicit assumption is quite unwarranted, except when the sample is generated by Brownian motion. (B.B. Mandelbrot, The Fractal Geometry of Nature, p. 352, 1982)

[1]  Bellie Sivakumar,et al.  Chaos in Hydrology: Bridging Determinism and Stochasticity , 2018 .

[2]  D. Rajan Probability, Random Variables, and Stochastic Processes , 2017 .

[3]  Gabriele Villarini,et al.  Analysis of changes in the magnitude, frequency, and seasonality of heavy precipitation over the contiguous USA , 2017, Theoretical and Applied Climatology.

[4]  Michael Leonard,et al.  A global-scale investigation of trends in annual maximum streamflow , 2017 .

[5]  Witold F. Krajewski,et al.  Effect of Spatially Distributed Small Dams on Flood Frequency: Insights from the Soap Creek Watershed , 2017 .

[6]  Jasper A. Vrugt,et al.  Predicting nonstationary flood frequencies: Evidence supports an updated stationarity thesis in the United States , 2017 .

[7]  Irene Brox Nilsen,et al.  A probabilistic approach for attributing temperature changes to synoptic type frequency , 2017 .

[8]  Kirk R. Barrett,et al.  Prevalence and Magnitude of Trends in Peak Annual Flow and 5-, 10-, and 20-Year Flows in the Northeastern United States , 2017 .

[9]  F. Serinaldi,et al.  General simulation algorithm for autocorrelated binary processes. , 2017, Physical review. E.

[10]  James M. Vose,et al.  The influence of watershed characteristics on spatial patterns of trends in annual scale streamflow variability in the continental U.S. , 2016 .

[11]  Pratik Pathak,et al.  Wavelet-Aided Analysis to Estimate Seasonal Variability and Dominant Periodicities in Temperature, Precipitation, and Streamflow in the Midwestern United States , 2016, Water Resources Management.

[12]  Elisabeth J. Moyer,et al.  Estimating trends in the global mean temperature record , 2016, 1607.03855.

[13]  S. Goodman,et al.  Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations , 2016, European Journal of Epidemiology.

[14]  Francesco Serinaldi,et al.  Irreversibility and complex network behavior of stream flow fluctuations , 2016 .

[15]  Francesco Serinaldi,et al.  Understanding Persistence to Avoid Underestimation of Collective Flood Risk , 2016 .

[16]  N. Lazar,et al.  The ASA Statement on p-Values: Context, Process, and Purpose , 2016 .

[17]  Kuk-Hyun Ahn,et al.  Trend and Variability in Observed Hydrological Extremes in the United States , 2016 .

[18]  Francesco Serinaldi,et al.  The importance of prewhitening in change point analysis under persistence , 2016, Stochastic Environmental Research and Risk Assessment.

[19]  Demetris Koutsoyiannis,et al.  Generic and parsimonious stochastic modelling for hydrology and beyond , 2016 .

[20]  Aldo Fiori,et al.  One hundred years of return period: Strengths and limitations , 2015 .

[21]  Michael D. Dettinger,et al.  On Critiques of “Stationarity is Dead: Whither Water Management?” , 2015 .

[22]  J. Vose,et al.  Continental U.S. streamflow trends from 1940 to 2009 and their relationships with watershed spatial characteristics , 2015 .

[23]  M. Bayazit,et al.  Nonstationarity of Hydrological Records and Recent Trends in Trend Analysis: A State-of-the-art Review , 2015, Environmental Processes.

[24]  Demetris Koutsoyiannis,et al.  Negligent killing of scientific concepts: the stationarity case , 2015 .

[25]  Francesco Serinaldi,et al.  Dismissing return periods! , 2015, Stochastic Environmental Research and Risk Assessment.

[26]  Francesco Serinaldi,et al.  Stationarity is undead: Uncertainty dominates the distribution of extremes , 2015 .

[27]  G. Villarini,et al.  The changing nature of flooding across the central United States , 2015 .

[28]  R. Katz,et al.  Non-stationary extreme value analysis in a changing climate , 2014, Climatic Change.

[29]  Nicholas J Gotelli,et al.  P values, hypothesis testing, and model selection: it's déjà vu all over again. , 2014, Ecology.

[30]  J. Salas,et al.  Revisiting the Concepts of Return Period and Risk for Nonstationary Hydrologic Extreme Events , 2014 .

[31]  Francesco Serinaldi,et al.  Analysis of time variation of rainfall in transnational basins in Iberia: abrupt changes or trends? , 2014 .

[32]  S. El Adlouni,et al.  Trends and variability in extreme precipitation indices over Maghreb countries , 2013 .

[33]  Ilaria Prosdocimi,et al.  Non-stationarity in annual and seasonal series of peak flow and precipitation in the UK , 2013 .

[34]  Holger Rootzén,et al.  Design Life Level: Quantifying risk in a changing climate , 2013 .

[35]  Elena Volpi,et al.  Just two moments! A cautionary note against use of high-order moments in multifractal models in hydrology , 2013 .

[36]  Christian L. E. Franzke,et al.  A novel method to test for significant trends in extreme values in serially dependent time series , 2013 .

[37]  Ximing Cai,et al.  Detecting gradual and abrupt changes in hydrological records , 2013 .

[38]  Stelios Katsanevakis,et al.  Strengthening statistical usage in marine ecology , 2012 .

[39]  Daniel M. Murphy,et al.  Identifying weekly cycles in meteorological variables: The importance of an appropriate statistical analysis , 2012 .

[40]  Bruno Merz,et al.  HESS Opinions "More efforts and scientific rigour are needed to attribute trends in flood time series" , 2012 .

[41]  Gabriele Villarini,et al.  Detecting inhomogeneities in the Twentieth Century Reanalysis over the central United States , 2012 .

[42]  Khaled H. Hamed The distribution of Kendall's tau for testing the significance of cross-correlation in persistent data , 2011 .

[43]  T. Ouarda,et al.  Bayesian Nonstationary Frequency Analysis of Hydrological Variables 1 , 2011 .

[44]  Witold F. Krajewski,et al.  Examining Flood Frequency Distributions in the Midwest U.S. 1 , 2011 .

[45]  Richard E. Chandler,et al.  Statistical Methods for Trend Detection and Analysis in the Environmental Sciences , 2011 .

[46]  Francesco Serinaldi,et al.  Analyses of seasonal and annual maximum daily discharge records for central Europe , 2011 .

[47]  George Christakos,et al.  Integrative Problem-Solving in a Time of Decadence , 2010 .

[48]  Robin T. Clarke,et al.  On the (mis)use of statistical methods in hydro-climatological research , 2010 .

[49]  G. Villarini,et al.  Flood peak distributions for the eastern United States , 2009 .

[50]  Demetris Koutsoyiannis,et al.  HESS Opinions "A random walk on water" , 2009 .

[51]  Khaled H. Hamed Effect of persistence on the significance of Kendall’s tau as a measure of correlation between natural time series , 2009 .

[52]  Taha B. M. J. Ouarda,et al.  Identification of temporal trends in annual and seasonal low flows occurring in Canadian rivers: The effect of short- and long-term persistence , 2009 .

[53]  T. Ouarda,et al.  Identification of hydrological trends in the presence of serial and cross correlations: A review of selected methods and their application to annual flow regimes of Canadian rivers , 2009 .

[54]  Khaled H. Hamed Enhancing the effectiveness of prewhitening in trend analysis of hydrologic data , 2009 .

[55]  P. Bates,et al.  Flood frequency analysis for nonstationary annual peak records in an urban drainage basin , 2009 .

[56]  T. Levine,et al.  A Critical Assessment of Null Hypothesis Significance Testing in Quantitative Communication Research , 2008 .

[57]  Thomas C. Piechota,et al.  Changes in U.S. Streamflow and Western U.S. Snowpack , 2008 .

[58]  Khaled H. Hamed Trend detection in hydrologic data: The Mann–Kendall trend test under the scaling hypothesis , 2008 .

[59]  A. I. McLeod,et al.  Algorithms for Linear Time Series Analysis: With R Package , 2007 .

[60]  M. Bayazit,et al.  To prewhiten or not to prewhiten in trend analysis? , 2007 .

[61]  Barbara G. Brown,et al.  The problem of multiplicity in research on teleconnections , 2007 .

[62]  David M. Hannah,et al.  Large-Scale Climatic Controls on New England River Flow , 2007 .

[63]  Demetris Koutsoyiannis,et al.  Statistical analysis of hydroclimatic time series: Uncertainty and insights , 2007 .

[64]  C. Simmer,et al.  Statistical characteristics of surrogate data based on geophysical measurements , 2006 .

[65]  Clemens Simmer,et al.  A Stochastic Iterative Amplitude Adjusted Fourier Transform algorithm with improved accuracy , 2006 .

[66]  Shlomo Havlin,et al.  Long-term memory: a natural mechanism for the clustering of extreme events and anomalous residual times in climate records. , 2005, Physical review letters.

[67]  S. Yue,et al.  The Mann-Kendall Test Modified by Effective Sample Size to Detect Trend in Serially Correlated Hydrological Series , 2004 .

[68]  Zbigniew W. Kundzewicz,et al.  Change detection in hydrological records—a review of the methodology / Revue méthodologique de la détection de changements dans les chroniques hydrologiques , 2004 .

[69]  A. Sankarasubramanian,et al.  Effect of persistence on trend detection via regression , 2003 .

[70]  Christopher K. Wikle,et al.  Modeling Hydrologic Change: Statistical Methods , 2003, Technometrics.

[71]  M. Tribus,et al.  Probability theory: the logic of science , 2003 .

[72]  Demetris Koutsoyiannis,et al.  Climate change, the Hurst phenomenon, and hydrological statistics , 2003 .

[73]  Sheng Yue,et al.  The influence of autocorrelation on the ability to detect trend in hydrological series , 2002 .

[74]  Sheng Yue,et al.  Applicability of prewhitening to eliminate the influence of serial correlation on the Mann‐Kendall test , 2002 .

[75]  Richard W. Katz,et al.  Sir Gilbert Walker and a Connection between El Niño and Statistics , 2002 .

[76]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[77]  Neville Nicholls,et al.  commentary and analysis: The Insignificance of Significance Testing , 2001 .

[78]  Richard M. Vogel,et al.  Trends in floods and low flows in the United States: impact of spatial correlation , 2000 .

[79]  H. Storch,et al.  Statistical Analysis in Climate Research , 2000 .

[80]  J. Gill The Insignificance of Null Hypothesis Significance Testing , 1999 .

[81]  Douglas H. Johnson The Insignificance of Statistical Significance Testing , 1999 .

[82]  D. Kugiumtzis Test your surrogate data before you test for nonlinearity. , 1999, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[83]  James H. Lambert,et al.  Risk of Extreme Events Under Nonstationary Conditions , 1998 .

[84]  Khaled H. Hamed,et al.  A modified Mann-Kendall trend test for autocorrelated data , 1998 .

[85]  Klaus Hasselmann,et al.  Multi-pattern fingerprint method for detection and attribution of climate change , 1997 .

[86]  H. Storch,et al.  Changes in the winter precipitation in Romania and its relation to the large-scale circulation , 1996 .

[87]  Schreiber,et al.  Improved Surrogate Data for Nonlinearity Tests. , 1996, Physical review letters.

[88]  Hongshik Ahn,et al.  Generation of Over-Dispersed and Under-Dispersed Binomial Variates , 1995 .

[89]  Jacob Cohen The earth is round (p < .05) , 1994 .

[90]  J. R. Wallis,et al.  Hydro-Climatological Trends in the Continental United States, 1948-88 , 1994 .

[91]  K. Hasselmann Optimal Fingerprints for the Detection of Time-dependent Climate Change , 1993 .

[92]  Timothy J. Brown,et al.  Criteria and Methods for Performing and Evaluating Solar-Weather Studies , 1993 .

[93]  Hans von Storch,et al.  Monte Carlo experiments on the effect of serial correlation on the Mann-Kendall test of trend , 1992 .

[94]  P. Phillips,et al.  Testing the null hypothesis of stationarity against the alternative of a unit root: How sure are we that economic time series have a unit root? , 1992 .

[95]  L. M. Berliner,et al.  Statistics, Probability and Chaos , 1992 .

[96]  John Beatty,et al.  The Empire of Chance: How Probability Changed Science and Everyday Life , 1989 .

[97]  Donald E. Myers,et al.  To be or not to be... stationary? That is the question , 1989 .

[98]  R. Katz Statistical Procedures for Making Inferences about Climate Variability , 1988 .

[99]  P. Pollard,et al.  On the probability of making Type I errors. , 1987 .

[100]  W. Fuller,et al.  Distribution of the Estimators for Autoregressive Time Series with a Unit Root , 1979 .

[101]  A. I. McLeod,et al.  Preservation of the rescaled adjusted range: 1. A reassessment of the Hurst Phenomenon , 1978 .

[102]  Umberto Eco,et al.  Peirce's Notion of Interpretant , 1976 .

[103]  Vujica Yevjevich,et al.  Determinism and stochasticity in hydrology , 1974 .

[104]  J. G. Skellam A Probability Distribution Derived from the Binomial Distribution by Regarding the Probability of Success as Variable between the Sets of Trials , 1948 .

[105]  J. M. Hammersley,et al.  The “Effective” Number of Independent Observations in an Autocorrelated Time Series , 1946 .

[106]  H. B. Mann Nonparametric Tests Against Trend , 1945 .

[107]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[108]  Shlomo Havlin,et al.  The Statistics of Return Intervals, Maxima, and Centennial Events Under the Influence of Long-Term Correlations , 2011 .

[109]  Demetris Koutsoyiannis,et al.  Simultaneous estimation of the parameters of the Hurst–Kolmogorov stochastic process , 2011 .

[110]  Demetris Koutsoyiannis “Hurst-Kolomogorov Dynamics and Uncertainty” , 2010 .

[111]  Demetris Koutsoyiannis A random walk on water , 2009 .

[112]  J. H. Steiger,et al.  The Problem Is Epistemology, Not Statistics: Replace Significance Tests by Confidence Intervals and Quantify Accuracy of Risky Numerical Predictions , 2002 .

[113]  Francis W. Zwiers,et al.  Detection of climate change and attribution of causes , 2001 .

[114]  Daniel S. Wilks,et al.  Resampling Hypothesis Tests for Autocorrelated Fields , 1997 .

[115]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[116]  Hans von Storch,et al.  Misuses of Statistical Analysis in Climate Research , 1995 .

[117]  G. McBride,et al.  What do significance tests really tell us about the environment? , 1993 .

[118]  A. N. PETTrrr A Non-parametric Approach to the Change-point Problem , 1979 .

[119]  E. S. Pearson,et al.  On the Problem of the Most Efficient Tests of Statistical Hypotheses , 1933 .