Detecting breakpoints in artificially modified- and real-life time series using three state-of-the-art methods

Abstract Time series often contain breakpoints of different origin, i.e. breakpoints, caused by (i) shifts in trend, (ii) other changes in trend and/or, (iii) changes in variance. In the present study, artificially generated time series with white and red noise structures are analyzed using three recently developed breakpoint detection methods. The time series are modified so that the exact “locations” of the artificial breakpoints are prescribed, making it possible to evaluate the methods exactly. Hence, the study provides a deeper insight into the behaviour of the three different breakpoint detection methods. Utilizing this experience can help solving breakpoint detection problems in real-life data sets, as is demonstrated with two examples taken from the fields of paleoclimate research and petrology.

[1]  Elena Marchiori,et al.  Chromosomal Breakpoint Detection in Human Cancer , 2003, EvoWorkshops.

[2]  G. Morreale,et al.  Spring thermal resources for grapevine in Kőszeg (Hungary) deduced from a very long pictorial time series (1740–2009) , 2014, Climatic Change.

[3]  Pedro M. A. Miranda,et al.  Piecewise linear fitting and trend changing points of climate parameters , 2004 .

[4]  Jon D. Pelletier,et al.  Long-range persistence in climatological and hydrological time series: analysis, modeling and application to drought hazard assessment , 1997 .

[5]  Jifan Chou,et al.  A new method for abrupt change detection in dynamic structures , 2008 .

[6]  P. Yodzis,et al.  THE COLOR OF ENVIRONMENTAL NOISE , 2004 .

[7]  Simon D. P. Williams,et al.  Offsets in Global Positioning System time series , 2003 .

[8]  P. Young,et al.  Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[9]  Georgy Sofronov,et al.  Multiple Break-Points Detection in Array CGH Data via the Cross-Entropy Method , 2015, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[10]  P. Varotsos,et al.  Long-range correlations in the electric signals that precede rupture: further investigations. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Douglas M. Hawkins,et al.  Statistical Process Control for Shifts in Mean or Variance Using a Changepoint Formulation , 2005, Technometrics.

[12]  David O Siegmund,et al.  A Modified Bayes Information Criterion with Applications to the Analysis of Comparative Genomic Hybridization Data , 2007, Biometrics.

[13]  J.H.M. Thomeer,et al.  Introduction of a Pore Geometrical Factor Defined by the Capillary Pressure Curve , 1960 .

[14]  Douglas M. Hawkins,et al.  The Changepoint Model for Statistical Process Control , 2003 .

[15]  Jiahui Wang,et al.  A Bayesian Time Series Model of Multiple Structural Changes in Level, Trend, and Variance , 2000 .

[16]  S. Korsmeyer,et al.  Cloning the chromosomal breakpoint of t(14;18) human lymphomas: clustering around Jh on chromosome 14 and near a transcriptional unit on 18 , 1985, Cell.

[17]  N. Wardlaw,et al.  Mercury Capillary Pressure Curves and the Intepretation of Pore Structure and Capillary Behaviour in Reservoir Rocks , 1976 .

[18]  József Szalai,et al.  Detection and evaluation of changes induced by the diversion of River Danube in the territorial appearance of latent effects governing shallow-groundwater fluctuations , 2015 .

[19]  Wolfgang Urfer,et al.  Spatial prediction of the intensity of latent effects governing hydrogeological phenomena , 1999 .

[20]  A. C. Costa,et al.  Homogenization of Climate Data: Review and New Perspectives Using Geostatistics , 2009 .

[21]  Mark D. Schwartz,et al.  Phenology and climate: the timing of life cycle events as indicators of climatic variability and change , 2002 .

[22]  I. Haidu A stochastic vision of the paleoclimate. Modelling and predictibility , 2014 .

[23]  A. Shirvani Change point analysis of mean annual air temperature in Iran , 2015 .

[24]  Dimitris K. Tasoulis,et al.  Nonparametric Monitoring of Data Streams for Changes in Location and Scale , 2011, Technometrics.

[25]  Holly A. Titchner,et al.  Assessing Bias and Uncertainty in the HadAT-Adjusted Radiosonde Climate Record , 2008 .

[26]  I. Nemes Revisiting the applications of drainage capillary pressure curves in water-wet hydrocarbon systems , 2016 .

[27]  George Y. Sofronov,et al.  A modified cross entropy method for detecting multiple change points in DNA Count Data , 2012, 2012 IEEE Congress on Evolutionary Computation.

[28]  S. Brönnimann,et al.  Break detection of annual Swiss temperature series , 2012 .

[29]  M. S. Lazaridou,et al.  Latest aspects of earthquake prediction in Greece based on seismic electric signals : Tectonophysics V188, N3/4, March 1991, P321–347 , 1991 .

[30]  D. W. Zimmerman Comparative Power of Student T Test and Mann-Whitney U Test for Unequal Sample Sizes and Variances , 1987 .

[31]  H. Akaike A new look at the statistical model identification , 1974 .

[32]  I. Matyasovszky,et al.  Abrupt temperature changes during the last 1,500 years , 2013, Theoretical and Applied Climatology.

[33]  Jörg Franke,et al.  Spectral biases in tree-ring climate proxies , 2013 .

[34]  I. Matyasovszky,et al.  Detecting abrupt climate changes on different time scales , 2011 .

[35]  Khaled H. Hamed,et al.  A modified Mann-Kendall trend test for autocorrelated data , 1998 .

[36]  Victor Venema,et al.  On the multiple breakpoint problem and the number of significant breaks in homogenization of climate records , 2013 .

[37]  M. S. Lazaridou,et al.  Fluctuations, under time reversal, of the natural time and the entropy distinguish similar looking electric signals of different dynamics , 2007, 0707.3074.

[38]  R. Tsay Outliers, Level Shifts, and Variance Changes in Time Series , 1988 .

[39]  G. Manley Central England temperatures: Monthly means 1659 to 1973 , 1974 .

[40]  Matt A. King,et al.  Detecting offsets in GPS time series: First results from the detection of offsets in GPS experiment , 2013 .

[41]  Fred A. Spiring,et al.  On the average run lengths of quality control schemes using a Markov chain approach , 2002 .

[42]  Mark D. Schwartz,et al.  Special Issue: Phenology and Climate. Issue Edited by A. J. H. van Vliet, M. D. Schwartz. , 2002 .

[43]  P. Varotsos,et al.  Detrended fluctuation analysis of the magnetic and electric field variations that precede rupture. , 2009, Chaos.

[44]  T. Karl,et al.  The record breaking global temperatures of 1997 and 1998: Evidence for an increase in the rate of global warming? , 2000 .

[45]  Francis W. Zwiers,et al.  An Introduction to Trends in Extreme Weather and Climate Events: Observations, Socioeconomic Impacts, Terrestrial Ecological Impacts, and Model Projections* , 2000 .

[46]  Gareth E. Evans,et al.  Estimating change-points in biological sequences via the cross-entropy method , 2011, Ann. Oper. Res..

[47]  P. Perron,et al.  Estimating and testing linear models with multiple structural changes , 1995 .

[48]  Douglas M. Hawkins,et al.  A Change-Point Model for a Shift in Variance , 2005 .

[49]  T. Levanič,et al.  Natural proxy records of temperature- and hydroclimate variability with annual resolution from the Northern Balkan–Carpathian region for the past millennium – Review & recalibration , 2016 .

[50]  J. W. Backus The History of FORTRAN I, II and III , 1979, IEEE Ann. Hist. Comput..

[51]  Masashi Kamogawa,et al.  Natural time analysis of critical phenomena , 2011, Proceedings of the National Academy of Sciences.

[52]  V. Venema,et al.  The uncertainty of break positions detected by homogenization algorithms in climate records , 2016 .