distantia: an open‐source toolset to quantify dissimilarity between multivariate ecological time‐series

There is a large array of methods to extract knowledge and perform ecological forecasting from ecological time‐series. However, in spite of its importance for data‐mining, pattern‐matching and ecological synthesis, methods to assess their similarity are scarce. We introduce distantia (v1.0.1), an R package providing general toolset to quantify dissimilarity between ecological time‐series, independently of their regularity and number of samples. The functions in distantia provide the means to compute dissimilarity scores by time and by shape and assess their significance, evaluate the partial contribution of each variable to dissimilarity, and align or combine sequences by similarity. We evaluate the sensitivity of the dissimilarity metrics implemented in distantia, describe its structure and functionality, and showcase its applications with two examples. Particularly, we evaluate how geographic factors drive the dissimilarity between nine pollen sequences dated to the Last Interglacial, and compare the temporal dynamics of climate and enhanced vegetation index of three stands across the range of the European beech. We expect this package may enhance the capabilities of researchers from different fields to explore dissimilarity patterns between multivariate ecological time‐series, and aid in generating and testing new hypotheses on why the temporal dynamics of complex‐systems changes over space and time.

[1]  A. Strahler,et al.  Climate controls on vegetation phenological patterns in northern mid‐ and high latitudes inferred from MODIS data , 2004 .

[2]  Andy Purvis,et al.  MODISTools – downloading and processing MODIS remotely sensed data in R , 2014, Ecology and evolution.

[3]  U. Müller A Late-Pleistocene pollen sequence from the Jammertal, south-western Germany with particular reference to location and altitude as factors determining Eemian forest composition , 2000 .

[4]  Matthias Baumann,et al.  Phenology from Landsat when data is scarce: Using MODIS and Dynamic Time-Warping to combine multi-year Landsat imagery to derive annual phenology curves , 2017, Int. J. Appl. Earth Obs. Geoinformation.

[5]  T. Korsman,et al.  Spatial heterogeneity of diatom stratigraphy in varved and non-varved sediments of a small, boreal-forest lake , 1994, Aquatic Sciences.

[6]  Paul M. Thompson,et al.  Phenological sensitivity to climate across taxa and trophic levels , 2016, Nature.

[7]  Martin H. Trauth,et al.  Abrupt or gradual? Change point analysis of the late Pleistocene–Holocene climate record from Chew Bahir, southern Ethiopia , 2018, Quaternary Research.

[8]  Gesche Krause,et al.  Time is an affliction: Why ecology cannot be as predictive as physics and why it needs time series , 2015 .

[9]  A. Hense,et al.  Eemian and Early Weichselian temperature and precipitation variability in northern Germany , 2007 .

[10]  W. Morawski,et al.  Geological position and pollen analysis of Eemian Interglacial sediments of Warsaw-Wawrzyszew , 1993 .

[11]  Simon Goring,et al.  neotoma: A Programmatic Interface to the Neotoma Paleoecological Database , 2015 .

[12]  G. Woillard Grande Pile Peat Bog: A Continuous Pollen Record for the Last 140,000 Years , 1978, Quaternary Research.

[13]  Toni Giorgino,et al.  Computing and Visualizing Dynamic Time Warping Alignments in R: The dtw Package , 2009 .

[14]  H. Birks,et al.  Biodiversity baselines, thresholds and resilience: testing predictions and assumptions using palaeoecological data. , 2010, Trends in ecology & evolution.

[15]  Tomasz Górecki,et al.  Multivariate time series classification with parametric derivative dynamic time warping , 2015, Expert Syst. Appl..

[16]  Alexander Mendiburu,et al.  Distance Measures for Time Series in R: The TSdist Package , 2016, R J..

[17]  C. Ravazzi Late Quaternary history of spruce in southern Europe , 2002 .

[18]  Eamonn J. Keogh,et al.  Experimental comparison of representation methods and distance measures for time series data , 2010, Data Mining and Knowledge Discovery.

[19]  Roy Thompson,et al.  Paleorainfall Reconstructions from Pedogenic Magnetic Susceptibility Variations in the Chinese Loess and Paleosols , 1995, Quaternary Research.

[20]  A. Lotter,et al.  Rates of change and chronological problems during the late-glacial period , 1992 .

[21]  A. D. Gordon,et al.  NUMERICAL-METHODS IN QUATERNARY PALEOECOLOGY .2. COMPARISON OF POLLEN DIAGRAMS , 1972 .

[22]  Peter Turchin,et al.  Complex Dynamics in Ecological Time Series , 1992 .

[23]  Achille Mauri,et al.  EU-Forest, a high-resolution tree occurrence dataset for Europe , 2017, Scientific Data.

[24]  Jarrett J. Barber,et al.  Quantifying ecological memory in plant and ecosystem processes. , 2015, Ecology letters.

[25]  R. M. Clark,et al.  A FORTRAN program for constrained sequence-slotting based on minimum combined path length , 1985 .

[26]  J. Guiot,et al.  Continental European Eemian and early Würmian climate evolution: comparing signals using different quantitative reconstruction approaches based on pollen , 2003 .

[27]  P. Jones,et al.  Updated high‐resolution grids of monthly climatic observations – the CRU TS3.10 Dataset , 2014 .

[28]  B. Frenzel Über einen frühen letzteiszeitlichen Vorstoß des Rheingletschers in das deutsche Alpenvorland , 1991 .

[29]  V. Brovkin,et al.  Mid-Pleistocene transition in glacial cycles explained by declining CO2 and regolith removal , 2019, Science Advances.

[30]  T. Suominen Applying MERIS time series and dynamic time warping for delineating areas with similar temporal behaviour in the northern Baltic Sea , 2018, Ecological Indicators.

[31]  George Sugihara,et al.  Detecting Causality in Complex Ecosystems , 2012, Science.

[32]  S. Carpenter,et al.  Methods for Detecting Early Warnings of Critical Transitions in Time Series Illustrated Using Simulated Ecological Data , 2012, PloS one.