Clustering Nonstationary Circadian Rhythms using Locally Stationary Wavelet Representations

How does soil pollution affect a plant's circadian clock? Are there any differences between how the clock reacts when exposed to different concentrations of elements of the periodic table? If so, can we characterise these differences? We approach these questions by analysing and modelling circadian plant data, where the levels of expression of a luciferase reporter gene were measured at regular intervals over a number of days after exposure to different concentrations of lithium. A key aspect of circadian data analysis is to determine whether a time series (derived from experimental data) is `rhythmic' and, if so, to determine the underlying period. However, our dataset displays nonstationary traits such as changes in amplitude, gradual changes in period and phase-shifts. In this paper, we develop clustering methods using a wavelet transform. Wavelets are chosen as they are ideally suited to identifying discriminant local time and scale features. Furthermore, we propose treating the observed time series as realisations of locally stationary wavelet processes. This allows us to define and estimate the evolutionary wavelet spectrum. We can then compare, in a quantitative way, using a functional principal components analysis, the time-frequency patterns of the time series. Our approach uses a clustering algorithm to group the data according to their time-frequency patterns. We demonstrate the advantages of our methodology over alternative approaches and show that it successfully clusters our data.

[1]  M. B. Priestley,et al.  A Test for Non‐Stationarity of Time‐Series , 1969 .

[2]  Tomasz Zielinski,et al.  Online period estimation and determination of rhythmicity in circadian data, using the BioDare data infrastructure. , 2014, Methods in molecular biology.

[3]  Hernando Ombao,et al.  Consistent Classification of Nonstationary Time Series Using Stochastic Wavelet Representations , 2009 .

[4]  Paul Fearnhead,et al.  Classification of non‐stationary time series , 2014 .

[5]  D. B. Preston Spectral Analysis and Time Series , 1983 .

[6]  S. Mallat Multiresolution approximations and wavelet orthonormal bases of L^2(R) , 1989 .

[7]  Anestis Antoniadis,et al.  Clustering Functional Data using Wavelets , 2010, Int. J. Wavelets Multiresolution Inf. Process..

[8]  Fred W. Turek,et al.  Overview of Circadian Rhythms , 2001, Alcohol research & health : the journal of the National Institute on Alcohol Abuse and Alcoholism.

[9]  A. Millar,et al.  Circadian genetics in the model higher plant, Arabidopsis thaliana. , 2005, Methods in enzymology.

[10]  B. Silverman,et al.  The Stationary Wavelet Transform and some Statistical Applications , 1995 .

[11]  D. R. Hoagland,et al.  The Water-Culture Method for Growing Plants Without Soil , 2018 .

[12]  B. Silverman,et al.  Functional Data Analysis , 1997 .

[13]  P. Hardin,et al.  Circadian rhythms from multiple oscillators: lessons from diverse organisms , 2005, Nature Reviews Genetics.

[14]  Eamonn J. Keogh,et al.  An Enhanced Representation of Time Series Which Allows Fast and Accurate Classification, Clustering and Relevance Feedback , 1998, KDD.

[15]  Bernard Cazelles,et al.  Analysing multiple time series and extending significance testing in wavelet analysis , 2008 .

[16]  P. Huijser,et al.  Modulation of copper deficiency responses by diurnal and circadian rhythms in Arabidopsis thaliana , 2015, Journal of experimental botany.

[17]  M. Priestley Evolutionary Spectra and Non‐Stationary Processes , 1965 .

[18]  R. Dahlhaus On the Kullback-Leibler information divergence of locally stationary processes , 1996 .

[19]  G. Nason,et al.  Wavelet processes and adaptive estimation of the evolutionary wavelet spectrum , 2000 .

[20]  C. Robertson McClung,et al.  Plant Circadian Rhythms , 2006, The Plant Cell Online.

[21]  R. Shumway Time-frequency clustering and discriminant analysis , 2003 .

[22]  Piotr Fryzlewicz,et al.  Haar–Fisz estimation of evolutionary wavelet spectra , 2006 .

[23]  Yannig Goude,et al.  Modeling and Forecasting Daily Electricity Load Curves: A Hybrid Approach , 2013, 1611.08632.

[24]  Christopher K Wikle,et al.  Modeling Complex Phenotypes: Generalized Linear Models Using Spectrogram Predictors of Animal Communication Signals , 2010, Biometrics.

[25]  Ferenc Nagy,et al.  Multiple phytohormones influence distinct parameters of the plant circadian clock , 2006, Genes to cells : devoted to molecular & cellular mechanisms.

[26]  Efstathios Paparoditis,et al.  Wavelet Methods in Statistics with R , 2010 .

[27]  S. Kay,et al.  Circadian Control of cab Gene Transcription and mRNA Accumulation in Arabidopsis. , 1991, The Plant cell.

[28]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  Noel A Cressie,et al.  Statistics for Spatio-Temporal Data , 2011 .

[30]  R. Dahlhaus Fitting time series models to nonstationary processes , 1997 .

[31]  Daubechies,et al.  Ten Lectures on Wavelets Volume 921 , 1992 .