Detecting Abrupt Changes in the Presence of Local Fluctuations and Autocorrelated Noise

Whilst there is a plethora of algorithms for detecting changes in mean in univariate time-series, almost all struggle in real applications where there is autocorrelated noise or where the mean fluctuates locally between the abrupt changes that one wishes to detect. In these cases, default implementations, which are often based on assumptions of a constant mean between changes and independent noise, can lead to substantial over-estimation of the number of changes. We propose a principled approach to detect such abrupt changes that models local fluctuations as a random walk process and autocorrelated noise via an AR(1) process. We then estimate the number and location of changepoints by minimising a penalised cost based on this model. We develop a novel and efficient dynamic programming algorithm, DeCAFS, that can solve this minimisation problem despite the additional challenge of dependence across segments, which arises from the autocorrelated noise and makes existing algorithms inapplicable. Theory and empirical results show that our approach has greater power to detect abrupt changes than existing approaches. We apply our method to measuring gene expression levels in bacteria.
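
To make the modelling assumptions concrete, the following is a minimal sketch of how data of the kind described above might be simulated: a mean that drifts as a random walk, occasional abrupt jumps, and AR(1) observation noise. It is not the DeCAFS algorithm, which performs the detection. The function name `simulate` and the parameter names `sigma_eta`, `sigma_nu` and `phi` are illustrative assumptions and do not come from the abstract.

```python
import random


def simulate(n, changepoints, jump_sizes, sigma_eta=0.1, sigma_nu=1.0,
             phi=0.5, seed=0):
    """Simulate a random-walk mean with abrupt jumps plus AR(1) noise.

    All names here are hypothetical and chosen for illustration:
      sigma_eta: std. dev. of the random-walk increments (local fluctuations)
      sigma_nu:  std. dev. of the AR(1) innovations
      phi:       AR(1) autocorrelation parameter, assumed to satisfy |phi| < 1
    """
    rng = random.Random(seed)
    jumps = dict(zip(changepoints, jump_sizes))

    mu = 0.0
    # Draw the first noise term from the stationary AR(1) distribution.
    eps = rng.gauss(0.0, sigma_nu / (1.0 - phi ** 2) ** 0.5)

    signal, data = [], []
    for t in range(n):
        if t > 0:
            # Local fluctuation plus an abrupt jump if t is a changepoint.
            mu += rng.gauss(0.0, sigma_eta) + jumps.get(t, 0.0)
            # AR(1) recursion for the observation noise.
            eps = phi * eps + rng.gauss(0.0, sigma_nu)
        signal.append(mu)
        data.append(mu + eps)
    return signal, data


if __name__ == "__main__":
    signal, data = simulate(200, changepoints=[70, 140], jump_sizes=[3.0, -2.0])
    print(len(data), round(signal[-1], 3))
```

Setting `sigma_eta=0` and `phi=0` recovers the piecewise-constant mean with independent noise that, according to the abstract, default implementations often assume.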
