Epidemic changepoint detection in the presence of nuisance changes.

Many time series problems feature epidemic changes - segments where a parameter deviates from a background baseline. The number and location of such changes can be estimated in a principled way by existing detection methods, providing that the background level is stable and known. However, practical data often contains nuisance changes in background level, which interfere with standard estimation techniques. Furthermore, such changes often differ from the target segments only in duration, and appear as false alarms in the detection results. To solve these issues, we propose a two-level detector that models and separates nuisance and signal changes. As part of this method, we developed a new, efficient approach to simultaneously estimate unknown, but fixed, background level and detect epidemic changes. The analytic and computational properties of the proposed methods are established, including consistency and convergence. We demonstrate via simulations that our two-level detector provides accurate estimation of changepoints under a nuisance process, while other state-of-the-art detectors fail. Using real-world genomic and demographic datasets, we demonstrate that our method can identify and localise target events while separating out seasonal variations and experimental artefacts.

[1]  Lei Shi,et al.  Online seismic event picking via sequential change-point detection , 2016, 2016 54th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[2]  J. Kline,et al.  The cusum test of homogeneity with an application in spontaneous abortion epidemiology. , 1985, Statistics in medicine.

[3]  J. Ibrahim,et al.  ZINBA integrates local covariates with DNA-seq data to identify broad and narrow regions of enrichment, even within amplified genomic regions , 2011, Genome Biology.

[4]  E. S. Page CONTINUOUS INSPECTION SCHEMES , 1954 .

[5]  David Pearce,et al.  The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions , 2000, INTERSPEECH.

[6]  P. Fearnhead,et al.  Generalized Functional Pruning Optimal Partitioning (GFPOP) for Constrained Changepoint Detection in Genomic Data , 2018, J. Stat. Softw..

[7]  Paul Fearnhead,et al.  On optimal multiple changepoint algorithms for large data , 2014, Statistics and Computing.

[8]  Toby Hocking,et al.  Optimizing ChIP-seq peak detectors using visual labels and supervised machine learning , 2016, Bioinform..

[9]  Consistency of a range of penalised cost approaches for detecting multiple changepoints , 2019, 1911.01716.

[10]  Kevin Bracker,et al.  Detecting and modeling changing volatility in the copper futures market , 1999 .

[11]  Diane J. Cook,et al.  A survey of methods for time series change point detection , 2017, Knowledge and Information Systems.

[12]  Pierre Priouret,et al.  Adaptive Algorithms and Stochastic Approximations , 1990, Applications of Mathematics.

[13]  Sumitra Purkayastha,et al.  Simple proofs of two results on convolutions of unimodal distributions , 1998 .

[14]  M. Wigler,et al.  Circular binary segmentation for the analysis of array-based DNA copy number data. , 2004, Biostatistics.

[15]  P. Fearnhead,et al.  Optimal detection of changepoints with a linear computational cost , 2011, 1101.1438.

[16]  Liliane Pellegrin,et al.  Outbreak definition by change point analysis: a tool for public health decision? , 2016, BMC Medical Informatics and Decision Making.

[17]  Minghui Li,et al.  Monitoring transmissibility and mortality of COVID-19 in Europe , 2020, International Journal of Infectious Diseases.

[18]  C. Yau,et al.  Alternating Pruned Dynamic Programming for Multiple Epidemic Change-Point Estimation , 2019 .

[19]  Paul Fearnhead,et al.  A linear time method for the detection of point and collective anomalies , 2018, ArXiv.

[20]  W. A. Shewhart,et al.  Economic quality control of manufactured product , 1930 .

[21]  Per Block,et al.  Demographic science aids in understanding the spread and fatality rates of COVID-19 , 2020, Proceedings of the National Academy of Sciences.

[22]  Arun Kejariwal,et al.  Automatic Anomaly Detection in the Cloud Via Statistical Learning , 2017, ArXiv.

[23]  James V. Zidek,et al.  Uncertainty, entropy, variance and the effect of partial information , 2003 .

[24]  Milos R Popovic,et al.  Application of singular spectrum-based change-point analysis to EMG-onset detection. , 2010, Journal of electromyography and kinesiology : official journal of the International Society of Electrophysiological Kinesiology.

[25]  Christopher K. I. Williams,et al.  An Evaluation of Change Point Detection Algorithms , 2020, ArXiv.

[26]  Xiaolong Qi,et al.  Real estimates of mortality following COVID-19 infection , 2020, The Lancet Infectious Diseases.

[27]  Nicholas J. A. Harvey,et al.  Tight Analyses for Non-Smooth Stochastic Gradient Descent , 2018, COLT.

[28]  P. Fryzlewicz,et al.  Narrowest‐over‐threshold detection of multiple change points and change‐point‐like features , 2016, Journal of the Royal Statistical Society: Series B (Statistical Methodology).

[29]  Piotr Fryzlewicz,et al.  Wild binary segmentation for multiple change-point detection , 2014, 1411.0858.

[30]  Wee Peng Tay,et al.  Quickest Change Detection in the Presence of a Nuisance Change , 2019, IEEE Transactions on Signal Processing.

[31]  Jeffrey D. Scargle,et al.  An algorithm for optimal partitioning of data on an interval , 2003, IEEE Signal Processing Letters.