A computationally efficient nonparametric approach for changepoint detection

In this paper we build on an approach proposed by Zou et al. (2014) for nonparametric changepoint detection. This approach defines the best segmentation for a data set as the one which minimises a penalised cost function, with the cost function defined in term of minus a non-parametric log-likelihood for data within each segment. Minimising this cost function is possible using dynamic programming, but their algorithm had a computational cost that is cubic in the length of the data set. To speed up computation, Zou et al. (2014) resorted to a screening procedure which means that the estimated segmentation is no longer guaranteed to be the global minimum of the cost function. We show that the screening procedure adversely affects the accuracy of the changepoint detection method, and show how a faster dynamic programming algorithm, pruned exact linear time (PELT) (Killick et al. 2012), can be used to find the optimal segmentation with a computational cost that can be close to linear in the amount of data. PELT requires a penalty to avoid under/over-fitting the model which can have a detrimental effect on the quality of the detected changepoints. To overcome this issue we use a relatively new method, changepoints over a range of penalties (Haynes et al. 2016), which finds all of the optimal segmentations for multiple penalty values over a continuous range. We apply our method to detect changes in heart-rate during physical activity.

[1]  Lie Lu,et al.  Speaker change detection and tracking in real-time news broadcasting analysis , 2002, MULTIMEDIA '02.

[2]  L. Dümbgen The Asymptotic Behavior of Some Nonparametric Change-Point Estimators , 1991 .

[3]  J. Bai,et al.  Least squares estimation of a shift in linear processes , 1994 .

[4]  Nicholas A. James,et al.  Change Points via Probabilistically Pruned Objectives , 2015, 1505.04302.

[5]  A. Pettitt A Non‐Parametric Approach to the Change‐Point Problem , 1979 .

[6]  Chung-Bow Lee,et al.  NONPARAMETRIC MULTIPLE CHANGE-POINT ESTIMATORS , 1996 .

[7]  A. Munk,et al.  Multiscale change point inference , 2013, 1301.7212.

[8]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[9]  P. Fearnhead,et al.  Optimal detection of changepoints with a linear computational cost , 2011, 1101.1438.

[10]  Chris D. Nugent,et al.  Utilizing Wearable Sensors to Investigate the Impact of Everyday Activities on Heart Rate , 2011, ICOST.

[11]  M. Wigler,et al.  Circular binary segmentation for the analysis of array-based DNA copy number data. , 2004, Biostatistics.

[12]  Richard A. Davis,et al.  Structural Break Estimation for Nonstationary Time Series Models , 2006 .

[13]  Eva Wesfreid,et al.  Detection of changes in the fractal scaling of heart rate and speed in a marathon race , 2009 .

[14]  Monika Ritsch-Marte,et al.  A new method for change-point detection developed for on-line analysis of the heart beat variability during sleep , 2005 .

[15]  P. Perron,et al.  Estimating and testing linear models with multiple structural changes , 1995 .

[16]  A. Aue,et al.  Structural breaks in time series , 2013 .

[17]  Eliane R. Rodrigues,et al.  An analysis of airport noise data using a non-homogeneous Poisson model with a change-point , 2015 .

[18]  C. Lawrence,et al.  Algorithms for the optimal identification of segment neighborhoods , 1989 .

[19]  Véronique Billat,et al.  Heart Rate Regulation Processed Through Wavelet Analysis and Change Detection: Some Case Studies , 2012, Acta biotheoretica.

[20]  Changliang Zou,et al.  Nonparametric maximum likelihood approach to multiple change-point problems , 2014, 1405.7173.

[21]  Richard A. Johnson,et al.  Nonparametric Tests for Shift at an Unknown Time Point , 1968 .

[22]  V. Liebscher,et al.  Consistencies and rates of convergence of jump-penalized least squares estimators , 2009, 0902.4838.

[23]  Piotr Fryzlewicz,et al.  Wild binary segmentation for multiple change-point detection , 2014, 1411.0858.

[24]  Niall M. Adams,et al.  Two Nonparametric Control Charts for Detecting Arbitrary Distribution Changes , 2012 .

[25]  Francis R. Bach,et al.  Learning Sparse Penalties for Change-point Detection using Max Margin Interval Regression , 2013, ICML.

[26]  Marc Lavielle,et al.  Using penalized contrasts for the change-point problem , 2005, Signal Process..

[27]  John A. D. Aston,et al.  The Uncertainty of Storm Season Changes: Quantifying the Uncertainty of Autocovariance Changepoints , 2015, Technometrics.

[28]  A. Scott,et al.  A Cluster Analysis Method for Grouping Means in the Analysis of Variance , 1974 .

[29]  Michael Baron,et al.  Nonparametric adaptive change point estimation and on line detection , 2000 .

[30]  Jin Zhang Powerful goodness‐of‐fit tests based on the likelihood ratio , 2002 .

[31]  David O Siegmund,et al.  A Modified Bayes Information Criterion with Applications to the Analysis of Comparative Genomic Hybridization Data , 2007, Biometrics.

[32]  David S. Matteson,et al.  A Nonparametric Approach for Multiple Change Point Analysis of Multivariate Data , 2013, 1306.4933.

[33]  Dominik Wied,et al.  Detecting relevant changes in time series models , 2014, 1403.8120.

[34]  S. Fotopoulos,et al.  Inference for single and multiple change‐points in time series , 2013 .

[35]  Yi-Ching Yao Estimating the number of change-points via Schwarz' criterion , 1988 .

[36]  E. Carlstein Nonparametric Change-Point Estimation , 1988 .

[37]  Jeffrey D. Scargle,et al.  An algorithm for optimal partitioning of data on an interval , 2003, IEEE Signal Processing Letters.

[38]  P. Fearnhead,et al.  Computationally Efficient Changepoint Detection for a Range of Penalties , 2017 .

[39]  A. Aubert,et al.  Heart Rate Variability in Athletes , 2003, Sports medicine.

[40]  E. S. Page CONTINUOUS INSPECTION SCHEMES , 1954 .