A Comparative Study on Parameter Selection and Outlier Removal for Change Point Detection in Time Series

Change point analysis is an efficient method for understanding the unexpected behaviour of the data used in many different disciplines. Although the literature contains a variety of change point analysis methods, there are relatively fewer studies that focus on the performance of parameter selection and outlier removal that are applied on real data sets. In this study two methods based on regression and statistical properties are proposed and compared with a method using Bayesian approach to evaluate their performance on the selection of parameters and removal of outliers. The methods are executed using different parameters on the well-log data set with and without outliers that are removed either manually or automatically. The results show that different data sets require different parameters to locate their change points. The proposed methods have intuitive parameters to control the algorithm, run faster, and do not require any assumptions to be made such as maximum number of change points. These properties also make them good candidates for online change point analysis.

[1]  David B. Wolfson,et al.  Maximum likelihood estimation in the multi-path change-point problem , 1993 .

[2]  Stephan M. Winkler,et al.  Sliding Window Symbolic Regression for Detecting Changes of System Dynamics , 2014, GPTP.

[3]  M. Lesperance,et al.  PIECEWISE REGRESSION: A TOOL FOR IDENTIFYING ECOLOGICAL THRESHOLDS , 2003 .

[4]  David Wettstein,et al.  Using the Penalized Likelihood Method for Model Selection with Nuisance Parameters Present only under the Alternative: An Application to Switching Regression Models , 2005 .

[5]  Diane J. Cook,et al.  A survey of methods for time series change point detection , 2017, Knowledge and Information Systems.

[6]  Yi-Ching Yao Estimating the number of change-points via Schwarz' criterion , 1988 .

[7]  K OrJ Numerical Bayesian methods applied to signal processing , 1996 .

[8]  E. S. Page CONTINUOUS INSPECTION SCHEMES , 1954 .

[9]  Nancy R. Zhang,et al.  Detecting simultaneous changepoints in multiple sequences. , 2010, Biometrika.

[10]  Robert J. Elliott,et al.  Filtering and change point estimation for hidden Markov-modulated Poisson processes , 2014, Appl. Math. Lett..

[11]  David L. Buckeridge,et al.  Application of change point analysis to daily influenza-like illness emergency department visits , 2012, J. Am. Medical Informatics Assoc..

[12]  Masoud Asgharian,et al.  Covariates in multipath change‐point problems: Modelling and consistency of the MLE , 2001 .

[13]  Paul Fearnhead,et al.  Exact and efficient Bayesian inference for multiple changepoint problems , 2006, Stat. Comput..

[14]  Ayman A. Mostafa,et al.  Bayesian and Non-Bayesian Analysis for Random Change Point Problem Using Standard Computer Packages , 2011 .

[15]  Ryan P. Adams,et al.  Bayesian Online Changepoint Detection , 2007, 0710.3742.

[16]  Carl E. Rasmussen,et al.  Gaussian Process Change Point Models , 2010, ICML.

[17]  R. Lund,et al.  Changepoint detection in daily precipitation data , 2012 .

[18]  M. Woodroofe,et al.  Isotonic regression: Another look at the changepoint problem , 2001 .

[19]  C. Loader CHANGE POINT ESTIMATION USING NONPARAMETRIC REGRESSION , 1996 .

[20]  Adrian F. M. Smith,et al.  Hierarchical Bayesian Analysis of Changepoint Problems , 1992 .

[21]  D. Hinkley Inference about the intersection in two-phase regression , 1969 .

[22]  J. Hartigan,et al.  A Bayesian Analysis for Change Point Problems , 1993 .

[23]  H. Chernoff,et al.  ESTIMATING THE CURRENT MEAN OF A NORMAL DISTRIBUTION WHICH IS SUBJECTED TO CHANGES IN TIME , 1964 .

[24]  M. Srivastava,et al.  On Tests for Detecting Change in Mean , 1975 .

[25]  Eric Ruggieri,et al.  An exact approach to Bayesian sequential change point detection , 2016, Comput. Stat. Data Anal..

[26]  H. Bazargan,et al.  A hybrid method for estimating the process change point using support vector machine and fuzzy statistical clustering , 2016, Appl. Soft Comput..

[27]  P. Fearnhead,et al.  On‐line inference for hidden Markov models via particle filters , 2003 .

[28]  Albert Vexler,et al.  Change point problems in the model of logistic regression , 2005 .

[29]  N. Chopin Dynamic Detection of Change Points in Long Time Series , 2007 .

[30]  J. Bai,et al.  Estimation of a Change Point in Multiple Regression Models , 1997, Review of Economics and Statistics.

[31]  A. Doucet,et al.  Bayesian Computational Methods for Inference in Multiple Change-points Models , 2011 .

[32]  D. Siegmund,et al.  A diffusion process and its applications to detecting a change in the drift of Brownian motion , 1984 .

[33]  V.K. Jandhyala,et al.  Tests for parameter changes at unknown times in linear regression models , 1991 .

[34]  F. E. Grubbs Procedures for Detecting Outlying Observations in Samples , 1969 .

[35]  L. A. Gardner On Detecting Changes in the Mean of Normal Variates , 1969 .

[36]  Jean-Marc Bardet,et al.  Multiple breaks detection in general causal time series using penalized quasi-likelihood , 2012 .

[37]  Rosangela Helena Loschi,et al.  Bayesian Identification of Multiple Change Points in Poisson Data , 2005, Adv. Complex Syst..

[38]  K. Riedel Numerical Bayesian Methods Applied to Signal Processing , 1996 .

[39]  D. Picard Testing and estimating change-points in time series , 1985, Advances in Applied Probability.

[40]  D. Siegmund,et al.  The likelihood ratio test for a change-point in simple linear regression , 1989 .

[41]  A. Shiryaev On Optimum Methods in Quickest Detection Problems , 1963 .

[42]  J. Durbin,et al.  Techniques for Testing the Constancy of Regression Relationships Over Time , 1975 .

[43]  E. S. Page A test for a change in a parameter occurring at an unknown point , 1955 .

[44]  C. Lawrence,et al.  Change point method for detecting regime shifts in paleoclimatic time series: Application to δ18O time series of the Plio‐Pleistocene , 2009 .

[45]  Sooyoung Cheon,et al.  Bayesian multiple change-point estimation with annealing stochastic approximation Monte Carlo , 2010, Comput. Stat..

[46]  Bernard Bobée,et al.  Bayesian change-point analysis in hydrometeorological time series. Part 1. The normal model revisited , 2000 .

[47]  S. W. Roberts A Comparison of Some Control Chart Procedures , 1966 .

[48]  Liping Zhao,et al.  A Support Vector Machine Based Multi-kernel Method for Change Point Estimation on Control Chart , 2015, 2015 IEEE International Conference on Systems, Man, and Cybernetics.

[49]  Arjun K. Gupta,et al.  Testing and Locating Variance Changepoints with Application to Stock Prices , 1997 .

[50]  A. Teschendorff,et al.  Using high-density DNA methylation arrays to profile copy number alterations , 2014, Genome Biology.

[51]  Arjun K. Gupta,et al.  Parametric Statistical Change Point Analysis , 2000 .