Multiple‐change‐point detection for high dimensional time series via sparsified binary segmentation

type="main" xml:id="rssb12079-abs-0001"> Time series segmentation, which is also known as multiple-change-point detection, is a well-established problem. However, few solutions have been designed specifically for high dimensional situations. Our interest is in segmenting the second-order structure of a high dimensional time series. In a generic step of a binary segmentation algorithm for multivariate time series, one natural solution is to combine cumulative sum statistics obtained from local periodograms and cross-periodograms of the components of the input time series. However, the standard ‘maximum’ and ‘average’ methods for doing so often fail in high dimensions when, for example, the change points are sparse across the panel or the cumulative sum statistics are spuriously large. We propose the sparsified binary segmentation algorithm which aggregates the cumulative sum statistics by adding only those that pass a certain threshold. This ‘sparsifying’ step reduces the influence of irrelevant noisy contributions, which is particularly beneficial in high dimensions. To show the consistency of sparsified binary segmentation, we introduce the multivariate locally stationary wavelet model for time series, which is a separate contribution of this work.

[1]  G. C. Tiao,et al.  Use of Cumulative Sums of Squares for Retrospective Detection of Changes of Variance , 1994 .

[2]  Arjun K. Gupta,et al.  Testing and Locating Variance Changepoints with Application to Stock Prices , 1997 .

[3]  Piotr Fryzlewicz,et al.  Modelling and forecasting financial log-returns as locally stationary wavelet processes , 2005 .

[4]  Hernando Ombao,et al.  The SLEX Model of a Non-Stationary Random Process , 2002 .

[5]  T. Mikosch,et al.  Nonstationarities in Financial Time Series, the Long-Range Dependence, and the IGARCH Effects , 2004, Review of Economics and Statistics.

[6]  J. Raz,et al.  Automatic Statistical Analysis of Bivariate Nonstationary Time Series , 2001 .

[7]  A. Korostelev On Minimax Estimation of a Discontinuous Signal , 1988 .

[8]  H. Ombao,et al.  SLEX Analysis of Multivariate Nonstationary Time Series , 2005 .

[9]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[10]  P. Fryzlewicz,et al.  Measuring dependence between non-stationary time series using the locally stationary wavelet model , 2008 .

[11]  É. Moulines,et al.  Least‐squares Estimation of an Unknown Number of Shifts in a Time Series , 2000 .

[12]  M. Lavielle,et al.  Detection of multiple change-points in multivariate time series , 2006 .

[13]  Piotr Fryzlewicz,et al.  Forecasting non-stationary time series by wavelet process modelling , 2003 .

[14]  G. Nason A test for second‐order stationarity and approximate confidence intervals for localized autocovariances for locally stationary time series , 2013 .

[15]  B. Silverman,et al.  The Stationary Wavelet Transform and some Statistical Applications , 1995 .

[16]  Piotr Fryzlewicz,et al.  Multiscale interpretation of taut string estimation and its connection to Unbalanced Haar wavelets , 2011, Stat. Comput..

[17]  Richard A. Davis,et al.  Structural Break Estimation for Nonstationary Time Series Models , 2006 .

[18]  Piotr Fryzlewicz,et al.  Estimating linear dependence between nonstationary time series using the locally stationary wavelet model , 2010 .

[19]  Carsten Jentsch,et al.  A test for second order stationarity of a multivariate time series , 2015 .

[20]  Piotr Fryzlewicz,et al.  Multiscale and multilevel technique for consistent segmentation of nonstationary time series , 2016, 1611.09727.

[21]  Richard A. Davis,et al.  Break Detection for a Class of Nonlinear Time Series Models , 2008 .

[22]  John Odenckantz,et al.  Nonparametric Statistics for Stochastic Processes: Estimation and Prediction , 2000, Technometrics.

[23]  Lei Qi,et al.  Sparse High Dimensional Models in Economics. , 2011, Annual review of economics.

[24]  Piotr Fryzlewicz,et al.  Wild binary segmentation for multiple change-point detection , 2014, 1411.0858.

[25]  George Kapetanios,et al.  MULTIVARIATE METHODS FOR MONITORING STRUCTURAL CHANGE , 2013 .

[26]  W. R. Buckland,et al.  Distributions in Statistics: Continuous Multivariate Distributions , 1973 .

[27]  A. Aue,et al.  Break detection in the covariance structure of multivariate time series models , 2009, 0911.3796.

[28]  Su Cheng Testing and Locating Variance Change Points with Application to the Volatility of Chinese Stock Market , 2003 .

[29]  Piotr Fryzlewicz,et al.  Haar–Fisz estimation of evolutionary wavelet spectra , 2006 .

[30]  Jean-Philippe Vert,et al.  Fast detection of multiple change-points shared by many signals using group LARS , 2010, NIPS.

[31]  Nouna Kettaneh,et al.  Statistical Modeling by Wavelets , 1999, Technometrics.

[32]  Yogesh K. Dwivedi,et al.  A test for second‐order stationarity of a time series based on the discrete Fourier transform , 2009, 0911.4744.

[33]  G. Nason,et al.  Wavelet processes and adaptive estimation of the evolutionary wavelet spectrum , 2000 .

[34]  L. Horváth,et al.  Change‐point detection in panel data , 2012 .