Simultaneous multiple change-point and factor analysis for high-dimensional time series

We propose the first comprehensive treatment of high-dimensional time series factor models with multiple change-points in their second-order structure. We operate under the most flexible definition of piecewise stationarity, and estimate the number and locations of change-points consistently as well as identifying whether they originate in the common or idiosyncratic components. Through the use of wavelets, we transform the problem of change-point detection in the second-order structure of a high-dimensional time series, into the (relatively easier) problem of change-point detection in the means of high-dimensional panel data. Also, our methodology circumvents the difficult issue of the accurate estimation of the true number of factors in the presence of multiple change-points by adopting a screening procedure. We further show that consistent factor analysis is achieved over each segment defined by the change-points estimated by the proposed methodology. In extensive simulation studies, we observe that factor analysis prior to change-point detection improves the detectability of change-points, and identify and describe an interesting ‘spillover’ effect in which substantial breaks in the idiosyncratic components get, naturally enough, identified as change-points in the common components, which prompts us to regard the corresponding change-points as also acting as a form of ‘factors’. Our methodology is implemented in the R package factorcpt, available from CRAN.

[1]  H. White,et al.  Automatic Block-Length Selection for the Dependent Bootstrap , 2004 .

[2]  Jushan Bai,et al.  Estimation and Inference of Structural Changes in High Dimensional Factor Models , 2017 .

[3]  M. Rothschild,et al.  Arbitrage, Factor Structure, and Mean-Variance Analysis on Large Asset Markets , 1983 .

[4]  Marco Lippi,et al.  OPENING THE BLACK BOX: STRUCTURAL FACTOR MODELS WITH LARGE CROSS SECTIONS , 2009, Econometric Theory.

[5]  J. Stock,et al.  Consistent Factor Estimation in Dynamic Factor Models with Structural Instability , 2013 .

[6]  Daniele Massacci Least Squares Estimation of Large Dimensional Threshold Factor Models , 2016 .

[7]  J. Bai,et al.  Determining the Number of Factors in Approximate Factor Models , 2000 .

[8]  Yohei Yamamoto,et al.  Testing for Factor Loading Structural Change under Common Breaks , 2015 .

[9]  Markus Pelger Large-Dimensional Factor Modeling Based on High-Frequency Observations , 2018 .

[10]  Atsushi Inoue,et al.  TESTS FOR PARAMETER INSTABILITY IN DYNAMIC FACTOR MODELS , 2014, Econometric Theory.

[11]  Mark W. Watson,et al.  Forecasting in dynamic factor models subject to structural instability , 2009 .

[12]  Matteo Barigozzi,et al.  A network analysis of the volatility of high dimensional financial series , 2017 .

[13]  B. Brodsky,et al.  Nonparametric Methods in Change Point Problems , 1993 .

[14]  M. Hallin,et al.  Networks, Dynamic Factors, and the Volatility Analysis of High-Dimensional Financial Series , 2015, 1510.05118.

[15]  Marco Lippi,et al.  THE GENERALIZED DYNAMIC FACTOR MODEL: REPRESENTATION THEORY , 2001, Econometric Theory.

[16]  A. Korostelev On Minimax Estimation of a Discontinuous Signal , 1988 .

[17]  Haeran Cho,et al.  Change-point detection in panel data via double CUSUM statistic , 2016, 1611.08631.

[18]  Matteo Barigozzi,et al.  Improved penalization for determining the number of factors in approximate factor models , 2010 .

[19]  Shujie Ma,et al.  Estimation of large dimensional factor models with an unknown number of breaks , 2018, Journal of Econometrics.

[20]  Dacheng Xiu,et al.  Using Principal Component Analysis to Estimate a High Dimensional Factor Model with High-Frequency Data , 2016 .

[21]  Joseph P. Romano,et al.  The stationary bootstrap , 1994 .

[22]  Frank Schorfheide,et al.  Shrinkage Estimation of High-Dimensional Factor Models with Structural Instabilities , 2013 .

[23]  Laure Sansonnet,et al.  Nonparametric homogeneity tests and multiple change-point estimation for analyzing large Hi-C data matrices , 2016, 1605.03751.

[24]  M. Hallin,et al.  The Generalized Dynamic-Factor Model: Identification and Estimation , 2000, Review of Economics and Statistics.

[25]  George Tauchen,et al.  Rank Tests at Jump Events , 2019 .

[26]  Zoran Nikoloski,et al.  Segmentation of biological multivariate time-series data , 2015, Scientific Reports.

[27]  Piotr Fryzlewicz,et al.  Haar–Fisz estimation of evolutionary wavelet spectra , 2006 .

[28]  Claudia Kirch,et al.  Change Points in High Dimensional Settings , 2014 .

[29]  Marco Lippi,et al.  Factor models in high-dimensional time series—A time-domain approach , 2013 .

[30]  Tengyao Wang,et al.  High dimensional change point estimation via sparse projection , 2016, 1606.06246.

[31]  Badi H. Baltagi,et al.  Identification and estimation of a large factor model with structural instability , 2017 .

[32]  Hernando Ombao,et al.  FreSpeD: Frequency-Specific Change-Point Detection in Epileptic Seizure Multi-Channel EEG Data , 2018, Journal of the American Statistical Association.

[33]  Jörg Breitung,et al.  Testing for Structural Breaks in Dynamic Factor Models , 2011, SSRN Electronic Journal.

[34]  George Kapetanios,et al.  MULTIVARIATE METHODS FOR MONITORING STRUCTURAL CHANGE , 2013 .

[35]  A. Onatski Determining the Number of Factors from Empirical Distribution of Eigenvalues , 2010, The Review of Economics and Statistics.

[36]  A. Onatski Asymptotic analysis of the squared estimation error in misspecified factor models , 2015 .

[37]  M. Jirak Uniform change point tests in high dimension , 2015, 1511.05333.

[38]  Piotr Fryzlewicz,et al.  Multiple‐change‐point detection for high dimensional time series via sparsified binary segmentation , 2015, 1611.08639.

[39]  J. Bai,et al.  Inferential Theory for Factor Models of Large Dimensions , 2003 .

[40]  Rainer von Sachs,et al.  Locally adaptive estimation of evolutionary wavelet spectra , 2008, 0808.1452.

[41]  Piotr Fryzlewicz,et al.  Multiscale and multilevel technique for consistent segmentation of nonstationary time series , 2016, 1611.09727.

[42]  John Odenckantz,et al.  Nonparametric Statistics for Stochastic Processes: Estimation and Prediction , 2000, Technometrics.

[43]  Mtw,et al.  Nonparametric Statistics for Stochastic Process: Estimation and Prediction , 2000 .

[44]  P. Fryzlewicz,et al.  Multiple‐change‐point detection for auto‐regressive conditional heteroscedastic processes , 2014 .

[45]  Andrew J. Patton,et al.  Correction to “Automatic Block-Length Selection for the Dependent Bootstrap” by D. Politis and H. White , 2009 .

[46]  P. Fryzlewicz,et al.  factorcpt: Simultaneous Change-Point and Factor Analysis , 2016 .

[47]  Jianqing Fan,et al.  High Dimensional Covariance Matrix Estimation in Approximate Factor Models , 2011, Annals of statistics.

[48]  W. Kahan,et al.  The Rotation of Eigenvectors by a Perturbation. III , 1970 .

[49]  Sı́lvia Gonçalves,et al.  Bootstrapping Factor-Augmented Regression Models , 2012 .

[50]  J. Stock,et al.  Forecasting Using Principal Components From a Large Number of Predictors , 2002 .

[51]  Valentina Corradi,et al.  Testing for Structural Stability of Factor Augmented Forecasting Models , 2013 .

[52]  J. Bai,et al.  Estimation and Inference of Change Points in High Dimensional Factor Models , 2018 .

[53]  Carsten Jentsch,et al.  Covariance matrix estimation and linear process bootstrap for multivariate time series of possibly increasing dimension , 2015, 1506.00816.

[54]  Jianqing Fan,et al.  Large covariance estimation by thresholding principal orthogonal complements , 2011, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[55]  Seung C. Ahn,et al.  Eigenvalue Ratio Test for the Number of Factors , 2013 .

[56]  Piotr Fryzlewicz,et al.  Wild binary segmentation for multiple change-point detection , 2014, 1411.0858.

[57]  Lorenzo Trapani On bootstrapping panel factor series , 2013 .

[58]  Piotr Fryzlewicz,et al.  Multiscale interpretation of taut string estimation and its connection to Unbalanced Haar wavelets , 2011, Stat. Comput..

[59]  Xiaodong Liu,et al.  A hybrid segmentation method for multivariate time series based on the dynamic factor model , 2017, Stochastic Environmental Research and Risk Assessment.

[60]  Farida Enikeeva,et al.  High-dimensional change-point detection with sparse alternatives , 2013, 1312.1900.

[61]  Tengyao Wang,et al.  A useful variant of the Davis--Kahan theorem for statisticians , 2014, 1405.0680.

[62]  Benoit Perron,et al.  Bootstrapping factor models with cross sectional dependence , 2018 .

[63]  L. Horváth,et al.  Change‐point detection in panel data , 2012 .

[64]  Jukka-Pekka Onnela,et al.  Change Point Detection in Correlation Networks , 2014, Scientific Reports.

[65]  Chandler Davis The rotation of eigenvectors by a perturbation , 1963 .

[66]  E. Rio,et al.  A Bernstein type inequality and moderate deviations for weakly dependent sequences , 2009, 0902.0582.

[67]  Lei Qi,et al.  Sparse High Dimensional Models in Economics. , 2011, Annual review of economics.

[68]  Jesus Gonzalo,et al.  Detecting big structural breaks in large factor models , 2014 .

[69]  G. Nason,et al.  Wavelet processes and adaptive estimation of the evolutionary wavelet spectrum , 2000 .