BayesProject: Fast computation of a projection direction for multivariate changepoint detection

This article focuses on the challenging problem of efficiently detecting changes in mean within multivariate data sequences. Multivariate changepoints can be detected by projecting a multivariate series to a univariate one using a suitable projection direction that preserves a maximal proportion of signal information. However, for some existing approaches the computation of such a projection direction can scale unfavourably with the number of series and might rely on additional assumptions on the data sequences, thus limiting their generality. We introduce BayesProject, a computationally inexpensive Bayesian approach to compute a projection direction in such a setting. The proposed approach allows the incorporation of prior knowledge of the changepoint scenario, when such information is available, which can help to increase the accuracy of the method. A simulation study shows that BayesProject is robust, yields projections close to the oracle projection direction and, moreover, that its accuracy in detecting changepoints is comparable to, or better than, existing algorithms while scaling linearly with the number of series.

[1]  Nancy R. Zhang,et al.  Detecting simultaneous variant intervals in aligned sequences , 2011, 1108.3177.

[2]  Haeran Cho,et al.  Change-point detection in panel data via double CUSUM statistic , 2016, 1611.08631.

[3]  P. Fearnhead,et al.  Computationally Efficient Changepoint Detection for a Range of Penalties , 2017 .

[4]  Julia Hirschberg,et al.  V-Measure: A Conditional Entropy-Based External Cluster Evaluation Measure , 2007, EMNLP.

[5]  David S. Matteson,et al.  A Nonparametric Approach for Multiple Change Point Analysis of Multivariate Data , 2013, 1306.4933.

[6]  A. Munk,et al.  Multiscale change point inference , 2013, 1301.7212.

[7]  Yann Guédon,et al.  Exploring the latent segmentation space for the assessment of multiple change-point models , 2013, Comput. Stat..

[8]  Gueorgui Mihaylov,et al.  High-dimensional changepoint detection via a geometrically inspired mapping , 2020, Stat. Comput..

[9]  E. S. Page CONTINUOUS INSPECTION SCHEMES , 1954 .

[10]  Piotr Fryzlewicz,et al.  Multiple‐change‐point detection for high dimensional time series via sparsified binary segmentation , 2015, 1611.08639.

[11]  Markus Eriksson,et al.  Computationally Efficient Off-Line Joint Change Point Detection in Multiple Time Series , 2019, IEEE Transactions on Signal Processing.

[12]  A. Aue,et al.  Break detection in the covariance structure of multivariate time series models , 2009, 0911.3796.

[13]  D. Siegmund,et al.  Tests for a change-point , 1987 .

[14]  Paul Fearnhead,et al.  On optimal multiple changepoint algorithms for large data , 2014, Statistics and Computing.

[15]  P. Perron,et al.  Estimating and testing linear models with multiple structural changes , 1995 .

[16]  Tengyao Wang,et al.  High dimensional change point estimation via sparse projection , 2016, 1606.06246.

[17]  John A. D. Aston,et al.  The Uncertainty of Storm Season Changes: Quantifying the Uncertainty of Autocovariance Changepoints , 2015, Technometrics.

[18]  L. Duembgen,et al.  Maximum likelihood estimation of a log-concave density and its distribution function: Basic properties and uniform consistency , 2007, 0709.0334.

[19]  H. Dette,et al.  Detection of Multiple Structural Breaks in Multivariate Time Series , 2013, 1309.1309.

[20]  Manuel Davy,et al.  An online kernel change detection algorithm , 2005, IEEE Transactions on Signal Processing.

[21]  M. Srivastava,et al.  Likelihood Ratio Tests for a Change in the Multivariate Normal Mean , 1986 .

[22]  B. Mercier,et al.  A dual algorithm for the solution of nonlinear variational problems via finite element approximation , 1976 .

[23]  Farida Enikeeva,et al.  High-dimensional change-point detection with sparse alternatives , 2013, 1312.1900.

[24]  Nicolas Vayatis,et al.  A review of change point detection methods , 2018, ArXiv.

[25]  Olivier Cappé,et al.  Distributed detection/localization of change-points in high-dimensional network traffic data , 2009, Statistics and Computing.

[26]  Gene H. Golub,et al.  Matrix computations , 1983 .

[27]  L. Horváth,et al.  Extensions of some classical methods in change point analysis , 2014 .

[28]  A. Munk,et al.  FDR-Control in Multiscale Change-point Segmentation , 2014, 1412.5844.

[29]  Peter Neal,et al.  Dynamic stochastic block models: parameter estimation and detection of changes in community structure , 2018, Stat. Comput..

[30]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[31]  C'eline L'evy-Leduc,et al.  Detection and localization of change-points in high-dimensional network traffic data , 2009, 0908.2310.

[32]  P. Rousseeuw,et al.  Alternatives to the Median Absolute Deviation , 1993 .

[33]  Piotr Fryzlewicz,et al.  Wild binary segmentation for multiple change-point detection , 2014, 1411.0858.

[34]  Nancy R. Zhang,et al.  Detecting simultaneous changepoints in multiple sequences. , 2010, Biometrika.