Consistent and powerful graph-based change-point test for high-dimensional data

Significance Change-point detection in high-dimensional time series is necessary in many areas of science and engineering, including neuroscience, signal processing, network evolution, image analysis, and text analysis. In terms of a multivariate generalization of the Wald–Wolfowitz run test using the shortest Hamiltonian path, this paper proposes a distribution-free, consistent graph-based change-point detection for high-dimensional data. Once a change-point is detected, its location is estimated by using ratio cut. The test is very powerful against alternatives with a shift in mean or variance and is accurate in change-point estimation. Its applicability is demonstrated in the example of tracking cell division. A change-point detection is proposed by using a Bayesian-type statistic based on the shortest Hamiltonian path, and the change-point is estimated by using ratio cut. A permutation procedure is applied to approximate the significance of Bayesian-type statistics. The change-point test is proven to be consistent, and an error probability in change-point estimation is provided. The test is very powerful against alternatives with a shift in variance and is accurate in change-point estimation, as shown in simulation studies. Its applicability in tracking cell division is illustrated.

[1]  J. Kruskal On the shortest spanning subtree of a graph and the traveling salesman problem , 1956 .

[2]  H. Chernoff,et al.  ESTIMATING THE CURRENT MEAN OF A NORMAL DISTRIBUTION WHICH IS SUBJECTED TO CHANGES IN TIME , 1964 .

[3]  Davi Geiger,et al.  Label free cell-tracking and division detection based on 2D time-lapse images for lineage analysis of early embryo development , 2014, Comput. Biol. Medicine.

[4]  Piotr Fryzlewicz,et al.  Multiple‐change‐point detection for high dimensional time series via sparsified binary segmentation , 2015, 1611.08639.

[5]  Anil K. Ghosh,et al.  A distribution-free two-sample run test applicable to high-dimensional data , 2014 .

[6]  Christian Demant,et al.  Industrial Image Processing , 2013, Springer Berlin Heidelberg.

[7]  George Michailidis,et al.  Change point estimation in high dimensional Markov random‐field models , 2014, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[8]  J. Wolfowitz,et al.  On a Test Whether Two Samples are from the Same Population , 1940 .

[9]  Hao Chen,et al.  Graph-based change-point detection , 2012, 1209.1625.

[10]  Ramin Zabih,et al.  Comparing images using color coherence vectors , 1997, MULTIMEDIA '96.

[11]  Chung-Kuan Cheng,et al.  Towards efficient hierarchical designs by ratio cut partitioning , 1989, 1989 IEEE International Conference on Computer-Aided Design. Digest of Technical Papers.

[12]  L. A. Gardner On Detecting Changes in the Mean of Normal Variates , 1969 .

[13]  Peter Waszkewitz,et al.  Industrial Image Processing: Visual Quality Control in Manufacturing , 1999 .

[14]  J. Friedman,et al.  Multivariate generalizations of the Wald--Wolfowitz and Smirnov two-sample tests , 1979 .

[15]  M. Jirak Uniform change point tests in high dimension , 2015, 1511.05333.