论文信息 - Pruned dynamic programming for optimal multiple change-point detection

Pruned dynamic programming for optimal multiple change-point detection

Multiple change-point detection models assume that the observed data is a realization of an independent random process affected by K − 1 abrupt changes, called change-points, at some unknown positions. For off-line detection a dynamic programming (DP) algorithm retrieves the K − 1 change-points minimizing the quadratic loss and reduces the complexity from Θ(nK) to Θ(Kn2) where n is the number of observations. The quadratic complexity in n still restricts the use of such an algorithm to small or intermediate values of n. We propose a pruned DP algorithm that recovers the optimal solution. We demonstrate that at worst the complexity is in O(Kn2) time and O(Kn) space and is therefore at worst equivalent to the classical DP. We show empirically that the run-time of our proposed algorithm is drastically reduced compared to the classical DP algorithm. More precisely, our algorithm is able to process a million points in a matter of minutes compared to several days with the classical DP algorithm. Moreover, the principle of the proposed algorithm can be extended to other convex losses (for example the Poisson loss) and as the algorithm process one observation after the other it could be adapted for on-line problems.

Guillem Rigaill | G. Rigaill

[1] Yonina C. Eldar,et al. A fast and flexible method for the segmentation of aCGH data , 2008, ECCB.

[2] Servane Gey,et al. Using CART to Detect Multiple Change Points in the Mean for Large Sample , 2008 .

[3] Y. Guédon. Exploring the segmentation space for models multiple change-point models , 2008 .

[4] R. Tibshirani,et al. Spatial smoothing and hot spot detection for CGH data using the fused lasso. , 2008, Biostatistics.

[5] Zaïd Harchaoui,et al. Catching Change-points with Lasso , 2007, NIPS.

[6] P. Fearnhead,et al. On‐line inference for multiple changepoint problems , 2007 .

[7] N. Chopin. Dynamic Detection of Change Points in Long Time Series , 2007 .

[8] Marc Lavielle,et al. Using penalized contrasts for the change-point problem , 2005, Signal Process..

[9] Franck Picard,et al. A statistical approach for array CGH data analysis , 2005, BMC Bioinformatics.

[10] P. Perron,et al. Computation and Analysis of Multiple Structural-Change Models , 1998 .

[11] P. Perron,et al. Estimating and testing linear models with multiple structural changes , 1995 .

[12] Michèle Basseville,et al. Detection of abrupt changes: theory and application , 1993 .