On‐line inference for multiple changepoint problems

Summary.  We propose an on‐line algorithm for exact filtering of multiple changepoint problems. This algorithm enables simulation from the true joint posterior distribution of the number and position of the changepoints for a class of changepoint models. The computational cost of this exact algorithm is quadratic in the number of observations. We further show how resampling ideas from particle filters can be used to reduce the computational cost to linear in the number of observations, at the expense of introducing small errors, and we propose two new, optimum resampling algorithms for this problem. One, a version of rejection control, allows the particle filter to choose the number of particles that are required at each time step automatically. The new resampling algorithms substantially outperform standard resampling algorithms on examples that we consider; and we demonstrate how the resulting particle filter is practicable for segmentation of human G+C content.

[1]  J. Hartigan,et al.  Product Partition Models for Change Point Problems , 1992 .

[2]  I. Johnstone,et al.  Ideal spatial adaptation by wavelet shrinkage , 1994 .

[3]  D. Stephens Bayesian Retrospective Multiple‐Changepoint Identification , 1994 .

[4]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[5]  Jun S. Liu,et al.  Sequential Monte Carlo methods for dynamic systems , 1997 .

[6]  H. Müller,et al.  Statistical methods for DNA sequence segmentation , 1998 .

[7]  S. Chib Estimation and comparison of multiple change-point models , 1998 .

[8]  Jun S. Liu,et al.  Rejection Control and Sequential Importance Sampling , 1998 .

[9]  P. Fearnhead,et al.  Improved particle filter for nonlinear problems , 1999 .

[10]  Jun S. Liu,et al.  Bayesian inference on biopolymer models , 1999, Bioinform..

[11]  P. Fearnhead,et al.  An improved particle filter for non-linear problems , 1999 .

[12]  G Bernardi,et al.  Isochores and the evolutionary genomics of vertebrates. , 2000, Gene.

[13]  H. Müller,et al.  Multiple changepoint fitting via quasilikelihood, with application to DNA sequence segmentation , 2000 .

[14]  Jun S. Liu,et al.  Mixture Kalman filters , 2000 .

[15]  Rong Chen,et al.  A Theoretical Framework for Sequential Importance Sampling with Resampling , 2001, Sequential Monte Carlo Methods in Practice.

[16]  Christophe Andrieu,et al.  Bayesian curve fitting using MCMC with applications to signal segmentation , 2002, IEEE Trans. Signal Process..

[17]  N. Chopin A sequential particle filter method for static models , 2002 .

[18]  Robert Lund,et al.  Detection of Undocumented Changepoints: A Revision of the Two-Phase Regression Model , 2002 .

[19]  P. Moral,et al.  Sequential Monte Carlo samplers , 2002, cond-mat/0212648.

[20]  H. Bergman,et al.  Detection of onset of neuronal activity by allowing for heterogeneity in the change points , 2002, Journal of Neuroscience Methods.

[21]  P. Fearnhead,et al.  On‐line inference for hidden Markov models via particle filters , 2003 .

[22]  David Haussler,et al.  Covariation in frequencies of substitution, deletion, transposition, and recombination during eutherian evolution. , 2003, Genome research.

[23]  S. Harkema,et al.  A Bayesian change-point analysis of electromyographic data: detecting muscle activation patterns and associated applications. , 2003, Biostatistics.

[24]  Michael Hackenberg,et al.  IsoFinder: computational prediction of isochores in genome sequences , 2004, Nucleic Acids Res..

[25]  Hugh Griffiths,et al.  IEE Proceedings - Radar, Sonar and Navigation , 2004 .

[26]  Laurence H Pearl,et al.  Crystal structure of the catalytic fragment of murine poly(ADP-ribose) polymerase-2. , 2002, Nucleic acids research.

[27]  Paul Fearnhead,et al.  Exact Bayesian curve fitting and signal segmentation , 2005, IEEE Transactions on Signal Processing.

[28]  Paul Fearnhead,et al.  Exact and efficient Bayesian inference for multiple changepoint problems , 2006, Stat. Comput..

[29]  N. Chopin Dynamic Detection of Change Points in Long Time Series , 2007 .