An Improved Bayesian Information Criterion for Multiple Change-Point Models

In multiple change-point analysis, inferring the number of change points is often achieved by minimizing a selection criterion that trades off data fidelity with complexity. We address the open problem of defining a selection criterion adapted to the context of multiple change-point analysis. Our approach is inspired by the Schwarz seminal formulation of the Bayesian information criterion (BIC): similarly, we introduce priors—here describing the occurrence of change points—and we use the Laplace approximation to derive a closed-form expression of the criterion. Differently from this previous work, we take advantage of the a priori information introduced, instead of asymptotically eliminating the dependence on priors. Results obtained on simulated series show a substantial gain in performance versus recent alternative criteria used in multiple change-point analysis. Results also show that the a priori information introduced in our criterion on the regularity of interevent times is the main driver of this substantial performance gain. Methods are motivated by and demonstrated on a meteorological application involving the homogenization of a temperature series.

[1]  Robert Lund,et al.  An MDL approach to the climate segmentation problem , 2010, 1010.1397.

[2]  P. Bickel Efficient and Adaptive Estimation for Semiparametric Models , 1993 .

[3]  Richard A. Davis,et al.  Structural Break Estimation for Nonstationary Time Series Models , 2006 .

[4]  Arjun K. Gupta,et al.  Parametric Statistical Change Point Analysis , 2000 .

[5]  T. Louis,et al.  Empirical Bayes: Past, Present and Future , 2000 .

[6]  Thomas H. Cormen,et al.  Introduction to algorithms [2nd ed.] , 2001 .

[7]  Philippe Naveau,et al.  Bayesian multiple change points and segmentation: Application to homogenization of climatic series , 2009 .

[8]  D. Hawkins Fitting multiple change-point models to data , 2001 .

[9]  Yi-Ching Yao Estimating the number of change-points via Schwarz' criterion , 1988 .

[10]  T. Lai,et al.  AUTOREGRESSIVE MODELS WITH PIECEWISE CONSTANT VOLATILITY AND REGRESSION PARAMETERS , 2005 .

[11]  H. Müller,et al.  Multiple changepoint fitting via quasilikelihood, with application to DNA sequence segmentation , 2000 .

[12]  Claude N. Williams,et al.  Homogenization of Temperature Series via Pairwise Comparisons , 2009 .

[13]  Andrea Toreti,et al.  Homogenization of daily maximum temperature series in the Mediterranean , 2009 .

[14]  Jeffrey C. Lagarias,et al.  Convergence Properties of the Nelder-Mead Simplex Method in Low Dimensions , 1998, SIAM J. Optim..

[15]  Haipeng Xing,et al.  A SIMPLE BAYESIAN APPROACH TO MULTIPLE CHANGE-POINTS , 2011 .

[16]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[17]  Marc Lavielle,et al.  An application of MCMC methods for the multiple change-points problem , 2001, Signal Process..

[18]  H. Akaike A new look at the statistical model identification , 1974 .

[19]  Paul Embrechts,et al.  Change-Point Analysis for Dependence Structures in Finance and Insurance , 2001 .

[20]  Yoshiyuki Ninomiya,et al.  Information criterion for Gaussian change-point model , 2005 .

[21]  Helmut E. Landsberg,et al.  Methods in Climatology , 1945 .

[22]  David O Siegmund,et al.  A Modified Bayes Information Criterion with Applications to the Analysis of Comparative Genomic Hybridization Data , 2007, Biometrics.

[23]  L. Tierney,et al.  Fully Exponential Laplace Approximations to Expectations and Variances of Nonpositive Functions , 1989 .

[24]  Henri Caussinus,et al.  Detection and correction of artificial shifts in climate series , 2004 .

[25]  Olivier Mestre,et al.  Decadal to secular time scales variability in temperature measurements over France , 2006 .

[26]  Jiahua Chen,et al.  Application of modified information criterion to multiple change point problems , 2006 .

[27]  A. Raftery,et al.  Bayesian analysis of a Poisson process with a change-point , 1986 .

[28]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[29]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[30]  S. Chib Estimation and comparison of multiple change-point models , 1998 .

[31]  Yi-Ching Yao Estimation of a Noisy Discrete-Time Step Function: Bayes and Empirical Bayes Approaches , 1984 .

[32]  Michèle Basseville,et al.  Detection of abrupt changes: theory and application , 1993 .

[33]  Jacques Bernier,et al.  Retrospective multivariate Bayesian change-point analysis: A simultaneous single change in the mean of several hydrological sequences , 2000 .

[34]  Christian P. Robert,et al.  The Bayesian choice : from decision-theoretic foundations to computational implementation , 2007 .

[35]  Michèle Basseville,et al.  Detection of Abrupt Changes: Theory and Applications. , 1995 .

[36]  Robert Lund,et al.  A Review and Comparison of Changepoint Detection Techniques for Climate Data , 2007 .

[37]  Marc Lavielle,et al.  Using penalized contrasts for the change-point problem , 2005, Signal Process..

[38]  D. Albertson,et al.  Chromosome aberrations in solid tumors , 2003, Nature Genetics.

[39]  K. Do,et al.  Efficient and Adaptive Estimation for Semiparametric Models. , 1994 .

[40]  Monika Serbinowska,et al.  Consistency of an estimator of the number of changes in binomial observations , 1996 .

[41]  P. Massart,et al.  Gaussian model selection , 2001 .

[42]  Alain Celisse,et al.  Model selection via cross-validation in density estimation, regression, and change-points detection , 2008 .

[43]  Arjun K. Gupta,et al.  Testing and Locating Variance Changepoints with Application to Stock Prices , 1997 .

[44]  C. Varin On composite marginal likelihoods , 2008 .

[45]  L. Wasserman,et al.  Computing Bayes Factors by Combining Simulation and Asymptotic Approximations , 1997 .