Bayesian cross validation for gravitational-wave searches in pulsar-timing array data

Gravitational-wave data analysis demands sophisticated statistical noise models in a bid to extract highly obscured signals from data. In Bayesian model comparison, we choose among a landscape of models by comparing their marginal likelihoods. However, this computation is numerically fraught and can be sensitive to arbitrary choices in the specification of parameter priors. In Bayesian cross validation, we characterize the fit and predictive power of a model by computing the Bayesian posterior of its parameters in a training data set, and then use that posterior to compute the averaged likelihood of a different testing data set. The resulting cross-validation scores are straightforward to compute; they are insensitive to prior tuning; and they penalize unnecessarily complex models that overfit the training data at the expense of predictive performance. In this article, we discuss cross validation in the context of pulsar-timing-array data analysis, and we exemplify its application to simulated pulsar data (where it successfully selects the correct spectral index of a stochastic gravitational-wave background), and to a pulsar data set from the NANOGrav 11-yr release (where it convincingly favours a model that represents a transient feature in the interstellar medium). We argue that cross validation offers a promising alternative to Bayesian model comparison, and we discuss its use for gravitational-wave detection, by selecting or refuting models that include a gravitational-wave component.

[1]  Chiara M. F. Mingarelli,et al.  The Second International Pulsar Timing Array Mock Data Challenge , 2018, 1810.10527.

[2]  P. S. Ray,et al.  The NANOGrav 11 Year Data Set: Pulsar-timing Constraints on the Stochastic Gravitational-wave Background , 2018, 1801.02617.

[3]  Stephen R. Taylor,et al.  The NANOGrav 11-year Data Set: High-precision Timing of 45 Millisecond Pulsars , 2017, 1801.01837.

[4]  D. Stinebring,et al.  A Second Chromatic Timing Event of Interstellar Origin toward PSR J1713+0747 , 2017, The Astrophysical Journal.

[5]  A. Sesana Prospects for Multiband Gravitational-Wave Astronomy after GW150914. , 2016, Physical review letters.

[6]  D. Stinebring,et al.  From spin noise to systematics: Stochastic processes in the first International Pulsar Timing Array data release , 2016, 1602.05570.

[7]  N. Cornish,et al.  Towards robust gravitational wave detection with pulsar timing arrays , 2015, 1512.06829.

[8]  J. Cordes,et al.  SYSTEMATIC AND STOCHASTIC VARIATIONS IN PULSAR DISPERSION MEASURES , 2015, 1512.02203.

[9]  A. Lommen Pulsar timing arrays: the promise of gravitational wave detection , 2015, Reports on progress in physics. Physical Society.

[10]  P. Lasky,et al.  Pulsar timing noise and the minimum observation time to detect gravitational waves with pulsar timing arrays , 2015, 1503.03298.

[11]  S. McWilliams,et al.  Constraining the Solution to the Last Parsec Problem with Pulsar Timing , 2015, 1503.02662.

[12]  M. Vallisneri,et al.  New advances in the Gaussian-process approach to pulsar-timing data analysis , 2014, 1407.1838.

[13]  J. Gair,et al.  Accelerated Bayesian model-selection and parameter-estimation in continuous gravitational-wave searches with pulsar-timing arrays , 2014, 1406.5224.

[14]  J. Cordes Limits to PTA sensitivity: spin stability and arrival time precision of millisecond pulsars , 2013 .

[15]  D. Stinebring Effects of the interstellar medium on detection of low-frequency gravitational waves , 2013, 1310.8316.

[16]  J. Gair,et al.  Weighing The Evidence For A Gravitational-Wave Background In The First International Pulsar Timing Array Data Challenge , 2012, 1210.6014.

[17]  David B. Dunson,et al.  Bayesian data analysis, third edition , 2013 .

[18]  M. Vallisneri Testing general relativity with gravitational waves: a reality check , 2012, 1207.4759.

[19]  J. Cordes,et al.  A Measurement Model for Precision Pulsar Timing , 2010, 1010.3785.

[20]  John K Kruschke,et al.  Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[21]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[22]  R. Trotta Bayes in the sky: Bayesian inference and model selection in cosmology , 2008, 0803.4089.

[23]  Scott A. Sisson,et al.  Transdimensional Markov Chains , 2005 .

[24]  Philip C. Gregory,et al.  Bayesian Logical Data Analysis for the Physical Sciences: Acknowledgements , 2005 .

[25]  P. Gregory Bayesian Logical Data Analysis for the Physical Sciences: Multivariate Gaussian from maximum entropy , 2005 .

[26]  S. Godsill On the Relationship Between Markov chain Monte Carlo Methods for Model Uncertainty , 2001 .

[27]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .