论文信息 - The Posterior Predictive Null

The Posterior Predictive Null

. Bayesian model criticism is an important part of the practice of Bayesian statistics. Traditionally, model criticism methods have been based on the predictive check, an adaptation of goodness-of-ﬁt testing to Bayesian modeling and an eﬀective method to understand how well a model captures the distribution of the data. In modern practice, however, researchers iteratively build and develop many models, exploring a space of models to help solve the problem at hand. While classical predictive checks can help assess each one, they cannot help the researcher understand how the models relate to each other. This paper introduces the posterior predictive null check (PPN), a method for Bayesian model criticism that helps characterize the relationships between models. The idea behind the PPN is to check whether data from one model’s predictive distribution can pass a predictive check designed for another model. This form of criticism complements the classical predictive check by providing a comparative tool. A collection of PPNs, which we call a PPN study, can help us understand which models are equivalent and which models provide diﬀerent perspectives on the data. With mixture models, we demonstrate how a PPN study, along with traditional predictive checks, can help select the number of components by the principle of parsimony. With probabilistic factor models, we demonstrate how a PPN study can help understand relationships between diﬀerent classes of models, such as linear models and models based on neural networks. Finally, we analyze data from the literature on predictive checks to show how a PPN study can improve the practice of Bayesian model criticism. Code to replicate the results in this paper is available at https://github.com/gemoran/ppn-code .

Gemma E. Moran | J. Cunningham | D. Blei

[1] Aki Vehtari,et al. Bayesian Workflow. , 2020, 2011.01808.

[2] Cynthia Rudin,et al. A study in Rashomon curves and volumes: A new perspective on generalization and model simplicity in machine learning , 2019, ArXiv.

[3] Alexander M. Rush,et al. Avoiding Latent Variable Collapse With Generative Skip Models , 2018, AISTATS.

[4] David M. Blei,et al. Variational Inference: A Review for Statisticians , 2016, ArXiv.

[5] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[6] Daan Wierstra,et al. Stochastic Backpropagation and Approximate Inference in Deep Generative Models , 2014, ICML.

[7] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.

[8] Noah D. Goodman,et al. Amortized Inference in Probabilistic Reasoning , 2014, CogSci.

[9] Cosma Rohilla Shalizi,et al. Philosophy and the practice of Bayesian statistics. , 2010, The British journal of mathematical and statistical psychology.

[10] Aki Vehtari,et al. A survey of Bayesian predictive methods for model assessment, selection and comparison , 2012 .

[11] D. J. Spiegelhalter,et al. Identifying outliers in Bayesian hierarchical models: a simulation-based approach , 2007 .