Conditional vs marginal estimation of the predictive loss of hierarchical models using WAIC and cross-validation

The predictive loss of Bayesian models can be estimated using a sample from the full-data posterior by evaluating the Watanabe-Akaike information criterion (WAIC) or using an importance sampling (ISCVL) approximation to leave-one-out cross-validation loss. With hierarchical models the loss can be specified at different levels of the hierarchy, and in the published literature, it is routine for these estimators to use the conditional likelihood provided by the lowest level of model hierarchy. However, the regularity conditions underlying these estimators may not hold at this level, and the behaviour of conditional-level WAIC as an estimator of conditional-level predictive loss must be determined on a case-by-case basis. Conditional-level ISCVL does not target conditional-level predictive loss and instead is an estimator of marginal-level predictive loss. Using examples for analysis of over-dispersed count data, it is shown that conditional-level WAIC does not provide a reliable estimator of its target loss, and simulations show that it can favour the incorrect model. Moreover, conditional-level ISCVL is numerically unstable compared to marginal-level ISCVL. It is recommended that WAIC and ISCVL be evaluated using the marginalized likelihood where practicable and that the reliability of these estimators always be checked using appropriate diagnostics.

[1]  N. Lazar,et al.  Methods and Criteria for Model Selection , 2004 .

[2]  Aki Vehtari,et al.  Practical Bayesian model evaluation using leave-one-out cross-validation and WAIC , 2015, Statistics and Computing.

[3]  Sumio Watanabe,et al.  Equations of States in Singular Statistical Estimation , 2007, Neural Networks.

[4]  A. Gelfand,et al.  Bayesian Model Choice: Asymptotics and Exact Calculations , 1994 .

[5]  M. Yamasaki,et al.  Uncovering a Nuisance Influence of a Phenological Trait of Plants Using a Nonlinear Structural Equation: Application to Days to Heading and Culm Length in Asian Cultivated Rice (Oryza Sativa L.) , 2016, PloS one.

[6]  Sumio Watanabe Algebraic Analysis for Singular Statistical Estimation , 1999, ALT.

[7]  Sumio Watanabe,et al.  Equations of States in Statistical Learning for a Nonparametrizable and Regular Case , 2009, ArXiv.

[8]  Bradley P. Carlin,et al.  Bayesian measures of model complexity and fit , 2002 .

[9]  R. Millar,et al.  Assessment of locally influential observations in Bayesian models , 2007 .

[10]  Ole Winther,et al.  Bayesian Leave-One-Out Cross-Validation Approximations for Gaussian Latent Variable Models , 2014, J. Mach. Learn. Res..

[11]  Donald L. Alderman,et al.  The Effects of Special Preparation on SAT-Verbal Scores , 1980 .

[12]  Sumio Watanabe Algebraic Geometry and Statistical Learning Theory , 2009 .

[13]  E. Ionides Truncated Importance Sampling , 2008 .

[14]  Marti J. Anderson,et al.  Spatial variation and effects of habitat on temperate reef fish assemblages in northeastern New Zealand , 2004 .

[15]  Sumio Watanabe,et al.  Asymptotic Equivalence of Bayes Cross Validation and Widely Applicable Information Criterion in Singular Learning Theory , 2010, J. Mach. Learn. Res..

[16]  Aki Vehtari,et al.  Understanding predictive information criteria for Bayesian models , 2013, Statistics and Computing.

[17]  Xiao-Li Meng,et al.  Simulating Normalizing Constants: From Importance Sampling to Bridge Sampling to Path Sampling , 1998 .

[18]  Sumio Watanabe,et al.  Asymptotic Learning Curve and Renormalizable Condition in Statistical Learning Theory , 2010, ArXiv.

[19]  R. Sukumar,et al.  How much Dillenia indica seed predation occurs from Asian elephant dung , 2016 .

[20]  John K. Kruschke,et al.  Decision-making in stimulant and opiate addicts in protracted abstinence: evidence from computational modeling with pure users , 2014, Front. Psychol..

[21]  Jouko Lampinen,et al.  Bayesian Model Assessment and Comparison Using Cross-Validation Predictive Densities , 2002, Neural Computation.

[22]  François Laviolette,et al.  Domain-Adversarial Training of Neural Networks , 2015, J. Mach. Learn. Res..

[23]  Shi Qiu,et al.  Approximating cross-validatory predictive evaluation in Bayesian latent variable models with integrated IS and WAIC , 2014, Stat. Comput..

[24]  Aki Vehtari,et al.  A survey of Bayesian predictive methods for model assessment, selection and comparison , 2012 .

[25]  Yang Li,et al.  A Bayesian Semiparametric Approach for Endogeneity and Heterogeneity in Choice Models , 2014, Manag. Sci..

[26]  P. Burman A comparative study of ordinary cross-validation, v-fold cross-validation and the repeated learning-testing methods , 1989 .

[27]  Hawthorne L. Beyer,et al.  The pitfalls of ignoring behaviour when quantifying habitat selection , 2014 .

[28]  S. Geisser,et al.  A Predictive Approach to Model Selection , 1979 .