Partitioning Degrees of Freedom in Hierarchical and Other Richly Parameterized Models

Hodges and Sargent (2001) have developed a measure of a hierarchical model’s complexity, degrees of freedom (DF), that is consistent with definitions for scatterplot smoothers, is interpretable in terms of simple models, and enables control of a fit’s complexity by means of a prior distribution on complexity. But although DF describes the complexity of the whole fitted model, in general it remains unclear how to allocate DF to individual effects. Here we present a new definition of DF for arbitrary normal-error linear hierarchical models, consistent with that of Hodges and Sargent, that naturally partitions the n observations into DF for individual effects and for error. The new conception of an effect’s DF is the ratio of the effect’s modeled variance matrix to the total variance matrix. This provides a way to describe the sizes of different parts of a model (e.g., spatial clustering vs. heterogeneity), to place DF-based priors on smoothing parameters, and to describe how a smoothed effect competes with other effects. It also avoids difficulties with the most common definition of DF for residuals. We conclude by comparing DF with the effective number of parameters, pD, of Spiegelhalter et al. (2002). Technical appendices and a data set are available online as supplemental materials.

[1]  Aki Vehtari Discussion to "Bayesian measures of model complexity and fit" by Spiegelhalter, D.J., Best, N.G., Carlin, B.P., and van der Linde, A. , 2002 .

[2]  B. Carlin,et al.  On the Precision of the Conditionally Autoregressive Prior in Spatial Models , 2003, Biometrics.

[3]  Raquel Prado,et al.  Multichannel electroencephalographic analyses via dynamic regression models with time‐varying lag–lead structure , 2001 .

[4]  B. Carlin,et al.  Spatial Analyses of Periodontal Data Using Conditionally Autoregressive Priors Having Two Classes of Neighbor Relations , 2007 .

[5]  K A Eaton,et al.  Unification of the "Burst" and "Linear" Theories of Periodontal Disease Progression: A Multilevel Manifestation of the Same Phenomenon , 2003, Journal of dental research.

[6]  R. D. Murphy,et al.  Iterative solution of nonlinear equations , 1994 .

[7]  Andrew Gelman,et al.  Bayesian Measures of Explained Variance and Pooling in Multilevel (Hierarchical) Models , 2006, Technometrics.

[8]  C. W. Groetsch,et al.  Inverse Problems in the Mathematical Sciences , 1993 .

[9]  M. Plummer Penalized loss functions for Bayesian model comparison. , 2008, Biostatistics.

[10]  B. Carlin,et al.  Measuring the complexity of generalized linear hierarchical models , 2007 .

[11]  M. Daniels A prior for the variance in hierarchical models , 1999 .

[12]  J. Besag,et al.  Bayesian image restoration, with two applications in spatial statistics , 1991 .

[13]  James M. Ortega,et al.  Iterative solution of nonlinear equations in several variables , 2014, Computer science and applied mathematics.

[14]  Malik Beshir Malik,et al.  Applied Linear Regression , 2005, Technometrics.

[15]  J. Hodges,et al.  Counting degrees of freedom in hierarchical and other richly-parameterised models , 2001 .

[16]  Christopher J Paciorek,et al.  Spatial modelling using a new class of nonstationary covariance functions , 2006, Environmetrics.

[17]  Bradley P. Carlin,et al.  Bayesian measures of model complexity and fit , 2002 .

[18]  T. Hassard,et al.  Applied Linear Regression , 2005 .

[19]  Jianming Ye On Measuring and Correcting the Effects of Data Mining and Model Selection , 1998 .

[20]  Bradley P. Carlin,et al.  Smoothing Balanced Single-Error-Term Analysis of Variance , 2007, Technometrics.

[21]  Michael A. West,et al.  Bayesian Forecasting and Dynamic Models (2nd edn) , 1997, J. Oper. Res. Soc..

[22]  P. Gustafson,et al.  Conservative prior distributions for variance parameters in hierarchical models , 2006 .

[23]  V. Zadnik,et al.  Effects of Residual Smoothing on the Posterior of the Fixed Effects in Disease‐Mapping Models , 2006, Biometrics.

[24]  J. Hodges,et al.  Identification of the variance components in the general two-variance linear model , 2008 .

[25]  F. Vaida,et al.  Conditional Akaike information for mixed-effects models , 2005 .

[26]  James S Hodges,et al.  Modeling Longitudinal Spatial Periodontal Data: A Spatially Adaptive Model with Tools for Specifying Priors and Checking Fit , 2008, Biometrics.

[27]  R. Kass,et al.  Reference Bayesian Methods for Generalized Linear Mixed Models , 2000 .

[28]  D. Steinberg,et al.  Technometrics , 2008 .

[29]  B. Carlin,et al.  Identifiability and convergence issues for Markov chain Monte Carlo fitting of spatial models. , 2000, Statistics in medicine.

[30]  J. Hodges,et al.  Smoothing analysis of variance for general designs , 2009 .

[31]  A. Gelman Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper) , 2004 .

[32]  James R. Schott,et al.  Matrix Analysis for Statistics , 2005 .