Comparing hierarchical models for spatio-temporally misaligned data using the deviance information criterion.

Bayes and empirical Bayes methods have proven effective in smoothing crude maps of disease risk, eliminating the instability of estimates in low-population areas while maintaining overall geographic trends and patterns. Recent work extends these methods to the analysis of areal data which are spatially misaligned, that is, involving variables (typically counts or rates) which are aggregated over differing sets of regional boundaries. The addition of a temporal aspect complicates matters further, since now the misalignment can arise either within a given time point, or across time points (as when the regional boundaries themselves evolve over time). Hierarchical Bayesian methods (implemented via modern Markov chain Monte Carlo computing methods) enable the fitting of such models, but a formal comparison of their fit is hampered by their large size and often improper prior specifications. In this paper, we accomplish this comparison using the deviance information criterion (DIC), a recently proposed generalization of the Akaike information criterion (AIC) designed for complex hierarchical model settings like ours. We investigate the use of the delta method for obtaining an approximate variance estimate for DIC, in order to attach significance to apparent differences between models. We illustrate our approach using a spatially misaligned data set relating a measure of traffic density to paediatric asthma hospitalizations in San Diego County, California.

[1]  Bradley P. Carlin,et al.  Markov Chain Monte Carlo in Practice: A Roundtable Discussion , 1998 .

[2]  N. L. Johnson,et al.  Survival Models and Data Analysis , 1982 .

[3]  Purushottam W. Laud,et al.  Predictive Model Selection , 1995 .

[4]  Bradley P. Carlin,et al.  Hierarchical modeling of spatio-temporally misaligned data: relating traffic density to pediatric asthma hospitalizations , 2000 .

[5]  A. H. Versluis Methodology for predicting vehicle emissions on motorways and their impact on air quality in the Netherlands , 1994 .

[6]  Alan E. Gelfand,et al.  Model choice: A minimum posterior predictive loss approach , 1998, AISTATS.

[7]  Bradley P. Carlin,et al.  Hierarchical models for mapping Ohio lung cancer rates , 1997 .

[8]  Jianming Ye On Measuring and Correcting the Effects of Data Mining and Model Selection , 1998 .

[9]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[10]  Hong Chang,et al.  Model Determination Using Predictive Distributions with Implementation via Sampling-Based Methods , 1992 .

[11]  A. Coppalle,et al.  Dispersion and chemical reaction of a pollutant near a motorway , 1995 .

[12]  A. O'Hagan,et al.  Fractional Bayes factors for model comparison , 1995 .

[13]  L Bernardinelli,et al.  Bayesian estimates of disease maps: how important are priors? , 1995, Statistics in medicine.

[14]  D. Clayton,et al.  Empirical Bayes estimates of age-standardized relative risks for use in disease mapping. , 1987, Biometrics.

[15]  J. Hodges,et al.  Counting degrees of freedom in hierarchical and other richly-parameterised models , 2001 .

[16]  J. Besag,et al.  Bayesian image restoration, with two applications in spatial statistics , 1991 .

[17]  Bradley P. Carlin,et al.  Hierarchical Spatio-Temporal Mapping of Disease Rates , 1997 .