Multivariate Parametric Spatiotemporal Models for County Level Breast Cancer Survival Data

In clustered survival settings where the clusters correspond to geographic regions, biostatisticians are increasingly turning to models with spatially distributed random effects. These models begin with spatially oriented frailty terms, but may also include further region-level terms in the parametrization of the baseline hazards or various covariate effects (as in a spatially-varying coefficients model). In this paper, we propose a multivariate conditionally autoregressive (MCAR) model as a mixing distribution for these random effects, as a way of capturing correlation across both the regions and the elements of the random effect vector for any particular region. We then extend this model to permit analysis of temporal cohort effects, where we use the term “temporal cohort” to mean a group of subjects all of whom were diagnosed with the disease of interest (and thus, entered the study) during the same time period (say, calendar year). We show how our spatiotemporal model may be efficiently fit in a hierarchical Bayesian framework implemented using Markov chain Monte Carlo (MCMC) computational techniques. We illustrate our approach in the context of county-level breast cancer data from 22 annual cohorts of women living in the state of Iowa, as recorded by the Surveillance, Epidemiology, and End Results (SEER) database. Hierarchical model comparison using the Deviance Information Criterion (DIC), as well as maps of the fitted county-level effects, reveal the benefit of our approach.

[1]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[2]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[3]  Bradley P. Carlin,et al.  Hierarchical Spatio-Temporal Mapping of Disease Rates , 1997 .

[4]  A. Gelfand,et al.  Proper multivariate conditional autoregressive models for spatial data analysis. , 2003, Biostatistics.

[5]  J. Besag Spatial Interaction and the Statistical Analysis of Lattice Systems , 1974 .

[6]  B. Carlin,et al.  Hierarchical Proportional Hazards Regression Models for Highly Stratified Data , 1999, Biometrics.

[7]  J. Besag,et al.  Bayesian Computation and Stochastic Systems , 1995 .

[8]  D. Cox,et al.  Parameter Orthogonality and Approximate Conditional Inference , 1987 .

[9]  A. Gelfand,et al.  Efficient parametrisations for normal linear mixed models , 1995 .

[10]  K. Mardia Multi-dimensional multivariate Gaussian Markov random fields with application to image processing , 1988 .

[11]  B. Carlin,et al.  Spatial Semiparametric Proportional Hazards Models for Analyzing Infant Mortality Rates in Minnesota Counties , 2002 .

[12]  Bradley P. Carlin,et al.  Bayesian measures of model complexity and fit , 2002 .

[13]  J. Haybittle,et al.  Death certification in cancer of the breast. , 1984, British medical journal.

[14]  Aki Vehtari Discussion to "Bayesian measures of model complexity and fit" by Spiegelhalter, D.J., Best, N.G., Carlin, B.P., and van der Linde, A. , 2002 .

[15]  Adrian F. M. Smith,et al.  Sampling-Based Approaches to Calculating Marginal Densities , 1990 .

[16]  J. E. H. Shaw,et al.  A parametric dynamic survival model applied to breast cancer survival times , 2002 .

[17]  Renato M. Assunção,et al.  Space varying coefficient models for small area data , 2003 .

[18]  B. Carlin,et al.  Semiparametric spatio‐temporal frailty modeling , 2003 .

[19]  Bradley P. Carlin,et al.  BAYES AND EMPIRICAL BAYES METHODS FOR DATA ANALYSIS , 1996, Stat. Comput..

[20]  A. Gelfand,et al.  Efficient parametrizations for generalized linear mixed models, (with discussion). , 1996 .

[21]  J. Grant,et al.  Intracranial infection due to mycobacterium bovis in Hodgkin's disease. , 1984, British medical journal.

[22]  Alan E. Gelfand,et al.  Model choice: A minimum posterior predictive loss approach , 1998, AISTATS.

[23]  B. Carlin,et al.  Frailty modeling for spatially correlated survival data, with application to infant mortality in Minnesota. , 2003, Biostatistics.

[24]  Bradley P. Carlin,et al.  BAYES AND EMPIRICAL BAYES METHODS FOR DATA ANALYSIS , 1996, Stat. Comput..

[25]  M. May Bayesian Survival Analysis. , 2002 .

[26]  M. Wall A close look at the spatial structure implied by the CAR and SAR models , 2004 .