Hierarchical multivariate mixture generalized linear models for the analysis of spatial data: An application to disease mapping

Disease mapping of a single disease has been widely studied in the public health setup. Simultaneous modeling of related diseases can also be a valuable tool both from the epidemiological and from the statistical point of view. In particular, when we have several measurements recorded at each spatial location, we need to consider multivariate models in order to handle the dependence among the multivariate components as well as the spatial dependence between locations. It is then customary to use multivariate spatial models assuming the same distribution through the entire population density. However, in many circumstances, it is a very strong assumption to have the same distribution for all the areas of population density. To overcome this issue, we propose a hierarchical multivariate mixture generalized linear model to simultaneously analyze spatial Normal and non-Normal outcomes. As an application of our proposed approach, esophageal and lung cancer deaths in Minnesota are used to show the outperformance of assuming different distributions for different counties of Minnesota rather than assuming a single distribution for the population density. Performance of the proposed approach is also evaluated through a simulation study.

[1]  A. Gelfand,et al.  Proper multivariate conditional autoregressive models for spatial data analysis. , 2003, Biostatistics.

[2]  Mahmoud Torabi,et al.  Spatio‐temporal modelling of disease mapping of rates , 2010 .

[3]  Mahmoud Torabi,et al.  Hierarchical Bayesian Spatiotemporal Analysis of Childhood Cancer Trends , 2012 .

[4]  M. Stephens Dealing with label switching in mixture models , 2000 .

[5]  Nicola G. Best,et al.  A shared component model for detecting joint and selective clustering of two diseases , 2001 .

[6]  Alan E Gelfand,et al.  A multivariate spatial mixture model for areal data: examining regional differences in standardized test scores , 2014, Journal of the Royal Statistical Society. Series C, Applied statistics.

[7]  J LunnDavid,et al.  WinBUGS A Bayesian modelling framework , 2000 .

[8]  Hal S. Stern,et al.  Inference for extremes in disease mapping , 1999 .

[9]  Melanie M Wall,et al.  Generalized common spatial factor model. , 2003, Biostatistics.

[10]  Bradley P. Carlin,et al.  Hierarchical Multivarite CAR Models for Spatio-Temporally Correlated Survival Data , 2002 .

[11]  Mahmoud Torabi,et al.  Hierarchical Bayes estimation of spatial statistics for rates , 2012 .

[12]  Dongchu Sun,et al.  A Bivariate Bayes Method for Improving the Estimates of Mortality Rates With a Twofold Conditional Autoregressive Model , 2001 .

[13]  Mahmoud Torabi,et al.  Spatio-temporal modelling using B-spline for disease mapping: analysis of childhood cancer trends , 2011 .

[14]  Multivariate Lattice Models for Spatial Environmental Data , 2002 .

[15]  J. Besag Spatial Interaction and the Statistical Analysis of Lattice Systems , 1974 .

[16]  Andrew Thomas,et al.  WinBUGS - A Bayesian modelling framework: Concepts, structure, and extensibility , 2000, Stat. Comput..

[17]  P. Green,et al.  Hidden Markov Models and Disease Mapping , 2002 .

[18]  A. Jemal,et al.  Cancer Statistics, 2008 , 2008, CA: a cancer journal for clinicians.

[19]  Mahmoud Torabi Spatial generalized linear mixed models with multivariate CAR models for areal data , 2014 .

[20]  A. Gelman Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper) , 2004 .

[21]  A. Kottas,et al.  Bayesian mixture modeling for spatial Poisson process intensities, with applications to extreme value analysis , 2007 .

[22]  Malay Ghosh,et al.  Hierarchical Bayes GLMs for the analysis of spatial data: An application to disease mapping , 1999 .

[23]  P. Moran Notes on continuous stochastic phenomena. , 1950, Biometrika.

[24]  K. Mardia Multi-dimensional multivariate Gaussian Markov random fields with application to image processing , 1988 .

[25]  Noel A Cressie,et al.  Statistics for Spatial Data, Revised Edition. , 1994 .

[26]  Philip Heidelberger,et al.  Simulation Run Length Control in the Presence of an Initial Transient , 1983, Oper. Res..

[27]  Natural variability of benthic species composition in the Delaware Bay , 2004, Environmental and Ecological Statistics.

[28]  Charmaine B. Dean,et al.  Joint analysis of multivariate spatial count and zero‐heavy count outcomes using common spatial factor models , 2012 .

[29]  Bradley P Carlin,et al.  Generalized Hierarchical Multivariate CAR Models for Areal Data , 2005, Biometrics.

[30]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .

[31]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[32]  Andrew Gelman,et al.  General methods for monitoring convergence of iterative simulations , 1998 .

[33]  Mike West,et al.  Spatial Mixture Modelling for Unobserved Point Processes: Examples in Immunofluorescence Histology. , 2009, Bayesian analysis.

[34]  Luciano Nieddu,et al.  Finite Mixture Models for Mapping Spatially Dependent Disease Counts , 2009, Biometrical journal. Biometrische Zeitschrift.

[35]  N. Cressie,et al.  Spatial Modeling of Regional Variables , 1993 .

[36]  Brian J. Reich,et al.  A MULTIVARIATE SEMIPARAMETRIC BAYESIAN SPATIAL MODELING FRAMEWORK FOR HURRICANE SURFACE WIND FIELDS , 2007, 0709.0427.

[37]  George Cho,et al.  Spatial Processes: Models and Applications by A.D. Cliff and J.K. Ord. 16 by 24 em, 266 pages, maps, diags., index and bibliography. london: Pion Limited, 1981. (ISBN 08-85086-081-4). £20.50. , 1983 .

[38]  S. MacEachern,et al.  Bayesian Nonparametric Spatial Modeling With Dirichlet Process Mixing , 2005 .

[39]  Xuan Liu,et al.  Spatial latent class analysis model for spatially distributed multivariate binary data , 2009, Comput. Stat. Data Anal..

[40]  A. Lawson,et al.  Spatial mixture relative risk models applied to disease mapping , 2002, Statistics in medicine.