Bayesian semi-parametric ZIP models with space-time interactions: an application to cancer registry data.

We analyse lymphoid leukemia incidence data collected between 1988 and 2002 from the cancer registry of Haut-Rhin, a region in north-east France. For each patient, sex, area of residence, date of birth and date of diagnosis are available. Incidence summaries in the registry are grouped by 3-year periods. A disproportionately large frequency of zeros in the data leads to a lack of fit for Poisson models of relative risk. The aim of our analysis was to model the spatio-temporal variations of the disease taking into account some non-standard requirements, such as count data with many zeros and space-time interactions. For this purpose, we consider a flexible zero-inflated Poisson model for semi-parametric regression which incorporates space-time interactions (modelled by means of varying coefficient model) using an extension of the methodology proposed in Fahrmeir & Osuna (2006, Structured additive regression for overdispersed and zero-inflated count data. Stoc. Models Bus. Ind., 22, 351-369). Inference is carried out from a Bayesian perspective using Markov chain Monte Carlo methods by means of the BayesX software. Our analysis of the geographical distribution of the disease and its evolution in time may be considered as a starting point for further studies.

[1]  Murray Longmore,et al.  Oxford Handbook of Clinical Specialties , 1987, Annals of Internal Medicine.

[2]  L Bernardinelli,et al.  Bayesian estimates of disease maps: how important are priors? , 1995, Statistics in medicine.

[3]  J. Hinde,et al.  A Score Test for Testing a Zero‐Inflated Poisson Regression Model Against Zero‐Inflated Negative Binomial Alternatives , 2001, Biometrics.

[4]  Jye-Chyi Lu,et al.  Bayesian analysis of zero-inflated regression models , 2006 .

[5]  L. Fahrmeir,et al.  PENALIZED STRUCTURED ADDITIVE REGRESSION FOR SPACE-TIME DATA: A BAYESIAN PERSPECTIVE , 2004 .

[6]  Annibale Biggeri,et al.  Age–period–cohort models and disease mapping , 2003 .

[7]  Noel Cressie,et al.  Hierarchical statistical modelling of influenza epidemic dynamics in space and time , 2002, Statistics in medicine.

[8]  R. Tibshirani,et al.  Varying‐Coefficient Models , 1993 .

[9]  Jason A. Duan,et al.  Modeling Disease Incidence Data with Spatial and Spatio Temporal Dirichlet Process Mixtures , 2008, Biometrical journal. Biometrische Zeitschrift.

[10]  P Schlattmann,et al.  Space-time mixture modelling of public health data. , 2000, Statistics in medicine.

[11]  Alexandra M. Schmidt,et al.  Spatio‐temporal models for mapping the incidence of malaria in Pará , 2005 .

[12]  Sylvia Richardson,et al.  A comparison of Bayesian spatial models for disease mapping , 2005, Statistical methods in medical research.

[13]  S. Wood Low‐Rank Scale‐Invariant Tensor Product Smooths for Generalized Additive Mixed Models , 2006, Biometrics.

[14]  Annibale Biggeri,et al.  A hierarchical Bayesian model for space-time variation of disease risk , 2001 .

[15]  A. Linde DIC in variable selection , 2005 .

[16]  Andreas Brezger,et al.  Generalized structured additive regression based on Bayesian P-splines , 2006, Comput. Stat. Data Anal..

[17]  Bradley P. Carlin,et al.  Hierarchical Spatio-Temporal Mapping of Disease Rates , 1997 .

[18]  L. Fahrmeir,et al.  Structured additive regression for overdispersed and zero‐inflated count data , 2006 .

[19]  D. Clayton,et al.  Empirical Bayes estimates of age-standardized relative risks for use in disease mapping. , 1987, Biometrics.

[20]  Andy H. Lee,et al.  Multi-level zero-inflated Poisson regression modelling of correlated count data with excess zeros , 2006, Statistical methods in medical research.

[21]  L Knorr-Held,et al.  Modelling risk from a disease in time and space. , 1998, Statistics in medicine.

[22]  Peter Congdon,et al.  A Model Framework for Mortality and Health Data Classified by Age, Area, and Time , 2006, Biometrics.

[23]  Y. MacNab,et al.  Spatio‐temporal modelling of rates for the construction of disease maps , 2002, Statistics in medicine.

[24]  Dankmar Böhning,et al.  The zero‐inflated Poisson model and the decayed, missing and filled teeth index in dental epidemiology , 1999 .

[25]  Leonhard Held,et al.  Gaussian Markov Random Fields: Theory and Applications , 2005 .

[26]  Y. MacNab,et al.  Autoregressive Spatial Smoothing and Temporal Spline Smoothing for Mapping Rates , 2001, Biometrics.

[27]  M. S. Ridout,et al.  A Comparison of Confidence Interval Methods for Dilution Series Experiments , 1994 .

[28]  L. Fahrmeir,et al.  Bayesian inference for generalized additive mixed models based on Markov random field priors , 2001 .

[29]  M. I. Santiago-Pérez,et al.  Distribución geográfica y temporal del suicidio en Galicia (1976–1998) , 2001 .

[30]  M. Musio,et al.  Comparison of Three Convolution Prior Spatial Models for Cancer Incidence , 2007 .

[31]  D. Hall Zero‐Inflated Poisson and Binomial Regression with Random Effects: A Case Study , 2000, Biometrics.

[32]  Volker J Schmid,et al.  Bayesian Extrapolation of Space–Time Trends in Cancer Registry Data , 2004, Biometrics.

[33]  J. Besag,et al.  Bayesian image restoration, with two applications in spatial statistics , 1991 .

[34]  John Hinde,et al.  Models for count data with many zeros , 1998 .

[35]  C. Czado,et al.  Modelling count data with overdispersion and spatial effects , 2008 .

[36]  L Knorr-Held,et al.  Bayesian modelling of inseparable space-time variation in disease risk. , 2000, Statistics in medicine.