Spatial risk mapping for rare disease with hidden Markov fields and variational EM

Current risk mapping models for pooled data focus on estimating the risk in each geographical unit. A risk classification, that is, a grouping of geographical units with similar risk, is then needed to draw easily interpretable maps with clearly delimited zones in which protection measures can be applied. As an illustration, we focus on Bovine Spongiform Encephalopathy (BSE), a disease that threatened bovine production in Europe and led to drastic cow culling. This example features issues typical of animal disease risk analysis: very low risk values, small numbers of observed cases, and small population sizes, all of which make automatic classification more difficult. We propose to handle this task in a spatial clustering framework using a nonstandard discrete hidden Markov model prior designed to favor smooth risk variation. The model parameters are estimated using an EM algorithm and a mean field approximation, for which we develop a new initialization strategy appropriate for spatial Poisson mixtures. Using both simulated data and our BSE data, we show that our strategy performs well in dealing with low population sizes and accurately determines high risk regions, in terms of both localization and risk level estimation.
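To make the estimation scheme more concrete, the following is a minimal sketch of a mean-field EM loop for a spatial Poisson mixture. It is an illustration under stated assumptions, not the method of the paper: a standard Potts prior with a fixed interaction parameter `beta` stands in for the nonstandard smoothness-favoring prior, and a crude rate-ranking initialization stands in for the initialization strategy proposed here. The function name `mean_field_em` and all parameter names are hypothetical.

```python
import math

def poisson_logpmf(y, mean):
    # log P(Y = y) for a Poisson variable with the given mean (mean > 0)
    return y * math.log(mean) - mean - math.lgamma(y + 1)

def mean_field_em(cases, pop, neighbors, K=2, beta=0.5, n_iter=50):
    """Cluster units into K risk classes.

    cases[i], pop[i]: observed case count and population of unit i.
    neighbors[i]: list of indices of the units adjacent to unit i.
    beta: fixed Potts interaction strength (assumption: not estimated).
    """
    n = len(cases)
    # Assumed initialization: rank units by raw rate, cut into K equal groups.
    order = sorted(range(n), key=lambda i: cases[i] / pop[i])
    q = [[0.0] * K for _ in range(n)]
    for rank, i in enumerate(order):
        q[i][min(K - 1, rank * K // n)] = 1.0

    risks = [0.0] * K
    for _ in range(n_iter):
        # M-step: class risk = weighted cases / weighted population.
        for k in range(K):
            num = sum(q[i][k] * cases[i] for i in range(n))
            den = sum(q[i][k] * pop[i] for i in range(n))
            risks[k] = max(num / den, 1e-12) if den > 0 else 1e-12
        # E-step (mean field): update each unit in turn, replacing the
        # unknown neighbor labels by their current membership probabilities.
        for i in range(n):
            logw = []
            for k in range(K):
                prior = beta * sum(q[j][k] for j in neighbors[i])
                logw.append(prior + poisson_logpmf(cases[i], pop[i] * risks[k]))
            top = max(logw)
            w = [math.exp(v - top) for v in logw]
            total = sum(w)
            q[i] = [v / total for v in w]

    # Hard classification: most probable class for each unit.
    labels = [max(range(K), key=lambda k: q[i][k]) for i in range(n)]
    return risks, q, labels
```

The function takes case counts, population sizes, and an adjacency list, and returns the estimated risk level of each class, the soft memberships, and a hard classification that could be drawn as a map. In this sketch the classes come out ordered from lowest to highest risk only because of the ranking-based initialization.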
