Classification method for disease risk mapping based on discrete hidden Markov random fields.

Risk mapping in epidemiology enables areas with a low or high risk of disease contamination to be localized and provides a measure of risk differences between these regions. Risk mapping models for pooled data currently used by epidemiologists focus on the estimated risk for each geographical unit. They are based on a Poisson log-linear mixed model with a latent intrinsic continuous hidden Markov random field (HMRF) generally corresponding to a Gaussian autoregressive spatial smoothing. Risk classification, which is necessary to draw clearly delimited risk zones (in which protection measures may be applied), generally must be performed separately. We propose a method for direct classified risk mapping based on a Poisson log-linear mixed model with a latent discrete HMRF. The discrete hidden field (HF) corresponds to the assignment of each spatial unit to a risk class. The risk values attached to the classes are parameters and are estimated. When mapping risk using HMRFs, the conditional distribution of the observed field is modeled with a Poisson rather than a Gaussian distribution as in image segmentation. Moreover, abrupt changes in risk levels are rare in disease maps. The spatial hidden model should favor smoothed out risks, but conventional discrete Markov random fields (e.g. the Potts model) do not impose this. We therefore propose new potential functions for the HF that take into account class ordering. We use a Monte Carlo version of the expectation-maximization algorithm to estimate parameters and determine risk classes. We illustrate the method's behavior on simulated and real data sets. Our method appears particularly well adapted to localize high-risk regions and estimate the corresponding risk levels.

[1]  Leonhard Knorr-Held,et al.  Disease Mapping of Stage‐Specific Cancer Incidence Data , 2002, Biometrics.

[2]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[3]  L Knorr-Held,et al.  Bayesian Detection of Clusters and Discontinuities in Disease Maps , 2000, Biometrics.

[4]  G. C. Wei,et al.  A Monte Carlo Implementation of the EM Algorithm and the Poor Man's Data Augmentation Algorithms , 1990 .

[5]  J. Besag,et al.  Bayesian image restoration, with two applications in spatial statistics , 1991 .

[6]  Ying C MacNab,et al.  On Gaussian Markov random fields and Bayesian disease mapping , 2011, Statistical methods in medical research.

[7]  Adrian E. Raftery,et al.  Bayesian Regularization for Normal Mixture Estimation and Model-Based Clustering , 2007, J. Classif..

[8]  J. Besag Spatial Interaction and the Statistical Analysis of Lattice Systems , 1974 .

[9]  Sylvia Richardson,et al.  A hierarchical model for space–time surveillance data on meningococcal disease incidence , 2003 .

[10]  J. Besag On the Statistical Analysis of Dirty Pictures , 1986 .

[11]  P. Green,et al.  Hidden Markov Models and Disease Mapping , 2002 .

[12]  F. Y. Wu The Potts model , 1982 .

[13]  Christian Ducrot,et al.  Poultry, pig and the risk of BSE following the feed ban in France--a spatial analysis. , 2005, Veterinary research.

[14]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .

[15]  E. Lesaffre,et al.  Disease mapping and risk assessment for public health. , 1999 .

[16]  A. Lawson,et al.  Review of methods for space–time disease surveillance , 2010, Spatial and Spatio-temporal Epidemiology.

[17]  Stan Z. Li,et al.  Markov Random Field Modeling in Image Analysis , 2001, Computer Science Workbench.

[18]  Luciano Nieddu,et al.  Finite Mixture Models for Mapping Spatially Dependent Disease Counts , 2009, Biometrical journal. Biometrische Zeitschrift.

[19]  C Pascutto,et al.  Statistical issues in the analysis of disease mapping data. , 2000, Statistics in medicine.