Accounting for inaccuracies in population counts and case registration in cancer mapping studies

Disease mapping studies summarize spatial and spatiotemporal variations in disease risk. This information may be used for simple descriptive purposes, to assess whether health targets are being met or whether new policies are successful, to provide the context for further studies (by providing information on the form and size of the spatial variability in risk) or, by comparing the estiamted risk map with an exposure map, to obtain clues to aetiology. There are well-known problems with mapping raw risks and relative risks for rare diseases and/or small areas since sampling variability tends to dominate the subsequent maps. To alleviate these difficulties a multilevel modelling approach may be followed in which estimates based on small numbers are ‘shrunk’ towards a common value. In this paper we extend these models to investigate the effects of inaccuracies in the health and population data. In terms of the health data we consider the effects of errors that occur due to the imperfect collection procedures that are used by disease registers. For cancers in particular, this is a major problem, with case underascertainment (i.e. undercount) being the common type of error. The populations that are used for estimating disease risks have traditionally been treated as known quantities. In practice, however, these counts are often based on sources of data such as the census which are subject to error (in particular underenumeration) and are only available for census years. Intercensual population counts must consider not only the usual demographic changes (e.g. births and deaths) but migration also. We propose several approaches for modelling population counts and investigate the sensitivity of inference to the sizes of these errors. We illustrate the methods proposed using data for breast cancer in the Thames region of the UK and we compare our results with those obtained from more conventional approaches.

[1]  Linda W. Pickle,et al.  Atlas of United States Mortality , 1996 .

[2]  V. Carstairs,et al.  Deprivation and health in Scotland. , 1990, Health bulletin.

[3]  R. T. Lie,et al.  Birth Defects Registered by Double Sampling: A Bayesian Approach Incorporating Covariates and Model Uncertainty , 1995 .

[4]  J. Besag,et al.  Bayesian image restoration, with two applications in spatial statistics , 1991 .

[5]  Brian Jarman,et al.  Socio-economic confounding , 1996 .

[6]  Anthony J. Swerdlow,et al.  Cancer Registration in England and Wales: Some Aspects Relevant to Interpretation of the Data , 1986 .

[7]  N. G. Best,et al.  Disease mapping with errors in covariates. , 1997, Statistics in medicine.

[8]  L. Bernardinelli,et al.  Bayesian methods for mapping disease risk , 1996 .

[9]  M. Gulliford,et al.  The reliability of Cancer Registry records. , 1993, British Journal of Cancer.

[10]  D. Ruppert,et al.  Measurement Error in Nonlinear Models , 1995 .

[11]  Sylvia Richardson,et al.  Bayesian mapping of disease , 1995 .

[12]  N. E. Breslow Statistical Methods in Cancer Research , 1986 .

[13]  A. Swerdlow,et al.  Completeness of cancer and death follow-up obtained through the National Health Service Central Register for England and Wales. , 1992, British Journal of Cancer.

[14]  D. Clayton,et al.  Empirical Bayes estimates of age-standardized relative risks for use in disease mapping. , 1987, Biometrics.

[15]  S. Richardson,et al.  Statistical methods for geographical correlation studies , 1996 .

[16]  P Elliott,et al.  Issues in the statistical analysis of small area health data. , 1999, Statistics in medicine.