Residential address errors in public health surveillance data: a description and analysis of the impact on geocoding.

The residential addresses of persons with reportable communicable diseases are increasingly used for spatial monitoring and cluster detection, and public health authorities may direct interventions based on the results of routine spatial surveillance. There has been little assessment, however, of the quality of address data in reportable disease notifications or of the impact of address errors on geocoding and routine public health practice. The objectives of this study were to examine address errors for a selected reportable disease in a large urban center in Canada and to assess the impact of the identified errors on geocoding and on the estimated spatial distribution of the disease. We extracted data for all notifications of campylobacteriosis from the Montreal public health department from 1995 to 2008 and used an address verification algorithm to determine the validity of the residential address for each case and to suggest corrections for invalid addresses. We assessed the types of address errors and the resulting positional errors by calculating the distance between the original address and the corrected address, as well as changes in disease density. Address errors and missing addresses were prevalent in the public health records (10% and 5%, respectively), and they influenced the observed distribution of campylobacteriosis in Montreal: address correction changed case location by a median of 1.1 km. Further examination of the extent of address errors in public health data is essential, as is investigation of how these errors affect routine public health functions.
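
The positional error described above, the distance between an address as originally recorded and its corrected location, can be illustrated with a minimal sketch. It assumes both addresses have already been geocoded to latitude and longitude and uses the haversine great-circle formula; the abstract does not state which distance measure the study used, and the function names `haversine_km` and `median_displacement_km` are hypothetical.

```python
import math
from statistics import median

# Mean Earth radius in kilometres (assumption: spherical Earth model).
EARTH_RADIUS_KM = 6371.0088


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two points given in decimal degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def median_displacement_km(pairs):
    """Median distance over ((lat, lon) original, (lat, lon) corrected) pairs."""
    return median(haversine_km(*orig, *corr) for orig, corr in pairs)
```

Applied to the coordinate pairs of all corrected cases, `median_displacement_km` would give a summary comparable in kind to the median shift reported in the abstract, although the exact value depends on the geocoder and the distance measure chosen.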
