On the wrong side of the tracts? Evaluating the accuracy of geocoding in public health research.

OBJECTIVES This study sought to determine the accuracy of geocoding for public health databases. METHODS A test file of 70 addresses, 50 of which involved errors, was generated, and the file was geocoded to the census tract and block group levels by 4 commercial geocoding firms. Also, the "real world" accuracy of the best-performing firm was evaluated. RESULTS Accuracy rates in regard to geocoding of the test file ranged from 44% (95% confidence interval [CI] = 32%, 56%) to 84% (95% CI = 73%, 92%). The geocoding firm identified as having the best accuracy rate correctly geocoded 96% of the addresses obtained from the public health databases. CONCLUSIONS Public health studies involving geocoded databases should evaluate and report on methods used to verify accuracy.

[1]  B. Zebrowski,et al.  Lead hot zones and childhood lead poisoning cases, Santa Clara County, California, 1995. , 1999, Journal of public health management and practice : JPHMP.

[2]  M. Szklo,et al.  Neighbourhood differences in diet: the Atherosclerosis Risk in Communities (ARIC) Study. , 1999, Journal of epidemiology and community health.

[3]  Mei-Cheng Wang,et al.  Neighborhood risk factors for low birthweight in Baltimore: a multilevel analysis. , 1997, American journal of public health.

[4]  S. E. Thrall,et al.  Geographic information system (GIS) hardware and software. , 1999, Journal of public health management and practice : JPHMP.

[5]  T. Richards,et al.  Toward a GIS sampling frame for surveys of local health departments and local boards of health. , 1999, Journal of public health management and practice : JPHMP.

[6]  L. Daly,et al.  Simple SAS macros for the calculation of exact binomial and Poisson confidence limits. , 1992, Computers in biology and medicine.

[7]  T. Carpenter,et al.  Spatial analytical methods and geographic information systems: use in health research and epidemiology. , 1999, Epidemiologic reviews.

[8]  G. Thrall,et al.  The future of GIS in public health management and practice. , 1999, Journal of public health management and practice : JPHMP.

[9]  G Rushton Methods to evaluate geographic access to health services. , 1999, Journal of public health management and practice : JPHMP.

[10]  M. Fraser,et al.  Local health departments and GIS: the perspective of the National Association of County and City Health Officials. , 1999, Journal of public health management and practice : JPHMP.

[11]  N. Krieger,et al.  Can we monitor socioeconomic inequalities in health? A survey of U.S. health departments' data collection and reporting practices. , 1997, Public health reports.

[12]  M. Cetron,et al.  Geocoding and linking data from population-based surveillance and the US Census to evaluate the impact of median household income on the epidemiology of invasive Streptococcus pneumoniae infections. , 1998, American journal of epidemiology.

[13]  T. Richards,et al.  Geographic information systems and public health: mapping the future. , 1999, Public health reports.

[14]  S V Subramanian,et al.  Zip code caveat: bias due to spatiotemporal mismatches between zip codes and US census-defined geographic areas--the Public Health Disparities Geocoding Project. , 2002, American journal of public health.

[15]  N. Krieger Overcoming the absence of socioeconomic data in medical records: validation and application of a census-based methodology. , 1992, American journal of public health.

[16]  M F MacDorman,et al.  State initiatives in geocoding vital statistics data. , 1999, Journal of public health management and practice : JPHMP.