Impacts of Positional Error on Spatial Regression Analysis: A Case Study of Address Locations in Syracuse, New York

Positional error is the error produced by the discrepancy between reference and recorded locations. In urban landscapes, locations typically are obtained from global positioning systems or geocoding software. Although these technologies have improved the locational accuracy of georeferenced data, they are not error free. This error affects results of any spatial statistical analysis performed with a georeferenced dataset. In this paper we discuss the properties of positional error in an address matching exercise and the allocation of point locations to census geography units. We focus on the error's spatial structure, and more particularly on impacts of error propagation in spatial regression analysis. For this purpose we use two geocoding sources, we briefly describe the magnitude and the nature of their discrepancies, and we evaluate the consequences that this type of locational error has on a spatial regression analysis of pediatric blood lead data for Syracuse, NY. Our findings include: (1) the confirmation of the recurrence of spatial clustering in positional error at various geographic resolutions; and, (2) the identification of a noticeable but not shockingly large impact from positional error propagation in spatial auto-binomial regression analysis results for the dataset analyzed.

[1]  Daniel A. Griffith,et al.  Spatial error propagation when computing linear combinations of spectral bands: The case of vegetation indices , 2003, Environmental and Ecological Statistics.

[2]  J. Scepan,et al.  Thematic validation of high-resolution Global Land-Cover Data sets , 1999 .

[3]  Wende A O'Neill,et al.  Location Translation Within a Geographic Information System , 1997 .

[4]  S V Subramanian,et al.  Painting a truer picture of US socioeconomic and racial/ethnic health inequalities: the Public Health Disparities Geocoding Project. , 2005, American journal of public health.

[5]  Gerard Rushton,et al.  Geocoding in cancer research: a review. , 2006, American journal of preventive medicine.

[6]  Daniel A. Griffith,et al.  Error Propagation Modeling in Raster GIS: Adding and Ratioing Operations , 1999 .

[7]  Daniel A. Griffith,et al.  Error Propagation Modelling in Raster GIS: Overlay Operations , 1998, Int. J. Geogr. Inf. Sci..

[8]  Daniel A. Griffith,et al.  A Spatial Filtering Specification for the Autologistic Model , 2004 .

[9]  Yee Leung,et al.  A Locational Error Model for Spatial Features , 1998, Int. J. Geogr. Inf. Sci..

[10]  Howard Veregin,et al.  Quantifying positional error induced by line simplification , 2000, Int. J. Geogr. Inf. Sci..

[11]  N Krieger,et al.  Re: "Use of census-based aggregate variables to proxy for socioeconomic group: evidence from national samples". , 1999, American journal of epidemiology.

[12]  Martin Kulldorff,et al.  Lumping or splitting: seeking the preferred areal unit for health geography studies , 2005, International journal of health geographics.

[13]  Frederick R. Broome,et al.  Partnering for the People: Improving the U.S. Census Bureau’s MAF/TIGER Database , 2003 .

[14]  N Krieger,et al.  Changing to the 2000 standard million: are declining racial/ethnic and socioeconomic inequalities in health real progress or statistical illusion? , 2001, American journal of public health.

[15]  Joanne S Colt,et al.  Positional Accuracy of Two Methods of Geocoding , 2005, Epidemiology.

[16]  J W Hogan,et al.  On the wrong side of the tracts? Evaluating the accuracy of geocoding in public health research. , 2001, American journal of public health.

[17]  J Bound,et al.  Use of census-based aggregate variables to proxy for socioeconomic group: evidence from national samples. , 1998, American journal of epidemiology.

[18]  Stephen V. Stehman,et al.  Design and Analysis for Thematic Map Accuracy Assessment: Fundamental Principles , 1998 .

[19]  Daniel A. Griffith,et al.  A Tale of Two Swaths: Urban Childhood Blood-Lead Levels across Syracuse, New York , 1998 .

[20]  Craig A. Knoblock,et al.  Exploiting online sources to accurately geocode addresses , 2004, GIS '04.

[21]  Jerry H. Ratcliffe,et al.  On the accuracy of TIGER-type geocoded address data in relation to cadastral and census areal units , 2001, Int. J. Geogr. Inf. Sci..

[22]  Jeremy C. Weiss,et al.  Comparing a single-stage geocoding method to a multi-stage geocoding method: how much and where do they disagree? , 2007, International Journal of Health Geographics.

[23]  N. Levine,et al.  The location of motor vehicle crashes in Honolulu: a methodology for geocoding intersections , 1998 .

[24]  Jing Nie,et al.  Positional Accuracy of Geocoded Addresses in Epidemiologic Research , 2003, Epidemiology.

[25]  M. Hansen,et al.  The DISCover validation image interpretation process , 1999 .

[26]  S V Subramanian,et al.  Zip code caveat: bias due to spatiotemporal mismatches between zip codes and US census-defined geographic areas--the Public Health Disparities Geocoding Project. , 2002, American journal of public health.