Error propagation in spatial modeling of public health data: a simulation approach using pediatric blood lead level data for Syracuse, New York

Lead poisoning produces serious health problems, which are worse when a victim is younger. The US government and society have tried to prevent lead poisoning, especially since the 1970s; however, lead exposure remains prevalent. Lead poisoning analyses frequently use georeferenced blood lead level data. Like other types of data, these spatial data may contain uncertainties, such as location and attribute measurement errors, which can propagate to analysis results. For this paper, simulation experiments are employed to investigate how selected uncertainties impact regression analyses of blood lead level data in Syracuse, New York. In these simulations, location error and attribute measurement error, as well as a combination of these two errors, are embedded into the original data, and then these data are aggregated into census block group and census tract polygons. These aggregated data are analyzed with regression techniques, and comparisons are reported between the regression coefficients and their standard errors for the error added simulation results and the original results. To account for spatial autocorrelation, the eigenvector spatial filtering method and spatial autoregressive specifications are utilized with linear and generalized linear models. Our findings confirm that location error has more of an impact on the differences than does attribute measurement error, and show that the combined error leads to the greatest deviations. Location error simulation results show that smaller administrative units experience more of a location error impact, and, interestingly, coefficients and standard errors deviate more from their true values for a variable with a low level of spatial autocorrelation. These results imply that uncertainty, especially location error, has a considerable impact on the reliability of spatial analysis results for public health data, and that the level of spatial autocorrelation in a variable also has an impact on modeling results.

[1]  R. Byers,et al.  LATE EFFECTS OF LEAD POISONING ON MENTAL DEVELOPMENT , 1944 .

[2]  Gerard B. M. Heuvelink,et al.  A probabilistic framework for representing and simulating uncertain environmental variables , 2007, Int. J. Geogr. Inf. Sci..

[3]  P. Auinger,et al.  Cognitive deficits associated with blood lead concentrations <10 microg/dL in US children and adolescents. , 2000, Public health reports.

[4]  A. Comber,et al.  Approaches to Uncertainty in Spatial Data , 2010 .

[5]  Daniel A. Griffith,et al.  A Tale of Two Swaths: Urban Childhood Blood-Lead Levels across Syracuse, New York , 1998 .

[6]  Thomas O Talbot,et al.  Positional error in automated geocoding of residential addresses , 2003, International journal of health geographics.

[7]  Y. B. Wah,et al.  Power comparisons of Shapiro-Wilk , Kolmogorov-Smirnov , Lilliefors and Anderson-Darling tests , 2011 .

[8]  Daniel A. Griffith,et al.  Impacts of Positional Error on Spatial Regression Analysis: A Case Study of Address Locations in Syracuse, New York , 2007, Trans. GIS.

[9]  S C Darby,et al.  Some aspects of measurement error in explanatory variables for continuous and binary regression models. , 1998, Statistics in medicine.

[10]  Daniel A. Griffith,et al.  Semiparametric Filtering of Spatial Autocorrelation: The Eigenvector Approach , 2007 .

[11]  S. Dearwent,et al.  Locational uncertainty in georeferencing public health datasets , 2001, Journal of Exposure Analysis and Environmental Epidemiology.

[12]  J. Annest,et al.  National estimates of blood lead levels: United States, 1976-1980: association with selected demographic and socioeconomic factors. , 1982, The New England journal of medicine.

[13]  Daniel A. Griffith,et al.  A quality assessment of eigenvector spatial filtering based parameter estimates for the normal probability model , 2014 .

[14]  Wenzhong Shi,et al.  A stochastic process-based model for the positional error of line segments in GIS , 2000, Int. J. Geogr. Inf. Sci..

[15]  Evaluating Eigenvector Spatial Filter Corrections for Omitted Georeferenced Variables , 2016 .

[16]  D. Griffith,et al.  The geographic distribution of metals in urban soils: the case of Syracuse, NY , 2009 .

[17]  S. McLafferty,et al.  GIS and Public Health , 2002 .

[18]  Gerard Rushton,et al.  Accuracy of residential geocoding in the Agricultural Health Study , 2014, International Journal of Health Geographics.

[19]  David W. S. Wong The Modifiable Areal Unit Problem (MAUP) , 2004 .

[20]  D. Griffith,et al.  Spatial Data Analysis Uncertainties Introduced by Selected Sources of Error , 2017 .

[21]  Jane Elith,et al.  Error and uncertainty in habitat models , 2006 .

[22]  Daniel A. Griffith,et al.  Uncertainty-Related Research Issues in Spatial Analysis , 2015 .

[23]  J. Schneider,et al.  Lead neurotoxicity in children: basic mechanisms and clinical correlates. , 2003, Brain : a journal of neurology.

[24]  Daniel A. Griffith,et al.  Exploring Relationships Between the Global and Regional Measures of Spatial Autocorrelation , 2003 .

[25]  Vincent B. Robinson,et al.  ABOUT DIFFERENT KINDS OF UNCERTAINTY IN COLLECTIONS OF SPATIAL DATA , 2008 .

[26]  R. Prentice,et al.  Measurement error and results from analytic epidemiology: dietary fat and breast cancer. , 1996, Journal of the National Cancer Institute.

[27]  W. Wheeler,et al.  Blood Lead Levels in Children Aged 1–5 Years — United States, 1999–2010 , 2013, MMWR. Morbidity and mortality weekly report.

[28]  Amy M. Kephart,et al.  Assessment of blood lead level declines in an area of historical mining with a holistic remediation and abatement program. , 2016, Environmental research.

[29]  Gerard B.M. Heuvelink,et al.  Uncertainty analysis in environmental modelling under a change of spatial scale , 1998, Nutrient Cycling in Agroecosystems.

[30]  P A Zandbergen,et al.  Error propagation models to examine the effects of geocoding quality on spatial analysis of individual-level datasets. , 2012, Spatial and spatio-temporal epidemiology.

[31]  Paul A. Zandbergen,et al.  A comparison of address point, parcel and street geocoding techniques , 2008, Comput. Environ. Urban Syst..

[32]  Daniel A. Griffith,et al.  Error Propagation Modelling in Raster GIS: Overlay Operations , 1998, Int. J. Geogr. Inf. Sci..

[33]  S. O’Brien,et al.  Evaluation and Integration of Genetic Signature for Prediction Risk of Nasopharyngeal Carcinoma in Southern China , 2014, BioMed research international.

[34]  Daniel W. Goldberg,et al.  Improving Geocode Accuracy with Candidate Selection Criteria , 2010 .

[35]  Jun Wu,et al.  Improving Spatial Accuracy of Roadway Networks and Geocoded Addresses , 2005, Trans. GIS.

[36]  Lisa H. Mason,et al.  Pb Neurotoxicity: Neuropsychological Effects of Lead Toxicity , 2014, BioMed research international.

[37]  Dale L Zimmerman,et al.  The effects of local street network characteristics on the positional accuracy of automated geocoding for geographic health studies , 2010, International journal of health geographics.

[38]  J. Besag Spatial Interaction and the Statistical Analysis of Lattice Systems , 1974 .

[39]  J. Seward,et al.  Prevention of varicella: recommendations of the Advisory Committee on Immunization Practices (ACIP). , 2007, MMWR. Recommendations and reports : Morbidity and mortality weekly report. Recommendations and reports.

[40]  M. Yassin,et al.  Blood Lead Level in Relation to Awareness and Self Reported Symptoms among Gasoline Station Workers in the Gaza Strip , 2014 .

[41]  D. Griffith Spatial Autocorrelation and Spatial Filtering , 2003 .

[42]  J. Gerberding,et al.  Interpreting and managing blood lead levels < 10 microg/dL in children and reducing childhood exposures to lead: recommendations of CDC's Advisory Committee on Childhood Lead Poisoning Prevention. , 2007, MMWR. Recommendations and reports : Morbidity and mortality weekly report. Recommendations and reports.

[43]  Craig A. Knoblock,et al.  An effective and efficient approach for manually improving geocoded data. , 2008, International journal of health geographics.

[44]  D. Qin,et al.  How Credible Are Shrinking Wage Elasticities of Married Women Labour Supply , 2015 .

[45]  Models of uncertainty in spatial data , 2022 .

[46]  R. Canfield,et al.  Intellectual Impairment in Children with Blood Lead Concentrations below 10 μg per Deciliter , 2003 .

[47]  J. Yoon,et al.  The association between blood lead level and clinical mental disorders in fifty thousand lead-exposed male workers. , 2016, Journal of affective disorders.

[48]  Maged N Kamel Boulos,et al.  The use of interactive graphical maps for browsing medical/health Internet information resources , 2003, International journal of health geographics.

[49]  Daniel A. Griffith,et al.  Error Propagation Modeling in Raster GIS: Adding and Ratioing Operations , 1999 .

[50]  J. Lin-Fu Undue absorption of lead among children--a new look at an old problem. , 1972, The New England journal of medicine.