Geographic displacement procedure and georeferenced data release policy for the Demographic and Health Surveys.

Georeferencing population-based surveys such as the Demographic and Health Surveys (DHS) have many benefits. Most important researchers can analyze respondent locations spatially to identify geographical patterns associated with specific demographic and health outcomes and programs. Second the proximity of survey communities to geographic locations such as health centers roads and cities can serve as a proxy for access to services; and third data from sampled locations can be aggregated to form new units of analysis such as climatic zones or program intervention areas rather than being constrained to administrative units. However while it is important to make available to researchers analysts and policymakers the georeferenced data from population-based surveys it is also important to maintain the confidentiality of survey respondents. This report describes the geographic displacement procedures and georeferenced data release policy developed by the DHS project to protect the identity of survey respondents. The georeferenced data release policy applies specifically to the release of georeferenced data from DHS household surveys. It aims to balance the need to protect respondent confidentiality with the need to make available to the public analytically useful data. The policy incorporates two levels of protection: first data from the same enumeration area (EA) are aggregated to a single point coordinate; then the coordinate is geomasked through use of the Global Positioning System (GPS) coordinate displacement process. In DHS household surveys the GPS coordinate displacement process is carried out as follows: urban clusters are displaced a distance up to two kilometers (0-2 km) and rural clusters are displaced a distance up to five kilometers (0-5 km) with a further randomly-selected 1% (every 100th) of rural clusters displaced a distance up to 10 kilometers (0-10 km). (Excerpt)

[1]  Leah K VanWey,et al.  Confidentiality and spatially explicit data: Concerns and challenges , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[2]  William B Allshouse,et al.  Geomasking sensitive health data and privacy protection: an evaluation using an E911 database , 2010, Geocarto international.

[3]  Ronald E. McRoberts,et al.  Estimating and circumventing the effects of perturbing and swapping inventory plot locations , 2005 .

[4]  William B Allshouse,et al.  Practice of Epidemiology Mapping Health Data: Improved Privacy Protection With Donut Method Geomasking , 2010 .

[5]  John S Brownstein,et al.  An unsupervised classification method for inferring original case locations from low-resolution disease maps , 2006, International journal of health geographics.

[6]  J. Spencer,et al.  Geographically linking population and facility surveys: methodological considerations , 2013, Population Health Metrics.

[7]  A. Curtis,et al.  Spatial confidentiality and GIS: re-engineering mortality locations from published maps about Hurricane Katrina , 2006, International journal of health geographics.

[8]  G. Rushton,et al.  Geographically masking health data to preserve confidentiality. , 1999, Statistics in medicine.

[9]  Joshua L. Warren,et al.  Guidelines on the use of DHS GPS data , 2013 .

[10]  Clara R. Burgert,et al.  Response to "Problems of spatial linkage of a geo-referenced Demographic and Health Survey (DHS) dataset to a population census: A case study of Egypt" , 2012, Comput. Environ. Urban Syst..

[11]  Josep Domingo-Ferrer,et al.  Statistical Disclosure Control , 2012 .

[12]  John S Brownstein,et al.  No place to hide--reverse identification of patients from published maps. , 2006, The New England journal of medicine.

[13]  Marc P. Armstrong,et al.  Geographic Information Technologies and Personal Privacy , 2005, Cartogr. Int. J. Geogr. Inf. Geovisualization.