Can Census Offices Publish Statistics for More than One Small Area Geography? An Analysis of the Differencing Problem in Statistical Disclosure
"The paper describes a problem faced by National Statistical Offices when publishing the results of decennial censuses for small geographical areas. If they publish statistical tables for two or more sets of areas, users can compare the tables and produce new statistics for the areas formed by differencing, which may have populations below confidentiality thresholds. To investigate the problem, the authors construct a software system and carry out a series of experiments using a large synthetic population base for Yorkshire and Humberside [in England]. The results indicate that publishing statistics for zones close in size to the primary areas is not safe unless the zones have been carefully designed. However, publishing statistics for sufficiently large areas such as 5km grid squares or postal sectors alongside enumeration districts is safe."
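The differencing step described in the abstract can be illustrated with a minimal sketch. This is not the authors' software system; the threshold value, the age-group categories, the counts, and the function names `difference_tables` and `is_disclosive` are all hypothetical, and the sketch assumes the enumeration district lies wholly inside each of the larger areas.

```python
# Hypothetical confidentiality threshold; the abstract does not state a value.
THRESHOLD = 50


def difference_tables(outer, inner):
    """Subtract the inner area's published counts from the outer area's counts.

    Assumes the inner area is nested entirely within the outer area.
    """
    return {category: outer[category] - inner.get(category, 0) for category in outer}


def is_disclosive(table, threshold=THRESHOLD):
    """Return True if the differenced area has a nonzero population below the threshold."""
    total = sum(table.values())
    return 0 < total < threshold


# Made-up published counts by age group for three areas.
enumeration_district = {"0-15": 80, "16-64": 250, "65+": 70}
overlapping_zone = {"0-15": 82, "16-64": 257, "65+": 73}
grid_square_5km = {"0-15": 900, "16-64": 3100, "65+": 1000}

# A zone close in size to the enumeration district leaves a small sliver.
sliver = difference_tables(overlapping_zone, enumeration_district)

# A much larger area leaves a remainder well above the threshold.
remainder = difference_tables(grid_square_5km, enumeration_district)

print(sliver, is_disclosive(sliver))
print(remainder, is_disclosive(remainder))
```

Under these assumed numbers, the zone that is only slightly larger than the enumeration district yields a differenced area of 12 people, below the assumed threshold, while the 5km grid square yields a remainder of 4,600, which matches the qualitative pattern the abstract reports.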
[1] D. Martin. From enumeration districts to output areas: experiments in the automated creation of a census output geography. Population Trends, 1997.
[2] C. Marsh et al. The sample of anonymised records. ESRC Data Archive Bulletin, 1991.
[3] Jonathan Raper et al. Postcodes: the new geography. 1992.
[4] Stan Openshaw et al. Census users' handbook. 1995.
[5] C. Marsh et al. Samples of anonymised records from the 1991 census. 1992.