论文信息 - Estimating the re-identification risk per record in microdata

Estimating the re-identification risk per record in microdata

A measure of re-identification risk at the record level has a variety of potential uses in statistical disclosure control for microdata. The conceptual basis of such a measure is considered. The risk is conceived of broadly as the evidence in support of a link between the record and the unit in the population from which it is derived. For discrete key variables subject to no measurement error, a measure is derived which reflects the probability that the record is unique in the population. Under certain assumptions, two approaches are described for estimating this measure from the microdata. These approaches are applied to a 10% sample of microdata from the 1991 Census in Great Britain. It is found that the resulting risk measures can indeed be used successfully to establish whether sample unique records are unique in the population. The implications of these findings are discussed.

Chris J. Skinner | C. Skinner | D. Holmes | D. J. Holmes

[1] C. J. Skinner,et al. Modelling population uniqueness , 1993 .

[2] G. Paass. Disclosure Risk and Disclosure Avoidance for Microdata , 1988 .

[3] W. Keller,et al. Disclosure control of microdata , 1990 .

[4] D. Lambert. Measures of Disclosure Risks and Harm , 1993 .

[5] C. Skinner,et al. Disclosure control for census microdata , 1994 .

[6] C. Skinner,et al. Safe data versus safe setting: access to microdata from the British Census , 1994 .

[7] Alan Agresti,et al. Categorical Data Analysis , 1991, International Encyclopedia of Statistical Science.

[8] Ton de Waal,et al. Statistical Disclosure Control in Practice , 1996 .

[9] S. Keller-McNulty,et al. Estimation of Identi ® cation Disclosure Risk in Microdata , 1999 .

[10] Uwe Blien,et al. Disclosure risk for microdata stemming from official statistics , 1992 .

[11] S. Fienberg,et al. Con ® dentiality , Uniqueness , and Disclosure Limitation for Categorical Data 1 , 1999 .

[12] D. Lambert,et al. The Risk of Disclosure for Microdata , 1989 .