论文信息 - TopDown-KACA: An efficient local-recoding algorithm for k-anonymity

TopDown-KACA: An efficient local-recoding algorithm for k-anonymity

K-anonymity is an effective model for protecting privacy while publishing data. KACA algorithm is a typical generalization algorithm for k-anonymity, which can generate small information loss, but its efficiency is low, especially when dataset is large. Another generalization algorithm, topDown, has high efficiency but generates heavy information loss. In this paper, we propose an efficient generalization algorithm for k-anonymity, called topDown-KACA, which combines the topDown algorithm with the KACA algorithm. The idea of topDown-KACA algorithm is to partition the whole dataset into some subsets by topDown algorithm at first, and then k-anonymize these subsets by KACA algorithm respectively. Experiments show that the proposed algorithm is more efficient than KACA algorithm with similar information loss, and generates less information loss than topDown algorithm with similar execution time.

[1] Pierangela Samarati,et al. Generalizing Data to Provide Anonymity when Disclosing Information , 1998, PODS 1998.

[2] Pierangela Samarati,et al. Protecting Respondents' Identities in Microdata Release , 2001, IEEE Trans. Knowl. Data Eng..

[3] Latanya Sweeney,et al. k-Anonymity: A Model for Protecting Privacy , 2002, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[4] David J. DeWitt,et al. Incognito: efficient full-domain K-anonymity , 2005, SIGMOD '05.

[5] Raymond Chi-Wing Wong,et al. Achieving k-Anonymity by Clustering in Attribute Hierarchical Structures , 2006, DaWaK.

[6] Raymond Chi-Wing Wong,et al. (α, k)-anonymity: an enhanced k-anonymity model for privacy preserving data publishing , 2006, KDD '06.

[7] Raymond Chi-Wing Wong,et al. Anonymization by Local Recoding in Data with Attribute Hierarchical Taxonomies , 2008, IEEE Transactions on Knowledge and Data Engineering.

[8] Ninghui Li,et al. Towards optimal k-anonymization , 2008, Data Knowl. Eng..