论文信息 - Efficient Anonymizations with Enhanced Utility

Efficient Anonymizations with Enhanced Utility

The k-anonymization method is a commonly used privacy-preserving technique. Previous studies used various measures of utility that aim at enhancing the correlation between the original public data and the generalized public data. We, bearing in mind that a primary goal in releasing the anonymized database for data mining is to deduce methods of predicting the private data from the public data, propose a new information-theoretic measure that aims at enhancing the correlation between the generalized public data and the private data. Such a measure significantly enhances the utility of the released anonymized database for data mining. We then proceed to describe a new and highly efficient algorithm that is designed to achieve $k$-anonymity with high utility. That algorithm is based on a modified version of sequential clustering which is the method of choice in clustering, and it is independent of the underlying measure of utility.

Tamir Tassa | Jacob Goldberger

[1] Raymond Chi-Wing Wong,et al. (α, k)-anonymity: an enhanced k-anonymity model for privacy preserving data publishing , 2006, KDD '06.

[2] Pierangela Samarati,et al. Generalizing Data to Provide Anonymity when Disclosing Information , 1998, PODS 1998.

[3] Adam Meyerson,et al. On the complexity of optimal K-anonymity , 2004, PODS.

[4] Rajeev Motwani,et al. Approximation Algorithms for k-Anonymity , 2005 .

[5] Samir Khuller,et al. Achieving anonymity via clustering , 2006, PODS '06.

[6] Roberto J. Bayardo,et al. Data privacy through optimal k-anonymization , 2005, 21st International Conference on Data Engineering (ICDE'05).

[7] Josep Domingo-Ferrer,et al. A Critique of k-Anonymity and Some of Its Enhancements , 2008, 2008 Third International Conference on Availability, Reliability and Security.

[8] ASHWIN MACHANAVAJJHALA,et al. L-diversity: privacy beyond k-anonymity , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[9] David J. DeWitt,et al. Incognito: efficient full-domain K-anonymity , 2005, SIGMOD '05.

[10] Elisa Bertino,et al. Efficient k -Anonymization Using Clustering Techniques , 2007, DASFAA.

[11] David J. DeWitt,et al. Mondrian Multidimensional K-Anonymity , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[12] Naftali Tishby,et al. Unsupervised document classification using sequential information maximization , 2002, SIGIR '02.

[13] Thomas M. Cover,et al. Elements of Information Theory , 2005 .

[14] Tamir Tassa,et al. k-Anonymization Revisited , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[15] Kyuseok Shim,et al. Approximate algorithms for K-anonymity , 2007, SIGMOD '07.

[16] Chris Clifton,et al. Thoughts on k-Anonymization , 2006, 22nd International Conference on Data Engineering Workshops (ICDEW'06).

[17] Ashwin Machanavajjhala,et al. l-Diversity: Privacy Beyond k-Anonymity , 2006, ICDE.

[18] Tamir Tassa,et al. A practical approximation algorithm for optimal k-anonymity , 2011, Data Mining and Knowledge Discovery.

[19] Vijay S. Iyengar,et al. Transforming data to satisfy privacy constraints , 2002, KDD.

[20] Yehuda Lindell,et al. Privacy Preserving Data Mining , 2002, Journal of Cryptology.

[21] Christos Faloutsos,et al. Analysis of the Clustering Properties of the Hilbert Space-Filling Curve , 2001, IEEE Trans. Knowl. Data Eng..

[22] Daniel Kifer,et al. Injecting utility into anonymized datasets , 2006, SIGMOD Conference.

[23] Tamir Tassa,et al. k-Anonymization with Minimal Loss of Information , 2009, IEEE Transactions on Knowledge and Data Engineering.

[24] Panos Kalnis,et al. A framework for efficient data anonymization under privacy and accuracy constraints , 2009, TODS.

[25] Yufei Tao,et al. Anatomy: simple and effective privacy preservation , 2006, VLDB.