Enhancing utility via clustering in privacy preservation

When sensitive information needs to be released involved privacy preservation, which has attracted a widely range of research interests. K-anonymity is one of the most important anonymity models that have been Extensively research and many thinking have been presented to improve it. A way to make data anonymous is generalization. Cluster is a collection of data objects, which are similar to the objects in the same cluster, and different from the objects in other clusters. Both of these two methods have the same characteristics: assign the tuples to many small groups. Based on the above problems, we propose a clustering-based k-anonymity algorithm, which achieves k-anonymity through clustering.

[1]  Kyriakos Mouratidis,et al.  Preventing Location-Based Identity Inference in Anonymous Spatial Queries , 2007, IEEE Transactions on Knowledge and Data Engineering.

[2]  David J. DeWitt,et al.  Mondrian Multidimensional K-Anonymity , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[3]  Panos Kalnis,et al.  Fast Data Anonymization with Low Information Loss , 2007, VLDB.

[4]  Jian Pei,et al.  Utility-based anonymization using local recoding , 2006, KDD '06.

[5]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[6]  Nikos Mamoulis,et al.  Non-homogeneous generalization in privacy preserving data publishing , 2010, SIGMOD Conference.

[7]  Pierangela Samarati,et al.  Protecting Respondents' Identities in Microdata Release , 2001, IEEE Trans. Knowl. Data Eng..

[8]  Latanya Sweeney,et al.  k-Anonymity: A Model for Protecting Privacy , 2002, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[9]  D. DeWitt,et al.  K-Anonymization as Spatial Indexing: Toward Scalable and Incremental Anonymization , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[10]  David J. DeWitt,et al.  Incognito: efficient full-domain K-anonymity , 2005, SIGMOD '05.

[11]  Pierangela Samarati,et al.  Generalizing Data to Provide Anonymity when Disclosing Information , 1998, PODS 1998.