High Performance Datafly based Anonymity Algorithm and Its L-Diversity

Data anonymity, as an effective privacy protection method, has been widely used in real applications. High performance data anonymity algorithm is especially attractive for those massive data applications. In this paper, the authors propose a novel and efficient Datafly based data anonymity Divide-Datafly algorithm and the experimental results show that the proposed algorithm is not only more efficient than Datafly and Incognito, but also information loss less than KACA. Moreover, in order to improve the security of anonymous data, L-Divide-Datafly is presented that it combines Divide-Datafly and efficient distance based clustering. Experimental results show that L-Divide-Datafly achieves great performance both in execution time and Information loss.

[1]  Abdel-Badeeh M. Salem,et al.  Cryptographic Cloud Computing Environment as a More Trusted Communication Environment , 2014, Int. J. Grid High Perform. Comput..

[2]  Jian Pei,et al.  Utility-based anonymization using local recoding , 2006, KDD '06.

[3]  Latanya Sweeney,et al.  Achieving k-Anonymity Privacy Protection Using Generalization and Suppression , 2002, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[4]  Qian Wang,et al.  (a, d)-Diversity: Privacy Protection Based on l-Diversity , 2009, 2009 WRI World Congress on Software Engineering.

[5]  Philip S. Yu,et al.  Bottom-up generalization: a data mining solution to privacy protection , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[6]  Traian Marius Truta,et al.  Protection : p-Sensitive k-Anonymity Property , 2006 .

[7]  Juan Yu,et al.  TopDown-KACA: An efficient local-recoding algorithm for k-anonymity , 2009, 2009 IEEE International Conference on Granular Computing.

[8]  David J. DeWitt,et al.  Incognito: efficient full-domain K-anonymity , 2005, SIGMOD '05.

[9]  Philip S. Yu,et al.  Top-down specialization for information and privacy preservation , 2005, 21st International Conference on Data Engineering (ICDE'05).

[10]  Jun Yang,et al.  A new anonymity model for privacy-preserving data publishing , 2014 .

[11]  Latanya Sweeney,et al.  k-Anonymity: A Model for Protecting Privacy , 2002, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[12]  ASHWIN MACHANAVAJJHALA,et al.  L-diversity: privacy beyond k-anonymity , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[13]  Raymond Chi-Wing Wong,et al.  Achieving k-Anonymity by Clustering in Attribute Hierarchical Structures , 2006, DaWaK.

[14]  Raymond Chi-Wing Wong,et al.  (α, k)-anonymity: an enhanced k-anonymity model for privacy preserving data publishing , 2006, KDD '06.

[15]  Pierangela Samarati,et al.  Generalizing Data to Provide Anonymity when Disclosing Information , 1998, PODS 1998.