A differential privacy multidimensional data release model

As a new privacy notion, differential privacy, a hot topic in the privacy preserving of data release and mining, has grown in popularity recently. In order to protect the privacy of users, we need to find a differential privacy multidimensional data release schema. The paper gives a simple differential privacy multidimensional data release model based on the improved kd-Tree algorithm. Firstly, divide the data cube into several parts using the kd-Tree algorithm. In this step, the information entropy was introduced to select the partition dimension to improve the utility of kd-Tree algorithm. Secondly, add the noise information to the partitioned database with the Laplace mechanism. And then, it is provided that the model could ensure differential privacy while retaining the utility of the common queries.

[1]  Cynthia Dwork,et al.  Differential Privacy: A Survey of Results , 2008, TAMC.

[2]  Cynthia Dwork,et al.  Calibrating Noise to Sensitivity in Private Data Analysis , 2006, TCC.

[3]  Tim Roughgarden,et al.  Interactive privacy via the median mechanism , 2009, STOC '10.

[4]  Cynthia Dwork,et al.  Calibrating Noise to Sensitivity in Private Data Analysis , 2016, J. Priv. Confidentiality.

[5]  Daniel A. Spielman,et al.  Spectral Graph Theory and its Applications , 2007, 48th Annual IEEE Symposium on Foundations of Computer Science (FOCS'07).

[6]  Li Xiong,et al.  DPCube: Releasing Differentially Private Data Cubes for Health Information , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[7]  Chun Yuan,et al.  Differentially Private Data Release through Multidimensional Partitioning , 2010, Secure Data Management.

[8]  Cynthia Dwork,et al.  Differential Privacy , 2006, ICALP.

[9]  Thorsten Joachims,et al.  A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization , 1997, ICML.

[10]  Benjamin C. M. Fung,et al.  Differentially private transit data publication: a case study on the montreal transportation system , 2012, KDD.

[11]  Kunal Talwar,et al.  Mechanism Design via Differential Privacy , 2007, 48th Annual IEEE Symposium on Foundations of Computer Science (FOCS'07).

[12]  Frank McSherry,et al.  Privacy integrated queries: an extensible platform for privacy-preserving data analysis , 2009, SIGMOD Conference.

[13]  Guy N. Rothblum,et al.  A Multiplicative Weights Mechanism for Privacy-Preserving Data Analysis , 2010, 2010 IEEE 51st Annual Symposium on Foundations of Computer Science.