Structural-Based Graph Publishing Under Differential Privacy

Mining data from social and communication network have been attracting recent attention from various research fields. However, these data represented by large-scale graphs are often sensitive and private. It is a necessity of developing algorithms to publish large-scale graph while not revealing sensitive information. As a standard for data privacy preservation, differential privacy based algorithm are also widely used in publishing graph-based dataset. However, previous differential privacy based methods often bring huge computational cost and lack the capability of modeling complicated graph structure. To address these challenge, we propose a novel graph publishing algorithm which combines community detection with differential privacy method. By segmenting the graph into several sub-graphs by community detection, differential privacy methods is able to handle large-scale graphs with complex structure. Experimental results on several datasets demonstrates the promising performance of the proposed algorithm compared with original differential privacy methods.

[1]  Balachander Krishnamurthy,et al.  Class-based graph anonymization for social network data , 2009, Proc. VLDB Endow..

[2]  Divesh Srivastava,et al.  Differentially Private Spatial Decompositions , 2011, 2012 IEEE 28th International Conference on Data Engineering.

[3]  Qian Xiao,et al.  Differentially private network data release via structural inference , 2014, KDD.

[4]  Christos Faloutsos,et al.  Sampling from large graphs , 2006, KDD '06.

[5]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[6]  Cynthia Dwork,et al.  Differential Privacy , 2006, ICALP.

[7]  Leting Wu,et al.  Differential Privacy Preserving Spectral Graph Analysis , 2013, PAKDD.

[8]  Andrea Lancichinetti,et al.  Community detection algorithms: a comparative analysis: invited presentation, extended abstract , 2009, VALUETOOLS.

[9]  Philip S. Yu,et al.  Differentially Private Data Publishing and Analysis: A Survey , 2017, IEEE Transactions on Knowledge and Data Engineering.

[10]  Guido Caldarelli,et al.  Hierarchical mutual information for the comparison of hierarchical community structures in complex networks , 2015, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Ian Goodfellow,et al.  Deep Learning with Differential Privacy , 2016, CCS.

[12]  Philip S. Yu,et al.  Correlated network data publication via differential privacy , 2013, The VLDB Journal.

[13]  Rong Jin,et al.  A Random Matrix Approach to Differential Privacy and Structure Preserved Social Network Graph Publishing , 2013, ArXiv.

[14]  Anand D. Sarwate,et al.  Stochastic gradient descent with differentially private updates , 2013, 2013 IEEE Global Conference on Signal and Information Processing.

[15]  Latanya Sweeney,et al.  k-Anonymity: A Model for Protecting Privacy , 2002, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[16]  Aaron Roth,et al.  The Algorithmic Foundations of Differential Privacy , 2014, Found. Trends Theor. Comput. Sci..