Differentially Private Graphical Degree Sequences and Synthetic Graphs

We present an algorithm for releasing graphical degree sequences of simple undirected graphs under the framework of differential privacy. The algorithm is designed to provide utility for statistical inference in random graph models whose sufficient statistics are functions of degree sequences. Specifically, we focus on the tasks of existence of maximum likelihood estimates, parameter estimation and goodness-of-fit testing for the beta model of random graphs. We show the usefulness of our algorithm by evaluating it empirically on simulated and real-life datasets. As the released degree sequence is graphical, our algorithm can also be used to release synthetic graphs under the beta model.

[1]  J. F. C. Kingman,et al.  Information and Exponential Families in Statistical Theory , 1980 .

[2]  Panos M. Pardalos,et al.  Algorithms for a Class of Isotonic Regression Problems , 1999, Algorithmica.

[3]  T. Snijders Accounting for degree distributions in empirical analysis of network dynamics , 2003 .

[4]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[5]  F. T. Wright,et al.  Order restricted statistical inference , 1988 .

[6]  Hisayuki Hara,et al.  Graver basis for an undirected graph and its application to testing the beta model of random graphs , 2011, 1102.2583.

[7]  Aleksandra B. Slavkovic,et al.  Differentially Private Synthetic graphs , 2012, ArXiv.

[8]  Lei Chen,et al.  A Survey of Privacy-Preservation of Graphs and Social Networks , 2010, Managing and Mining Graph Data.

[9]  S. Hakimi On Realizability of a Set of Integers as Degrees of the Vertices of a Linear Graph. I , 1962 .

[10]  Allan Sly,et al.  Random graphs with a given degree sequence , 2010, 1005.1136.

[11]  János Pach,et al.  Bounded-Degree Graphs can have Arbitrarily Large Slope Numbers , 2006, Electron. J. Comb..

[12]  Moni Naor,et al.  Our Data, Ourselves: Privacy Via Distributed Noise Generation , 2006, EUROCRYPT.

[13]  Stephen E. Fienberg,et al.  Maximum Likelihood Estimation in Network Models , 2011, ArXiv.

[14]  S. Sivasubramanian,et al.  The Polytope of Degree Partitions , 2006, Electron. J. Comb..

[15]  David D. Jensen,et al.  Accurate Estimation of the Degree Distribution of Private Networks , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[16]  Sofya Raskhodnikova,et al.  Smooth sensitivity and sampling in private data analysis , 2007, STOC '07.

[17]  Cynthia Dwork,et al.  Calibrating Noise to Sensitivity in Private Data Analysis , 2006, TCC.

[18]  L. Amaral,et al.  The web of human sexual contacts , 2001, Nature.

[19]  Sofya Raskhodnikova,et al.  Private analysis of graph structure , 2011, Proc. VLDB Endow..

[20]  O. Barndorff-Nielsen Information and Exponential Families in Statistical Theory , 1980 .

[21]  Persi Diaconis,et al.  A Sequential Importance Sampling Algorithm for Generating Random Graphs with Prescribed Degrees , 2011, Internet Math..

[22]  Serge Vaudenay,et al.  Advances in Cryptology - EUROCRYPT 2006 , 2006, Lecture Notes in Computer Science.