Spectral Graph Forge: Graph Generation Targeting Modularity

Community structure is an important property that captures inhomogeneities common in large networks, and modularity is one of the most widely used metrics for such community structure. In this paper, we introduce a principled methodology, the Spectral Graph Forge, for generating random graphs that preserves community structure from a real network of interest, in terms of modularity. Our approach leverages the fact that the spectral structure of matrix representations of a graph encodes global information about community structure. The Spectral Graph Forge uses a low-rank approximation of the modularity matrix to generate synthetic graphs that match a target modularity within user-selectable degree of accuracy, while allowing other aspects of structure to vary. We show that the Spectral Graph Forge outperforms state-of-the-art techniques in terms of accuracy in targeting the modularity and randomness of the realizations, while also preserving other local structural properties and node attributes. We discuss extensions of the Spectral Graph Forge to target other properties beyond modularity, and its applications to anonymization.

[1]  Priya Mahadevan,et al.  Systematic topology analysis and generation using degree correlations , 2006, SIGCOMM.

[2]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Chiara Orsini,et al.  Hyperbolic graph generator , 2015, Comput. Phys. Commun..

[4]  Garry Robins,et al.  An introduction to exponential random graph (p*) models for social networks , 2007, Soc. Networks.

[5]  Jure Leskovec,et al.  Learning to Discover Social Circles in Ego Networks , 2012, NIPS.

[6]  L. Mirsky SYMMETRIC GAUGE FUNCTIONS AND UNITARILY INVARIANT NORMS , 1960 .

[7]  Benoit Donnet,et al.  Internet topology discovery: a survey , 2007, IEEE Communications Surveys & Tutorials.

[8]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[9]  T. Snijders,et al.  Estimation and Prediction for Stochastic Blockstructures , 2001 .

[10]  Kamil Saraç,et al.  Resolving Anonymous Routers in Internet Topology Measurement Studies , 2008, IEEE INFOCOM 2008 - The 27th Conference on Computer Communications.

[11]  Michael Hicks,et al.  Deanonymizing mobility traces: using social network as a side-channel , 2012, CCS.

[12]  Mark E. J. Newman,et al.  Stochastic blockmodels and community structure in networks , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[13]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[14]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[15]  Krishna P. Gummadi,et al.  Measurement and analysis of online social networks , 2007, IMC '07.

[16]  Ajay Mehra The Development of Social Network Analysis: A Study in the Sociology of Science , 2005 .

[17]  Zoran Levnajic,et al.  Revealing the Hidden Language of Complex Networks , 2014, Scientific Reports.

[18]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[19]  J. Coleman Introduction to Mathematical Sociology , 1965 .

[20]  Chiara Orsini,et al.  Quantifying randomness in real networks , 2015, Nature Communications.

[21]  Carl D. Meyer,et al.  Matrix Analysis and Applied Linear Algebra , 2000 .

[22]  B. Wellman The Development of Social Network Analysis: A Study in the Sociology of Science , 2008 .

[23]  Piet Van Mieghem,et al.  Generating graphs that approach a prescribed modularity , 2013, Comput. Commun..

[24]  Minas Gjoka,et al.  Construction of simple graphs with a target joint degree matrix and beyond , 2015, 2015 IEEE Conference on Computer Communications (INFOCOM).

[25]  J. V. Rauff,et al.  Introduction to Mathematical Sociology , 2012 .

[26]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[27]  T. Snijders,et al.  Estimation and Prediction for Stochastic Blockmodels for Graphs with Latent Block Structure , 1997 .

[28]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[29]  Martin G. Everett,et al.  Models of core/periphery structures , 2000, Soc. Networks.

[30]  S. Axler Linear Algebra Done Right , 1995, Undergraduate Texts in Mathematics.

[31]  Peter D. Hoff,et al.  Latent Space Approaches to Social Network Analysis , 2002 .

[32]  Stanford,et al.  Learning to Discover Social Circles in Ego Networks , 2012 .

[33]  Virgílio A. F. Almeida,et al.  Characterizing user behavior in online social networks , 2009, IMC '09.

[34]  Prateek Mittal,et al.  SecGraph: A Uniform and Open-source Evaluation System for Graph Data Anonymization and De-anonymization , 2015, USENIX Security Symposium.

[35]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.