Compression of Preferential Attachment Graphs

We study structural properties of preferential attachment graphs (with parameter m ≥ 1 giving the number of attachment choices that each new vertex makes) which intervene in two complementary algorithmic/statistical/information-theoretic problems involving the information shared between a random graph’s labels and its structure: in structural compression, we seek to compactly describe a graph’s structure by a bit string, throwing away its label information; in node arrival order recovery, we seek to recover node labels, given only a graph structure. In particular, we study the typical size of the automorphism group, as well as some shape parameters (such as the number of linear extensions and height) of the directed version of the graph, which in turn allows us to estimate the typical number of admissible labeled representatives of a given graph structure. Our result on the automorphism group positively settles a conjecture to the effect that, provided that m ≥ 3, preferential attachment graphs are asymmetric with high probability, and completes the characterization of the number of symmetries for a broad range of parameters of the model (i.e., for all fixed m). These results allow us to give an algorithmically efficient, asymptotically optimal algorithm for compression of unlabeled preferential attachment graphs. To show the optimality of our scheme, we also derive new, precise estimates of the Shannon entropy of both the unlabeled and labeled version of the model. Our results also imply inapproximability results for the problem of node arrival order recovery.

[1]  Silvio Lattanzi,et al.  On compressing social networks , 2009, KDD.

[2]  Sergio Verdú,et al.  Compressing data on graphs with clusters , 2017, 2017 IEEE International Symposium on Information Theory (ISIT).

[3]  Martin Sauerhoff Krengelstrasse On the Entropy of Models for the Web Graph , 2009 .

[4]  Emmanuel Abbe,et al.  Graph compression: The effect of clusters , 2016, 2016 54th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[5]  Thomas M. Cover,et al.  Elements of Information Theory (Wiley Series in Telecommunications and Signal Processing) , 2006 .

[6]  Svante Janson,et al.  On Symmetry of Uniform and Preferential Attachment Graphs , 2014, Electron. J. Comb..

[7]  Wojciech Szpankowski,et al.  Compression of Graphical Structures: Fundamental Limits, Algorithms, and Experiments , 2012, IEEE Transactions on Information Theory.

[8]  Jie Zhang,et al.  A Universal Grammar-Based Code for Lossless Compression of Binary Trees , 2013, IEEE Transactions on Information Theory.

[9]  Wojciech Szpankowski,et al.  Recovery of vertex orderings in dynamic graphs , 2017, 2017 IEEE International Symposium on Information Theory (ISIT).

[10]  Venkat Anantharam,et al.  Universal lossless compression of graphical data , 2017, 2017 IEEE International Symposium on Information Theory (ISIT).

[11]  Alan M. Frieze,et al.  A general model of web graphs , 2003, Random Struct. Algorithms.

[12]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[13]  Wojciech Szpankowski,et al.  Asymmetry and structural information in preferential attachment graphs , 2016, Random Struct. Algorithms.