Degree Distribution for Duplication-Divergence Graphs: Large Deviations

We present a rigorous and precise analysis of the degree distribution in a dynamic graph model introduced by Solé et al. in which nodes are added according to a duplication-divergence mechanism, i.e. by iteratively copying a node and then randomly inserting and deleting some edges for a copied node. This graph model finds many applications since it well captures the growth of some real-world processes e.g. biological or social networks. However, there are only a handful of rigorous results concerning this model. In this paper we present rigorous results concerning the degree distribution. We focus on two related problems: the expected value and large deviation for the degree of a fixed node through the evolution of the graph and the expected value and large deviation of the average degree in the graph. We present exact and asymptotic results showing that both quantities may decrease or increase over time depending on the model parameters. Our findings are a step towards a better understanding of the overall graph behaviors, especially, degree distribution, symmetry, and compression, important open problems in this area.

[1]  Frank Harary,et al.  Graph Theory , 2016 .

[2]  M. Abramowitz,et al.  Handbook of Mathematical Functions With Formulas, Graphs and Mathematical Tables (National Bureau of Standards Applied Mathematics Series No. 55) , 1965 .

[3]  Wojciech Szpankowski,et al.  Compression of Dynamic Graphs Generated by a Duplication Model , 2018, Algorithmica.

[4]  Jonathan Jordan,et al.  The connected component of the partial duplication graph , 2017, 1712.06411.

[5]  Wojciech Szpankowski,et al.  Asymmetry and structural information in preferential attachment graphs , 2016, Random Struct. Algorithms.

[6]  Wojciech Szpankowski,et al.  Temporal Ordered Clustering in Dynamic Networks , 2019, 2020 IEEE International Symposium on Information Theory (ISIT).

[7]  Yi Yang,et al.  Choosing appropriate models for protein-protein interaction networks: a comparison study , 2014, Briefings Bioinform..

[8]  Si Li,et al.  Degree distribution of large networks generated by the partial duplication model , 2013, Theor. Comput. Sci..

[9]  Wojciech Szpankowski,et al.  Compression of Graphical Structures: Fundamental Limits, Algorithms, and Experiments , 2012, IEEE Transactions on Information Theory.

[10]  Michalis Faloutsos,et al.  On power-law relationships of the Internet topology , 1999, SIGCOMM '99.

[11]  Wojciech Szpankowski,et al.  Inferring Temporal Information from a Snapshot of a Dynamic Network , 2019, Scientific Reports.

[12]  Jianzhi Zhang Evolution by gene duplication: an update , 2003 .

[13]  W. Szpankowski Average Case Analysis of Algorithms on Sequences , 2001 .

[14]  Martin Ester,et al.  Dense Graphlet Statistics of Protein Interaction and Random Networks , 2009, Pacific Symposium on Biocomputing.

[15]  Wojciech Szpankowski,et al.  Revisiting Parameter Estimation in Biological Networks: Influence of Symmetries , 2019, bioRxiv.

[16]  F. Hermann,et al.  Large-scale behavior of the partial duplication random graph , 2014, 1408.0904.

[17]  Ricard V. Solé,et al.  A Model of Large-Scale proteome Evolution , 2002, Adv. Complex Syst..