Exact results for directed random networks that grow by node duplication

We present exact analytical results for the degree distribution and for the distribution of shortest path lengths (DSPL) in a directed network model that grows by node duplication. Such models are useful in the study of the structure and growth dynamics of gene regulatory and scientific citation networks. Starting from an initial seed network, at each time step a random node, referred to as a mother node, is selected for duplication. Its daughter node is added to the network and duplicates, with probability p, each one of the outgoing links of the mother node. In addition, the daughter node forms a directed link to the mother node itself. Thus, the model is referred to as the corded directed-node-duplication (DND) model. We obtain analytical results for the in-degree distribution, $P_t(K_{in})$, and for the out-degree distribution, $P_t(K_{out})$, of the network at time t. It is found that the in-degrees follow a shifted power-law distribution, so the network is asymptotically scale free. In contrast, the out-degree distribution is a narrow distribution, that converges to a Poisson distribution in the sparse limit and to a Gaussian distribution in the dense limit. Using these distributions we calculate the mean degree, $\langle K_{in} \rangle_t = \langle K_{out} \rangle_t$. To calculate the DSPL we derive a master equation for the time evolution of the probability $P_t(L=\ell)$, $\ell=1,2,\dots$, that for two nodes, i and j, selected randomly at time t, the shortest path from i to j is of length $\ell$. Solving the master equation, we obtain a closed form expression for $P_t(L=\ell)$. It is found that the DSPL at time t consists of a convolution of the initial DSPL, $P_0(L=\ell)$, with a Poisson distribution and a sum of Poisson distributions. The mean distance, $ _t$, is found to depend logarithmically on the network size, $N_t$, namely the corded DND network is a small-world network.

[1]  S. Teichmann,et al.  Gene regulatory network growth by duplication , 2004, Nature Genetics.

[2]  Reimer Kühn,et al.  Revealing the microstructure of the giant component in random graph ensembles. , 2018, Physical review. E.

[3]  Michael Drmota,et al.  On the profile of random trees , 1997, Random Struct. Algorithms.

[4]  S. Redner,et al.  Infinite-order percolation and giant fluctuations in a protein interaction network. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[5]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[6]  Distribution of shortest path lengths in a class of node duplication network models. , 2017, Physical review. E.

[7]  I. Ispolatov,et al.  Cliques and duplication–divergence network growth , 2005, New journal of physics.

[8]  S. Redner Citation statistics from 110 years of physical review , 2005, physics/0506056.

[9]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[11]  Shirley Soukup,et al.  Evolution by gene duplication. S. Ohno. Springer‐Verlag, New York. 1970. 160 pp , 1974 .

[12]  S. Redner How popular is your paper? An empirical study of the citation distribution , 1998, cond-mat/9804163.

[13]  Michael Golosovsky,et al.  Growing complex network of citations of scientific papers: Modeling and measurements. , 2016, Physical review. E.

[14]  Sharon L. Milgram,et al.  The Small World Problem , 1967 .

[15]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[16]  H. Stanley,et al.  Structure of shells in complex networks. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[17]  Piet Van Mieghem,et al.  Epidemic processes in complex networks , 2014, ArXiv.

[18]  Peter Sanders,et al.  Engineering Route Planning Algorithms , 2009, Algorithmics of Large and Complex Networks.

[19]  Béla Bollobás,et al.  Random Graphs , 1985 .

[20]  Bruce A. Reed,et al.  A Critical Point for Random Graphs with a Given Degree Sequence , 1995, Random Struct. Algorithms.

[21]  U. Alon An introduction to systems biology : design principles of biological circuits , 2019 .

[22]  Reimer Kühn,et al.  Distribution of shortest cycle lengths in random networks. , 2017, Physical review. E.

[23]  Mor Nitzan,et al.  Distance distribution in configuration-model networks. , 2016, Physical review. E.

[24]  I. Ispolatov,et al.  Duplication-divergence model of protein interaction network. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[25]  S. Redner,et al.  Connectivity of growing random networks. , 2000, Physical review letters.

[26]  Michael Golosovsky,et al.  Power-law citation distributions are not scale-free , 2017, Physical review. E.

[27]  Bruce A. Reed,et al.  The Size of the Giant Component of a Random Graph with a Given Degree Sequence , 1998, Combinatorics, Probability and Computing.

[28]  James R. Knight,et al.  A Protein Interaction Map of Drosophila melanogaster , 2003, Science.

[29]  V. Blondel,et al.  Distance distribution in random graphs and application to network exploration. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[30]  David J. Galas,et al.  A duplication growth model of gene expression networks , 2002, Bioinform..

[31]  A. Vázquez Growing network with local rules: preferential attachment, clustering hierarchy, and degree correlations. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[32]  Prahlad T. Ram,et al.  Formation of Regulatory Patterns During Signal Propagation in a Mammalian Cellular Network , 2005, Science.

[33]  Agata Fronczak,et al.  Average path length in random networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[34]  Fan Chung Graham,et al.  Coupling Online and Offline Analyses for Random Power Law Graphs , 2004, Internet Math..

[35]  F. Chung,et al.  The average distances in random graphs with given expected degrees , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[36]  R. Durrett Random Graph Dynamics: References , 2006 .

[37]  S. N. Dorogovtsev,et al.  Metric structure of random networks , 2002, cond-mat/0210085.

[38]  Guido Caldarelli,et al.  Scale-Free Networks , 2007 .

[40]  Ido Kanter,et al.  Oscillations in networks of networks stem from adaptive nodes with memory , 2017, Scientific Reports.

[41]  Ernesto Estrada,et al.  The Structure of Complex Networks: Theory and Applications , 2011 .

[42]  H. Stanley,et al.  Fractal boundaries of complex networks , 2008, 0804.1968.

[43]  Ido Kanter,et al.  Broadband macroscopic cortical oscillations emerge from intrinsic neuronal response failures , 2015, Front. Neural Circuits.

[44]  S. Havlin,et al.  Scale-free networks are ultrasmall. , 2002, Physical review letters.

[45]  Fan Chung Graham,et al.  Duplication Models for Biological Networks , 2002, J. Comput. Biol..

[46]  S. Redner,et al.  Organization of growing random networks. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[47]  S. Redner,et al.  Structural Transitions in Densifying Networks. , 2016, Physical review letters.

[48]  Si Li,et al.  Degree distribution of large networks generated by the partial duplication model , 2013, Theor. Comput. Sci..

[49]  Jari Saramäki,et al.  A comparative study of social network models: Network evolution models and nodal attribute models , 2008, Soc. Networks.

[50]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[51]  Reuven Cohen,et al.  Complex Networks: Structure, Robustness and Function , 2010 .

[52]  S. Redner,et al.  Network growth by copying. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[53]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[54]  Michael Golosovsky,et al.  Stochastic dynamical model of a growing network based on self-exciting point process , 2012, Physical review letters.

[55]  Colin Cooper,et al.  The degree distribution of the generalized duplication model , 2006, Theor. Comput. Sci..

[56]  Claudio Castellano,et al.  Universality of citation distributions: Toward an objective measure of scientific impact , 2008, Proceedings of the National Academy of Sciences.

[57]  Steve Pressé,et al.  Nonuniversal power law scaling in the probability distribution of scientific citations , 2010, Proceedings of the National Academy of Sciences.

[58]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[59]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[60]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[61]  S. Redner,et al.  Densification and structural transitions in networks that grow by node copying. , 2016, Physical review. E.

[62]  Panos M. Pardalos,et al.  Quantification of network structural dissimilarities , 2017, Nature Communications.

[63]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[64]  M. Newman,et al.  Random graphs with arbitrary degree distributions and their applications. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[65]  O. Biham,et al.  Distribution of shortest path lengths in subcritical Erdős-Rényi networks. , 2018, Physical review. E.

[66]  Alexander K. Hartmann,et al.  Distribution of diameters for Erdős-Rényi random graphs. , 2017, Physical review. E.