Exploring biological network structure with clustered random networks

BackgroundComplex biological systems are often modeled as networks of interacting units. Networks of biochemical interactions among proteins, epidemiological contacts among hosts, and trophic interactions in ecosystems, to name a few, have provided useful insights into the dynamical processes that shape and traverse these systems. The degrees of nodes (numbers of interactions) and the extent of clustering (the tendency for a set of three nodes to be interconnected) are two of many well-studied network properties that can fundamentally shape a system. Disentangling the interdependent effects of the various network properties, however, can be difficult. Simple network models can help us quantify the structure of empirical networked systems and understand the impact of various topological properties on dynamics.ResultsHere we develop and implement a new Markov chain simulation algorithm to generate simple, connected random graphs that have a specified degree sequence and level of clustering, but are random in all other respects. The implementation of the algorithm (ClustRNet: Clustered Random Networks) provides the generation of random graphs optimized according to a local or global, and relative or absolute measure of clustering. We compare our algorithm to other similar methods and show that ours more successfully produces desired network characteristics.Finding appropriate null models is crucial in bioinformatics research, and is often difficult, particularly for biological networks. As we demonstrate, the networks generated by ClustRNet can serve as random controls when investigating the impacts of complex network features beyond the byproduct of degree and clustering in empirical networks.ConclusionClustRNet generates ensembles of graphs of specified edge structure and clustering. These graphs allow for systematic study of the impacts of connectivity and redundancies on network function and dynamics. This process is a key step in unraveling the functional consequences of the structural properties of empirical biological systems and uncovering the mechanisms that drive these systems.

[1]  Alessandro Flammini,et al.  Characterization and modeling of protein–protein interaction networks , 2005 .

[2]  Jean-Loup Guillaume,et al.  Bipartite graphs as models of complex networks , 2006 .

[3]  Garry Robins,et al.  An introduction to exponential random graph (p*) models for social networks , 2007, Soc. Networks.

[4]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Mark Newman,et al.  Detecting community structure in networks , 2004 .

[6]  R. Tsien,et al.  Specificity and Stability in Topology of Protein Networks , 2022 .

[7]  D. Gale A theorem on flows in networks , 1957 .

[8]  S. Shen-Orr,et al.  Superfamilies of Evolved and Designed Networks , 2004, Science.

[9]  Christos Gkantsidis,et al.  The Markov Chain Simulation Method for Generating Connected Power Law Random Graphs , 2003, ALENEX.

[10]  M E J Newman Assortative mixing in networks. , 2002, Physical review letters.

[11]  Bruce A. Reed,et al.  A Critical Point for Random Graphs with a Given Degree Sequence , 1995, Random Struct. Algorithms.

[12]  M. Newman,et al.  Random graphs with arbitrary degree distributions and their applications. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[13]  Neo D. Martinez Artifacts or Attributes? Effects of Resolution on the Little Rock Lake Food Web , 1991 .

[14]  M. Newman,et al.  Network theory and SARS: predicting outbreak diversity , 2004, Journal of Theoretical Biology.

[15]  P. Yodzis,et al.  The Compartmentation of Real and Assembled Ecosystems , 1982, The American Naturalist.

[16]  John H. Lawton,et al.  ARE FOOD WEBS DIVIDED INTO COMPARTMENTS , 1980 .

[17]  P. Holme,et al.  Exploring the assortativity-clustering space of a network's degree sequence. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[18]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Massimo Marchiori,et al.  Error and attacktolerance of complex network s , 2004 .

[20]  A. Wagner The yeast protein interaction network evolves rapidly and contains few redundant duplicate genes. , 2001, Molecular biology and evolution.

[21]  Ricard V. Solé,et al.  MODULARITY IN BIOLOGICAL NETWORKS , 2007 .

[22]  S H Strogatz,et al.  Random graph models of social networks , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[23]  M. Newman Properties of highly clustered networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[24]  Marián Boguñá,et al.  Tuning clustering in random networks with arbitrary degree distributions. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[25]  Albert-László Barabási,et al.  Hierarchical organization in complex networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[26]  K. Sneppen,et al.  Specificity and Stability in Topology of Protein Networks , 2002, Science.

[27]  Marián Boguñá,et al.  Clustering in complex networks. I. General formalism. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[28]  M. Newman Random Graphs as Models of Networks , 2002, cond-mat/0202208.

[29]  L. Meyers,et al.  When individual behaviour matters: homogeneous and network models in epidemiology , 2007, Journal of The Royal Society Interface.

[30]  M. Keeling,et al.  Networks and epidemic models , 2005, Journal of The Royal Society Interface.

[31]  Michalis Faloutsos,et al.  On power-law relationships of the Internet topology , 1999, SIGCOMM '99.

[32]  M. Keeling The implications of network structure for epidemic dynamics. , 2005, Theoretical population biology.

[33]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[34]  M. Newman Erratum: Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality (Physical Review e (2001) 64 (016132)) , 2006 .

[35]  Alessandro Vespignani,et al.  Epidemic spreading in complex networks with degree correlations , 2003, cond-mat/0301149.

[36]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[37]  M. Keeling,et al.  The effects of local spatial structure on epidemiological invasions , 1999, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[38]  Albert-László Barabási,et al.  Internet: Diameter of the World-Wide Web , 1999, Nature.

[39]  János Kertész,et al.  Clustering in complex networks , 2004 .

[40]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[41]  Steven M. Goodreau,et al.  Advances in exponential random graph (p*) models applied to a large social network , 2007, Soc. Networks.

[42]  Neo D. Martinez,et al.  Food-web structure and network theory: The role of connectance and size , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[43]  J. P. Trapman,et al.  On stochastic models for the spread of infections , 2006 .

[44]  M. Newman,et al.  Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[45]  A Vázquez,et al.  The topological relationship between the large-scale attributes and local interaction patterns of complex networks , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[46]  S. Shen-Orr,et al.  Networks Network Motifs : Simple Building Blocks of Complex , 2002 .

[47]  M. Newman,et al.  Reply to ``Comment on `Subgraphs in random networks' '' , 2004 .

[48]  M. Newman,et al.  Why social networks are different from other types of networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[49]  Sarel J Fleishman,et al.  Comment on "Network Motifs: Simple Building Blocks of Complex Networks" and "Superfamilies of Evolved and Designed Networks" , 2004, Science.

[50]  J. Hopfield,et al.  From molecular to modular cell biology , 1999, Nature.

[51]  E. Volz,et al.  Random networks with tunable degree distribution and clustering. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[52]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[53]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[54]  Caroline C. Friedel,et al.  Influence of degree correlations on network structure and stability in protein-protein interaction networks , 2007, BMC Bioinformatics.

[55]  Frank Harary,et al.  Graph Theory , 2016 .

[56]  P. Pattison,et al.  New Specifications for Exponential Random Graph Models , 2006 .

[57]  Anat Kreimer,et al.  The evolution of modularity in bacterial metabolic networks , 2008, Proceedings of the National Academy of Sciences.

[58]  Albert-László Barabási,et al.  Error and attack tolerance of complex networks , 2000, Nature.

[59]  U. Alon,et al.  Spontaneous evolution of modularity and network motifs. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[60]  Thomas Petermann,et al.  Role of clustering and gridlike ordering in epidemic spreading. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[61]  J. Montoya,et al.  Small world patterns in food webs. , 2002, Journal of theoretical biology.

[62]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[63]  A. Vázquez,et al.  Network clustering coefficient without degree-correlation biases. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.