Computational Intractability Generates the Topology of Biological Networks

Virtually all molecular interactions networks, independent of organism and physiological context, have majority-leaves minority-hubs (mLmH) topology. Current generative models of this topology are based on controversial hypotheses that, controversy aside, demonstrate sufficient but not necessary evolutionary conditions for its emergence. Here we show that the circumvention of computational intractability provides sufficient and (assuming P!=NP) necessary conditions for the emergence of the mLmH property. Evolutionary pressure on molecular interaction networks is simulated by randomly labelling some interactions as 'beneficial' and others 'detrimental'. Each gene is subsequently given a benefit (damage) score according to how many beneficial (detrimental) interactions it is projecting onto or attracting from other genes. The problem of identifying which subset of genes should ideally be conserved and which deleted, so as to maximize (minimize) the total number of beneficial (detrimental) interactions network-wide, is NP-hard. An evolutionary algorithm that simulates hypothetical instances of this problem and selects for networks that produce the easiest instances leads to networks that possess the mLmH property. The degree distributions of synthetically evolved networks match those of publicly available experimentally-validated biological networks from many phylogenetically-distant organisms.

[1]  J. Doyle,et al.  Some protein interaction data do not exhibit power law statistics , 2005, FEBS letters.

[2]  Bas Teusink,et al.  A critical view of metabolic network adaptations , 2009, HFSP journal.

[3]  Bridget E. Begg,et al.  A Proteome-Scale Map of the Human Interactome Network , 2014, Cell.

[4]  Mark Gerstein,et al.  Interpretation of Genomic Variants Using a Unified Biological Network Approach , 2013, PLoS Comput. Biol..

[5]  David Pisinger,et al.  Where are the hard knapsack problems? , 2005, Comput. Oper. Res..

[6]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[7]  Canglin Wu,et al.  RegNetwork: an integrated database of transcriptional and post-transcriptional regulatory networks in human and mouse , 2015, Database J. Biol. Databases Curation.

[8]  M. Porter,et al.  Critical Truths About Power Laws , 2012, Science.

[9]  David Z. Chen,et al.  Architecture of the human regulatory network derived from ENCODE data , 2012, Nature.

[10]  Shane J. Neph,et al.  Circuitry and Dynamics of Human Transcription Factor Regulatory Networks , 2012, Cell.

[11]  Alexander D. Johnson,et al.  Making Sense of Transcription Networks , 2015, Cell.

[12]  Jonathan D. G. Jones,et al.  Evidence for Network Evolution in an Arabidopsis Interactome Map , 2011, Science.

[13]  A. Vespignani,et al.  Modeling of Protein Interaction Networks , 2001, Complexus.

[14]  M. Lynch The frailty of adaptive hypotheses for the origins of organismal complexity , 2007, Proceedings of the National Academy of Sciences.

[15]  Jung Eun Shim,et al.  TRRUST: a reference database of human transcriptional regulatory interactions , 2015, Scientific Reports.

[16]  César A. Hidalgo,et al.  Proto-genes and de novo gene birth , 2012, Nature.

[17]  Matjaz Perc,et al.  The Matthew effect in empirical data , 2014, Journal of The Royal Society Interface.

[18]  M. Lynch The evolution of genetic networks by non-adaptive processes , 2007, Nature Reviews Genetics.

[19]  John Doyle,et al.  Contrasting Views of Complexity and Their Implications For Network-Centric Infrastructures , 2010, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[20]  Matthew W. Hahn,et al.  Molecular Evolution in Large Genetic Networks: Does Connectivity Equal Constraint? , 2004, Journal of Molecular Evolution.

[21]  A. E. Hirsh,et al.  Evolutionary Rate in the Protein Interaction Network , 2002, Science.

[22]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[23]  K. Gunsalus,et al.  Empirically controlled mapping of the Caenorhabditis elegans protein-protein interactome network , 2009, Nature Methods.

[24]  Albert-László Barabási,et al.  Error and attack tolerance of complex networks , 2000, Nature.

[25]  David J. Galas,et al.  A duplication growth model of gene expression networks , 2002, Bioinform..

[26]  Raya Khanin,et al.  How Scale-Free Are Biological Networks , 2006, J. Comput. Biol..

[27]  Yanhui Hu,et al.  Integrating protein-protein interaction networks with phenotypes reveals signs of interactions , 2013, Nature Methods.

[28]  D. Turcotte,et al.  Self-organized criticality , 1999 .

[29]  J M Carlson,et al.  Highly optimized tolerance: a mechanism for power laws in designed systems. , 1999, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[30]  Gipsi Lima-Mendez,et al.  The powerful law of the power law and other myths in network biology. , 2009, Molecular bioSystems.

[31]  Gloria M. Sheynkman,et al.  Widespread Expansion of Protein Interaction Capabilities by Alternative Splicing , 2016, Cell.

[32]  P. Uetz,et al.  The binary protein-protein interaction landscape of Escherichia coli , 2014, Nature Biotechnology.

[33]  Evelyn Fox Keller,et al.  Revisiting "scale-free" networks. , 2005, BioEssays : news and reviews in molecular, cellular and developmental biology.

[34]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008, Proceedings of the Python in Science Conference.

[35]  A. Barabasi,et al.  High-Quality Binary Protein Interaction Map of the Yeast Interactome Network , 2008, Science.

[36]  Jan O. Korbel,et al.  Positive selection at the protein network periphery: Evaluation in terms of structural constraints and cellular context , 2007, Proceedings of the National Academy of Sciences.

[37]  Fabio Rinaldi,et al.  RegulonDB version 9.0: high-level integration of gene regulation, coexpression, motif clustering and beyond , 2015, Nucleic Acids Res..

[38]  Mark E. J. Newman,et al.  Power-Law Distributions in Empirical Data , 2007, SIAM Rev..